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BACKGROUND OF THE INVENTION 



Classical methods for the identification of microorganisms 

Microorganisms are classically identified by their ability to utilize different 
substrates as a source of carbon and nitrogen through the use of biochemical tests 
such as the API20E™ system (bioMerieux). For susceptibility testing, clinical 
microbiology laboratories use methods including disk diffusion, agar dilution and 
broth microdilution. Although identifications based on biochemical tests and 
antibacterial susceptibility tests are cost-effective, generally two days are required 
to obtain preliminary results due to the necessity of two successive overnight 
incubations to identify the bacteria from clinical specimens as well as to determine 
their susceptibility to antimicrobial agents. There are some commercially available 
automated systems (i.e. the MicroScan™ system from Dade Behring and the 
Vitek™ system from bioMerieux) which use sophisticated and expensive 
apparatus for faster microbial identification and susceptibility testing (Stager and 
Davis, 1992, Clin. Microbiol. Rev. 5:302-327). These systems require shorter 
incubation periods, thereby allowing most bacterial identifications and 
susceptibility testing to be performed in less than 6 hours. Nevertheless, these 
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faster systems always require the primary isolation of the bacteria or fungi as a 
pure culture, a process which takes at least 18 hours for a pure culture or 2 days for 
a mixed culture. So, the shortest time from sample reception to identification of the 
pathogen is around 24 hours. Moreover, fungi other than yeasts are often difficult 
or very slow to grow from clinical specimens. Identification must rely on labor- 

intensive techniques such as direct microscopic examination of the specimens-and 

by direct and/or indirect immunological assays. Cultivation of most parasites is 
impractical in the clinical laboratory. Hence, microscopic examination of the 
specimen, a few immunological tests and clinical symptoms are often the only 
methods used for an identification that frequently remains presumptive. 

The fastest bacterial identification system, the autoSCAN-Walk-Away™ 
system (Dade Behring) identifies both gram-negative and gram-positive bacterial 
species from standardized inoculum in as little as 2 hours and gives susceptibility 
patterns to most antibiotics in 5 to 6 hours. However, this system has a particularly 
high percentage (i.e. 3.3 to 40.5%) of non-conclusive identifications with bacterial 
species other than Enterobacteriaceae (Croize J., 1995, Lett. Infectiol. 10:109-113; 
York et al y 1992, J. Clin. Microbiol. 30:2903-2910). For Enterobacteriaceae, the 
percentage of non-conclusive identifications was 2.7 to 11.4%. The list of 
microorganisms identified by commercial systems based on classical identification 
methods is given in Table 15. 

A wide variety of bacteria and fungi are routinely isolated and identified from 
clinical specimens in microbiology laboratories. Tables 1 and 2 give the incidence 
for the most commonly isolated bacterial and fungal pathogens from various types 
of clinical specimens. These pathogens are the main organisms associated with 
nosocomial and community-acquired human infections and are therefore 
considered the most clinically important. 

Clinical specimens tested in clinical microbiology laboratories 
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Most clinical specimens received in clinical microbiology laboratories are 
urine and blood samples. At the microbiology laboratory of the Centre Hospitalier 
de TUniversite Laval (CHUL), urine and blood account for approximately 55% 
and 30% of the specimens received, respectively (Table 3). The remaining 15% of 
clinical specimens comprise various biological fluids including sputum, pus, 
-cerebrospinal fluid,-synoviaLfluid,- and-others (Table 3). Infections-of the urinary- 
tract, the respiratory tract and the bloodstream are usually of bacterial etiology and 
require antimicrobial therapy. In fact, all clinical samples received in the clinical 
microbiology laboratory are tested routinely for the identification of bacteria and 
antibiotic susceptibility. 

Conventional pathogen identification from clinical specimens 
Urine specimens 

The search for pathogens in urine specimens is so preponderant in the routine 
microbiology laboratory that a myriad of tests have been developed. However, the 
gold standard remains the classical semi-quantitative plate culture method in which 
1 jiL of urine is streaked on agar plates and incubated for 18-24 hours. Colonies 
are then counted to determine the total number of colony forming units (CFU) per 
liter of urine. A bacterial urinary tract infection (UTI) is normally associated with a 

bacterial count of 10 7 CFU/L or more in urine. However, infections with less than 

10 7 CFU/L in urine are possible, particularly in patients with a high incidence of 
diseases or those catheterized (Stark and Maki, 1984, N. Engl. J. Med. 311:560- 
564). Importantly, approximately 80% of urine specimens tested in clinical 
microbiology laboratories are considered negative (i.e. bacterial count of less than 
10 7 CFU/L; Table 3). Urine specimens found positive by culture are further 
characterized using standard biochemical tests to identify the bacterial pathogen 
and are also tested for susceptibility to antibiotics. The biochemical and 
susceptibility testing normally require 18-24 hours of incubation. 
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Accurate and rapid urine screening methods for bacterial pathogens would 
allow a faster identification of negative specimens and a more efficient treatment 
and care management of patients. Several rapid identification methods 
(Uriscreen™, UTIscreen™, Flash Track™ DNA probes and others) have been 
compared to slower standard biochemical methods, which are based on culture of 

— the bacterial-pathogens. -Although much-faster, -these -rapid- tests-showed low 

sensitivities and poor specificities as well as a high number of false negative and 
false positive results (Koening et a/., 1992, J, Clin. Microbiol. 30:342-345; Pezzlo 
et al, 1992, J. Clin. Microbiol. 30:640-684). 

Blood specimens 

The blood specimens received in the microbiology laboratory are always 
submitted for culture. Blood culture systems may be manual, semi-automated or 
completely automated. The BACTEC™ system (from Becton Dickinson) and the 
BacTAlert™ system (from Organon Teknika Corporation) are the two most widely 
used automated blood culture systems. These systems incubate blood culture 
bottles under optimal conditions for growth of most bacteria. Bacterial growth is 
monitored continuously to detect early positives by using highly sensitive bacterial 
growth detectors. Once growth is detected, a Gram stain is performed directly from 
the blood culture and then used to inoculate nutrient agar plates. Subsequently, 
bacterial identification and susceptibility testing are carried out from isolated 
bacterial colonies with automated systems as described previously. Blood culture 
bottles are normally reported as negative if no growth is detected after an 
incubation of 6 to 7 days. Normally, the vast majority of blood cultures are 
reported negative. For example, the percentage of negative blood cultures at the 
microbiology laboratory of the CHUL for the period February 1994-January 1995 
' was 93.1% (Table 3). 

Other clinical samples 
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Upon receipt by the clinical microbiology laboratory, all body fluids other 
than blood and urine that are from normally sterile sites (i.e. cerebrospinal, 
synovial, pleural, pericardial and others) are processed for direct microscopic 
examination and subsequent culture. Again, most clinical samples are negative for 
culture (Table 3). In all these normally sterile sites, tests for the universal detection 
— of algae, arehaea, bacteria v fungi and parasites-would be very useful-. 

Regarding clinical specimens which are not from sterile sites such as sputum 
or stool specimens, the laboratory diagnosis by culture is more problematic 
because of the contamination by the normal flora. The bacterial or fungal 
pathogens potentially associated with the infection are grown and separated from 
the colonizing microbes using selective methods and then identified as described 
previously. Of course, the DNA-based universal detection of bacteria would not be 
useful for the diagnosis of bacterial infections at these non-sterile sites. On the 
other hand, DNA-based assays for species or genus or family or group detection 
and identification as well as for the detection of antimicrobial agents resistance 
genes from these specimens would be very useful and would offer several 
advantages over classical identification and susceptibility testing methods. 

DNA-based assays with any specimen 

There is an obvious need for rapid and accurate diagnostic tests for the 
detection and identification of algae, arehaea, bacteria, fungi and parasites directly 
from clinical specimens. DNA-based technologies are rapid and accurate and offer 
a great potential to improve the diagnosis of infectious diseases (Persing et a/., 
1993, Diagnostic Molecular Microbiology: Principles and Applications, American 
Society for Microbiology, Washington, D.C.; Bergeron and Ouellette, 1995, 
Infection 23:69-72; Bergeron and Ouellette, 1998, J Clin Microbiol. 36:2169-72). 
The DNA probes and amplification primers which are objects of the present 
invention are applicable for the detection and identification of algae, arehaea, 
bacteria, fungi, and parasites directly from any clinical specimen such as blood, 
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urine, sputum, cerebrospinal fluid, pus, genital and gastro-intestinal tracts, skin or 
any other type of specimens (Table 3). These assays are also applicable to 
detection from microbial cultures (e.g. blood cultures, bacterial or fungal colonies 
on nutrient agar, or liquid cell cutures in nutrient broth). The DNA-based tests 
proposed in this invention are superior in terms of both rapidity and accuracy to 

standard^biochemical .methods __currently„.used „for_ routine diagnosis^ from any 

clinical specimens in microbiology laboratories. Since these tests can be performed 
in one hour or less, they provide the clinician with new diagnostic tools which 
should contribute to a better management of patients with infectious diseases. 
Specimens from sources other than humans (e.g. other primates, birds, plants, 
mammals, farm animals, livestock, food products, environment such as water or 
soil, and others) may also be tested with these assays. 

A high percentage of culture-negative specimens 

Among all the clinical specimens received for routine diagnosis, 
approximately 80% of urine specimens and even more (around 95%) for other 
types of normally sterile clinical specimens are negative for the presence of 
bacterial pathogens (Table 3). It would also be desirable, in addition to identify 
bacteria at the species or genus or family or group level in a given specimen, to 
screen out the high proportion of negative clinical specimens with a DNA-based 
test detecting the presence of any bacterium (i.e. universal bacterial detection). As 
disclosed in the present invention, such a screening test may be based on DNA 
amplification by PCR of a highly conserved genetic target found in all bacteria. 
Specimens negative for bacteria would not be amplified by this assay. On the other 
hand, those that are positive for any bacterium would give a positive amplification 
signal. Similarly, highly conserved genes of fungi and parasites .could serve not 
only to identify particular species or genus or family or group but also to detect the 
presence of any fungi or parasite in the specimen. 
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Towards the development of rapid DNA-based diagnostic tests 

A rapid diagnostic test should have a significant impact on the management of 
infections. DNA probe and DNA amplification technologies offer several 
advantages over conventional methods for the identification of pathogens and 
antin^crobial agents resistance genes from clinical samples (Persing et_ al ± 1993, 
Diagnostic Molecular Microbiology: Principles and Applications, American 
Society for Microbiology, Washington, D.C.; Ehrlich and Greenberg, 1994, PCR- 
based Diagnostics in Infectious Disease, Blackwell Scientific Publications, Boston, 
MA). There is no need for culture of the pathogens, hence the organisms can be 
detected directly from clinical samples, thereby reducing the time associated with 
the isolation and identification of pathogens. Furthermore, DNA-based assays are 
more accurate for microbial identification than currently used phenotypic 
identification systems which are based on biochemical tests and/or microscopic 
examination. Commercially available DNA-based technologies are currently used 
in clinical microbiology laboratories, mainly for the detection and identification of 
fastidious bacterial pathogens such as Mycobacterium tuberculosis, Chlamydia 
trachomatis, Neisseria gonorrhoeae as well as for the detection of a variety of 
viruses (Tang Y. and Persing D. H., Molecular detection and identification of 
microorganisms, In: P. Murray et al, 1999, Manual of Clinical Microbiology, 
ASM press, 7 th edition, Washington D.C.). There are also other commercially 
available DNA-based assays which are used for culture confirmation assays. 

Others have developed DNA-based tests for the detection and identification of 
bacterial pathogens which are objects of the present invention, for example: 
Staphylococcus sp. (US patent serial no. 5,437,978), Neisseria sp. (US patent serial 
no. 5,162,199 and European patent serial no. 0,337,896,131) and Listeria 
monocytogenes (US patent serial nos. 5,389,513 and 5,089,386). However, the 
diagnostic tests described in these patents are based either on rRNA genes or on 
genetic targets different from those described in the present invention. To our 
knowledge there are only four patents published by others mentioning the use of 
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any of the four highly conserved gene targets described in the present invention for 
diagnostic purposes (PCT international publication number WO92/03455 and 
WO00/14274, European patent publication number 0 133 671 Bl, and European 
patent publication number 0 133 288 A2). WO92/03455 is focused on the 
inhibition of Candida species for therapeutic purposes. It describes antisense 
pJigonucleotide_ probes hybridjzing to Candida^ messenger RNA. _Two o£ the 
numerous mRNA proposed as targets are coding for translation elongation factor 1 
(tefl) and the beta subunit of ATPase. DNA amplification or hybrization are not 
under the scope of their invention and although diagnostic use is briefly mentioned 
in the body of the application, no specific claim is made regarding diagnostics. 
WO00/14274 describes the use of bacterial recA gene for identification and 
speciation of bacteria of the Burkholderia cepacia complex. Specific claims are 
made on a method for obtaining nucleotide sequence information for the recA gene 
from the target bacteria and a following comparison with a standard library of 
nucleotide sequence information (claim 1), and on the use of PCR for amplification 
of the recA gene in a sample of interest (claims 4 to 7, and 13). However, the use 
of a discriminatory restriction enzyme in a RFLP procedure is essential to fulfill 
the speciation and WO00/14274 did not mention that multiple recA probes could 
be used simultaneously. Patent EP 0 133 288 A2 describes and claims the use of 
bacterial tuf (and fus) sequence for diagnostics based on hybridization of a /a/ (or 
fus) probe with bacterial DNA. DNA amplification is not under the scope of EP 0 
133 288 A2. Nowhere it is mentioned that multiple fa/ (or fus) probes could be 
used simultaneously. No mention is made regarding speciation using tuf (or fus) 
DNA nucleic acids and/or sequences. The sensitivities of the tuf hybrizations 
reported are lxlO 6 bacteria or 1-100 ng of DNA. This is much less sensitive than 
what is achieved by our assays using nucleic acid amplification technologies. 

Although there are phenotypic identification methods which have been used 
for more than 125 years in clinical microbiology laboratories, these methods do not 
provide information fast enough to be useful in the initial management of patients. 
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There is a need to increase the speed of the diagnosis of commonly encountered 
bacterial, ftmgal and parasitical infections. Besides being much faster, DNA-based 
diagnostic tests are more accurate than standard biochemical tests presently used 
for diagnosis because the microbial genotype (e.g. DNA level) is more stable than 
the phenotype (e.g. physiologic level). 

Bacteria, fungi and parasites encompass numerous well-known microbial 

pathogens. Other microorganisms could also be pathogens or associated with 
human diseases. For example, achlorophylious algae of the Prototheca genus can 
infect humans. Archae, especially methanogens, are present in the gut flora of 
humans (Reeve, J.H., 1999, J. Bacteriol. 181:3613-3617). However, methanogens 
have been associated to pathologic manifestations in the colon, vagina, and mouth 
(Belay et al, 1988, Appl. Enviro. Microbiol. 54:600-603; Belay et ai, 1990, J. 
Clin. Microbiol. 28:1666-1668; Weaver et ai, 1986, Gut 27:698-704). 

In addition to the identification of the infectious agent, it is often desirable to 
identify harmful toxins and/or to monitor the sensitivity of the microorganism to 
antimicrobial agents. As revealed in this invention, genetic identification of the 
microorganism could be performed simultaneously with toxin and antimicrobial 
agents resistance genes. 

Knowledge of the genomic sequences of algal, archaeal, bacterial, fungal 
and parasitical species continuously increases as testified by the number of 
sequences available from public databases such as GenBank. From the sequences 
readily available from those public databases, there is no indication therefrom as to 
their potential for diagnostic purposes. For determining good candidates for 
diagnostic purposes, one could select sequences for DNA-based assays for (i) the 
species-specific detection and identification of commonly encountered bacterial, 
fungal and parasitical pathogens, (ii) the genus-specific detection and identification 
of commonly encountered bacterial, fungal or parasitical pathogens, (iii) the 
family-specific detection and identification of commonly encountered bacterial, 
fungal or parasitical pathogens, (iv) the group-specific detection and identification 
of commonly encountered bacterial, fungal or parasitical pathogens, (v) the 
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universal detection of algal, archaeal, bacterial, fungal or parasitical pathogens, 
and/or (vi) the specific detection and identification of antimicrobial agents 
resistance genes, and/or (vii) the specific detection and identification of bacterial 
toxin genes. All of the above types of DNA-based assays may be performed 
directly from any type of clinical specimens or from a microbial culture. 
In - our- assigned -U.S.- patent 6,001,564 and our WO98/20157- patent- 
publication, we described DNA sequences suitable for (i) the species-specific 
detection and identification of clinically important bacterial pathogens, (ii) the 
universal detection of bacteria, and (iii) the detection of antimicrobial agents 
resistance genes. 

The WO98/20157 patent publication describes proprietary fa/DNA sequences 
as well as tuf sequences selected from public databases (in both cases, fragments of 
at least 100 base pairs), as well as oligonucleotide probes and amplification 
primers derived from these sequences. All the nucleic acid sequences described in 
that patent publication can enter in the composition of diagnostic kits or products 
and methods capable of a) detecting the presence of bacteria and fungi b) detecting 
specifically at the species, genus, family or group levels, the presence of bacteria 
and fungi and antimicrobial agents resistance genes associated with these 
pathogens. However, these methods and kits need to be improved, since the ideal 
kit and method should be capable of diagnosing close to 100% of microbial 
pathogens and associated antimicrobial agents resistance genes and toxins genes. 
For example, infections caused by Enterococcus faecium have become a clinical 
problem because of its resistance to many antibiotics. Both the detection of these 
bacteria and the evaluation of their resistance profiles are desirable. Besides that, 
novel DNA sequences (probes and primers) capable of recognizing the same and 
other microbial pathogens or the same and additional antimicrobial agents 
resistance genes are also desirable to aim at detecting more target genes and 
complement our earlier patent applications. 

The present invention improves the assigned application by disclosing new 
proprietary tuf nucleic acids and/or sequences as well as describing new ways to 
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obtain tuf nucleic acids and/or sequences. In addition we disclose new proprietary 
atpD and recA nucleic acids and/or sequences. In addition, new uses of tuf atpD 
and recA DNA nucleic acids and/or sequences selected from public databases 
(Table 1 1) are disclosed. 

Highly * conserved ' genes for identification and diagnostics 

Highly conserved genes are useful for identification of microorganisms. For 
bacteria, the most studied genes for identification of microorganisms are the 
universally conserved ribosomal RNA genes (rRNA). Among those, the principal 
targets used for identification purposes are the small subunit (SSU) ribosomal 16S 
rRNA genes (in prokaryotes) and 18S rRNA genes (in eukaryotes) (Relman and 
Persing, Genotyping Methods for Microbial Identification, In: D.H. Persing, 1996, 
PCR Protocols for Emerging Infectious Diseases, ASM Press, Washington D.C.). 
The rRNA genes are also the most commonly used targets for universal detection 
of bacteria (Chen et aU 1988, FEMS Microbiol. Lett. 57:19-24; McCabe et al. 9 
1999, Mol. Genet. Metabol. 66:205-21 1) and fungi (Van Burik et aU 1998, J. Clin. 
Microbiol 36:1169-1175). 

However, it may be difficult to discriminate between closely related species 
when using primers derived from the 16S rRNA. In some instances, 16S rRNA 
sequence identity may not be sufficient to guarantee species identity (Fox et aL> 
1992, Int. J. Syst. Bacteriol. 42:166-170) and it has been shown that inter-operon 
sequence variation as well as strain to strain variation could undermine the 
application of 16S rRNA for identification purposes (Clayton et ah, 1995, Int. J. 
Syst. Bacteriol. 45:595-599). The heat shock proteins (HSP) are another family of 
very conserved proteins. These ubiquitous proteins in bacteria and eukaryotes are 
expressed in answer to external stress agents. One of the most described of these 
HSP is HSP 60. This protein is very conserved at the amino acid level, hence it has 
been useful for phylogenetic studies. Similar to 16S rRNA, it would be difficult to 
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discriminate between species using the HSP 60 nucleotide sequences as a 
diagnostic tool. However, Goh et al identified a highly conserved region flanking 
a variable region in HSP 60, which led to the design of universal primers 
amplifying this variable region (Goh et al, US patent serial no. 5,708,160). The 
sequence variations in the resulting amplicons were found useful for the design of 
species-specific "assays." 



SUMMARY OF THE INVENTION 



It is an object of the present invention to provide a specific, ubiquitous and 
sensitive method using probes and/or amplification primers for determining the 
presence and/or amount of nucleic acids: 

- from any algal, archaeal, bacterial, fungal or parasitical species 

in any sample suspected of containing said nucleic acids, and optionally, 

- from specific microbial species or genera selected from the group 
consisting of the species or genera listed in Table 4, and optionally, 

- from an antimicrobial agents resistance gene selected from the group 
consisting of the genes listed in Table 5, and optionally, 

- from a toxin gene selected from the group consisting of the genes listed in 
Table 6, 

wherein each of said nucleic acids or a variant or part thereof comprises a 
selected target region hybridizable with said probes or primers; 

said method comprising the steps of contacting said sample with said probes 
or primers and detecting the presence and/or amount of hybridized probes or 
amplified products as an indication of the presence and/or amount of said any 
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microbial species, specific microbial species or genus or family or group and 
antimicrobial agents resistance gene and/or toxin gene. 

In a specific embodiment, a similar method directed to each specific 
microbial species or genus or family or group detection and identification, 
antimicrobial agents resistance genes detection, toxin genes detection, and 
universal bacterial detection, separately, is provided. 

In a more specific embodiment, the method makes use of DNA fragments 
from conserved genes (proprietary sequences and sequences obtained from public 
databases), selected for their capacity to sensitively, specifically and ubiquitously 
detect the targeted algal, archaeal, bacterial, fiingal or parasitical nucleic acids. 

In a particularly preferred embodiment, oligonucleotides of at least 12 
nucleotides in length have been derived from the longer DNA fragments, and are 
used in the present method as probes or amplification primers. To be a good 
diagnostic candidate, an oligonucleotide of at least 12 nucleotides should be 
capable of hybridizing with nucleic acids from given microorganism(s), and with 
substantially all strains and representatives of said microorganism(s); said 
oligonucleotide being species-, or genus-, or family-, or group-specific or 
universal. 

In another particularly preferred embodiment, oligonucleotides primers and 
probes of at least 12 nucleotides in length are designed for their specificity and 
ubiquity based upon analysis of our databases of/w/ atpD and recA sequences. 
These databases are generated using both proprietary and public sequence 
information. Altogether, these databases form a sequence repertory useful for the 
design of primers and probes for the detection and identification ofalgal, archaeal, 
bacterial, fungal and parasitical microorganisms. The repertory can also be 
subdivided into subrepertories for sequence analysis leading to the design of 
various primers and probes. 

The tuf, atpD and recA sequences databases as a product to assist the design 
of oligonucleotides primers and probes for the detection and identification ofalgal, 
archaeal, bacterial, fungal and parasitical microorganisms are also covered. 
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The proprietary oligonucleotides (probes and primers) are also another 
object of this invention. 

Diagnostic kits comprising probes or amplification primers such as those for 
the detection of a microbial species or genus or family or phylum or group selected 
from the following list consisting of Abiotrophia adiacens, Acinetobacter 

baumanii r -Actinomycetae r -Bacteroides,- Cytophaga—and—Flexibacter -phylum, 

Bacteroides fragilis, Bordetella pertussis, Bordetella sp., Campylobacter jejuni 
and C coli, Candida albicans, Candida dubliniensis, Candida glabrata, Candida 
guilliermondii, Candida krusei, Candida lusitaniae, Candida parapsilosis, 
Candida tropicalis, Candida zeylanoides, Candida sp., Chlamydia pneumoniae, 
Chlamydia trachomatis, Clostridium sp., Corynebacterium sp., Crypococcus 
neoformans, Cryptococcus sp., Cryptosporidium parvum, Entamoeba sp., 
Enterobacteriaceae group, Enterococcus casseliflavus-flavescens-gallinarum 
group, Enterococcus faecalis, Enterococcus faecium, Enterococcus gallinarum, 
Enterococcus sp., Escherichia coli and Shigella sp. group, Gemella sp., Giardia 
sp., Haemophilus influenzae, Klebsiella pneumoniae, Legionella pneumophila, 
Legionella sp., Leishmania sp., Mycobacteriaceae family, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, platelets contaminants group (see Table 14), 
Pseudomonas aeruginosa, Pseudomonads group, Staphylococcus aureus, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Staphylococcus 
hominis, Staphylococcus saprophyticus, Staphylococcus sp., Streptococcus 
agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus sp., 
Trypanosoma brucei, Trypanosoma cruzi, Trypanosoma sp„ Trypanosomatidae 
family, are also objects of the present invention. 

Diagnostic kits further comprising probes or amplification primers for the 
detection of an antimicrobial agents resistance gene selected from the group listed 
in Table 5 are also objects of this invention. 

Diagnostic kits further comprising probes or amplification primers for the 
detection of a toxin gene selected from the group listed in Table 6 are also objects 
of this invention. 
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Diagnostic kits further comprising probes or amplification primers for the 
detection of any other algal, archaeal, bacterial, fungal or parasitical species than 
those specifically listed herein, comprising or not comprising those for the 
detection of the specific microbial species or genus or family or group listed above, 
and further comprising or not comprising probes and primers for the antimicrobial 

agents-resistance genes listed in Table 5, and further comprising or not comprising — 

probes and primers for the toxin genes listed in Table 6 are also objects of this 
invention. 

In a preferred embodiment, such a kit allows for the separate or the 
simultaneous detection and identification of the above-listed microbial species or 
genus or family or group; or universal detection of algae, archaea, bacteria, fungi 
or parasites; or antimicrobial agents resistance genes; or toxin genes; or for the 
detection of any microorganism (algae, archaea, bacteria, fungi or parasites). 

In the above methods and kits, probes and primers are not limited to nucleic 
acids and may include, but are not restricted to analogs of nucleotides such as: 
inosine, 3-nitropyrrole nucleosides (Nichols et al, 1994, Nature 369:492-493), 
Linked Nucleic Acids (LNA) (Koskin et aU 1998, Tetrahedron 54:3607-3630), 
and Peptide Nucleic Acids (PNA) (Egholm et al, 1993, Nature 365:566-568), 

In the above methods and kits, amplification reactions may include but are 
not restricted to: a) polymerase chain reaction (PCR), b) ligase chain reaction 
(LCR), c) nucleic acid sequence-based amplification (NASBA), d) self-sustained 
sequence replication (3SR), e) strand displacement amplification (SDA), f) 
branched DNA signal amplification (bDNA), g) transcription-mediated 
amplification (TMA), h) cycling probe technology (CPT), i) nested PCR, j) 
multiplex PCR, k) solid phase amplification (SPA), 1) nuclease dependent signal 
amplification (NDS A), m) rolling circle amplification technology (RCA), n) 
Anchored strand displacement amplification, o) Solid-phase (immobilized) rolling 
circle amplification. 

In the above methods and kits, detection of the nucleic acids of target genes 
may include real-time or post-amplification technologies. These detection 
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technologies can include, but are not limited to, fluorescence resonance energy 
transfer (FRET)-based methods such as adjacent hybridization to FRET probes 
(including probe-probe and probe-primer methods), TaqMan, Molecular Beacons, 
scorpions, nanoparticle probes and Sunrise (Amplifluor). Other detection methods 
include target genes nucleic acids detection via immunological methods, solid 
- phase hybridization-methods on filters,- chips or any-other- solid support-, whether 
the hybridization is monitored by fluorescence, chemiluminescence, potentiometry, 
mass spectrometry, plasmon resonance, polarimetry, colorimetry, or scanometry. 
Sequencing, including sequencing by dideoxy termination or sequencing by 
hybridization, e.g. sequencing using a DNA chip, is another possible method to 
detect and identify the nucleic acids of target genes. 

In a preferred embodiment, a PGR protocol is used for nucleic acid 
amplification, in diagnostic method as well as in method of construction of a 
repertory of nucleic acids and deduced sequences. 

In a particularly preferred embodiment, a PCR protocol is provided, 
comprising, an initial denaturation step of 1-3 minutes at 95 °C, followed by an 
amplification cycle including a denaturation step of one second at 95 °C and an 
annealing step of 30 seconds at 45-65°C, without any time allowed specifically for 
the elongation step. This PCR protocol has been standardized to be suitable for 
PCR reactions with most selected primer pairs, which greatly facilitates the testing 
because each clinical sample can be tested with universal, species-specific, genus- 
specific, antimicrobial agents resistance gene and toxin gene PCR primers under 
uniform cycling conditions. Furthermore, various combinations of primer pairs 
may be used in multiplex PCR assays. 

It is also an object of the present invention that tuf, atpD and recA sequences 
could serve as drug targets and these sequences and means to obtain them revealed 
in the present invention can assist the screening, design and modeling of these 
drugs. 

It is also an object of the present invention that tuf, atpD and recA sequences 
could serve for vaccine purposes and these sequences and means to obtain them 
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revealed in the present invention can assist the screening, design and modeling of 
these vaccines. 

We aim at developing a universal DNA-based test or kit to screen out 
rapidly samples which are free of algal, archaeal, bacterial, fungal or parasitical 
cells. This test could be used alone or combined with more specific identification 
-tests-to detect and-identify the above -d-gd-and/or-arc^^ 

fungal and/or parasitical species and/or genera and/or family and/or group and to 
determine rapidly the bacterial resistance to antibiotics and/or presence of bacterial 
toxins. Although the sequences from the selected antimicrobial agents resistance 
genes are available from public databases and have been used to develop DNA- 
based tests for their detection, our approach is unique because it represents a major 
improvement over current diagnostic methods based on bacterial cultures. Using an 
amplification method for the simultaneous or independent or sequential microbial 
detection-identification and antimicrobial resistance genes detection, there is no 
need for culturing the clinical sample prior to testing. Moreover, a modified PCR 
protocol has been developed to detect all target DNA sequences in approximately 
one hour under uniform amplification conditions. This procedure should save lives 
by optimizing treatment, should diminish antimicrobial agents resistance because 
less antibiotics will be prescribed, should reduce the use of broad spectrum 
antibiotics which are expensive, decrease overall health care costs by preventing or 
shortening hospitalizations, and side effects of drugs, and decrease the time and 
costs associated with clinical laboratory testing. 

In another embodiment, sequence repertories and ways to obtain them for 
other gene targets are also an object of this invention, such is the case for the hexA 
nucleic acids and/or sequences of Streptococci. 

In yet another embodiment, for the detection of mutations associated with 
antibiotic resistance genes, we built repertories to distinguish between point 
mutations reflecting only gene diversity and point mutations involved in resistance. 
Such repertories and ways to obtain them for pbpla, pbp2b and pbp2x genes of 
sensitive and penicillin-resistant Streptoccoccus pneumoniae and also for gyrA and 
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parC gene fragments from various bacterial species are also an object of the 
present invention. 

The diagnostic kits, primers and probes mentioned above can be used to 
identify algae, archaea, bacteria, fungi, parasites, antimicrobial agents resistance 
genes and toxin genes on any type of sample, whether said diagnostic kits, primers 

_and prpbes_are_ used for in yitro_ or_ in_sitU- applications. JIh.e_s.aid .samples, may 

include but are not limited to: any clinical sample, any environment sample, any 
microbial culture, any microbial colony, any tissue, and any cell line. 

It is also an object of the present invention that said diagnostic kits, primers 
and probes can be used alone or in conjunction with any other assay suitable to 
identify microorganisms, including but not limited to: any immunoassay, any 
enzymatic assay, any biochemical assay, any lysotypic assay, any serological 
assay, any differential culture medium, any enrichment culture medium, any 
selective culture medium, any specific assay medium, any identification culture 
medium, any enumeration cuture medium, any cellular stain, any culture on 
specific cell lines, and any infectivity assay on animals. 

In the methods and kits described herein below, the oligonucleotide probes 
and amplification primers have been derived from larger sequences (i.e. DNA 
fragments of at least 100 base pairs). All DNA fragments have been obtained either 
from proprietary fragments or from public databases. DNA fragments selected 
from public databases are newly used in a method of detection according to the 
present invention, since they have been selected for their diagnostic potential. 

In another embodiment, the amino acid sequences translated from the 
repertory of tuf, atpD and reck nucleic acids and/or sequences are also an object of 
the present invention. 

It is clear to the individual skilled in the art that other oligonucleotide 
sequences appropriate for (i) the universal detection of algae, archaea, bacteria, 
fungi or parasites, (ii) the detection and identification of the above microbial 
species or genus or family or group, and (iii) the detection of antimicrobial agents 
resistance genes, and (iv) the detection of toxin genes, other than those listed in 
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Annexes I to III, XXI to XXII, XXXII to XXXVII, XXXIX to XLI, and XLIII to 
LIV may also be derived from the proprietary fragments or selected public 
database sequences. For example, the oligonucleotide primers or probes may be 
shorter or longer than the ones chosen; they may also be selected anywhere else in 
the proprietary DNA fragments or in the sequences selected from public databases; 
- they may- be also -variants of-the same-oligonucleotide.- If-the-target-DNA- or a- 
variant thereof hybridizes to a given oligonucleotide, or if the target DNA or a 
variant thereof can be amplified by a given oligonucleotide PCR primer pair, the 
converse is also true; a given target DNA may hybridize to a variant 
oligonucleotide probe or be amplified by a variant oligonucleotide PCR primer. 
Alternatively, the oligonucleotides may be designed from any DNA fragment 
sequences for use in amplification methods other than PCR. Consequently, the core 
of this invention is the identification of universal, species-specific, genus-specific, 
family-specific, group-specific, resistance gene-specific, toxin gene-specific 
genomic or non-genomic DNA fragments which are used as a source of specific 
and ubiquitous oligonucleotide probes and/or amplification primers. Although the 
selection and evaluation of oligonucleotides suitable for diagnostic purposes 
requires much effort, it is quite possible for the individual skilled in the art to 
derive, from the selected DNA fragments, oligonucleotides other than the ones 
listed in Annexes I to III, XXI to XXII, XXXII to XXXVII, XXXIX to XLI, and 
XLIII to LIV which are suitable for diagnostic purposes. When a proprietary 
fragment or a public databases sequence is selected for its specificity and ubiquity, 
it increases the probability that subsets thereof will also be specific and ubiquitous. 

Since a high percentage of clinical specimens are negative for bacteria 
(Table 3), DNA fragments having a high potential for the selection of universal 
oligonucleotide probes or primers were selected from proprietary and public 
database sequences. The amplification primers were selected from genes highly 
conserved in algae, archaea, bacteria, fungi and parasites, and are used to detect the 
presence of any algal, archaeal, bacterial, fungal or parasitical pathogen in clinical 
specimens in order to determine rapidly whether it is positive or negative for algae, 
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archaea, bacteria, fungi or parasites. The selected genes, designated tuf fits, atpD 
and recA, encode respectively 2 proteins (elongation factors Tu and G) involved in 
the translational process during protein synthesis, a protein (beta subunit) 
responsible for the catalytic activity of proton pump ATPase and a protein 
responsible for the homologous recombination of genetic material. The alignments 

ottuf-atpDsnd recA sequences used to derive the universal primers include- both 

proprietary and public database sequences. The universal primer strategy allows 
the rapid screening of the numerous negative clinical specimens (around 80% of 
the specimens received, see Table 3) submitted for microbiological testing. 

Table 4 provides a list of the archaeal, bacterial, fungal and parasitical 
species for which tuf and/or atpD and/or recA nucleic acids and/or sequences are 
revealed in the present invention. Tables 5 and 6 provide a list of antimicrobial 
agents resistance genes and toxin genes selected for diagnostic purposes. Table 7 
provides the origin of tuf atpD and recA nucleic acids and/or sequences listed in 
the sequence listing. Tables 8-10 and 12-14 provide lists of species used to test the 
specificity, ubiquity and sensitivity of some assays described in the examples. 
Table 11 provides a list of microbial species for which tuf and/or atpD and/or recA 
sequences are available in public databases. Table 15 lists the microorganisms 
identified by commercial systems. Tables 16-18 are part of Example 42, whereas 
Tables 19-20 are part of Example 43. Tables 21-22 illustrate Example 44, whereas 
Tables 23-25 illustrate Example 45. 
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In accordance with the present invention is provided a method for generating 
a repertory of nucleic acids of tuf, fus, atpD and/or recA genes from which 
are derived probes or primers, or both, useful for the detection of one, more 
than one related microorganisms, or substantially all microorganisms of a 
group selected from algae, archaea, bacteria, fungi and parasites, which 
comprises the step of: 

amplifying the nucleic acids of a plurality of determined algal, 
archaeal, bacterial, fungal and parasitical species with any combination of 
the primer pairs defined in SEQ ID NOs.: 558-561, 562-574, 636-655, 664, 
681-683, 696-697, 699-700, 708, 812-815, 911-917, 919-922, 935-938, 
1203-1207, 1212-1213, 1221-1229, 1605-1606, 1974-1984, 1999- 2003, 
2282-2285. 

The terms "related microorganisms" are intended to cover microorganisms 
that share a common evolutive profile up to the speciation e.g. those that 
belong to a species, a genus, a family or a phyllum. The same terms are also 
intended to cover a group of different species that are grouped for a specific 
reason, for example, because they all have a common host tissue or cell. In 
one specific example, a group of microorganims potentially found in platelet 
preparations are grouped together and are considered "related" organisms 
for the purpose of their simultaneous detection in that particular type of 
sample. 

The repertories per se of nucleic acids and of sequences derived therefrom 
are also provided, as well as "gene banks" comprising these repertories. 

For generating sequences of probes or primers, the above method is 
reproduced or one may start from the sequence repertory or gene bank itself, 
and the following steps are added: 

aligning a subset of nucleic acid sequences of said repertory, 
locating nucleic acid stretches that are present in the nucleic 
acids of strains or representatives of said one, more than one related 
microorganisms, or substantially all microorganisms of said group, and not 
present in the nucleic acid sequences of other microorganisms, and 
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deriving consensus nucleic acid sequences useful as probes or 
primers from said stretches. 

Once the sequences of probes or primers are designed, they are converted 
into real molecules by nucleic acid synthesis. 



From the above methods and resulting repertories, probes and primers for 
the universal detection of any one of alga, archaeon, bacterium, fungus and 
parasite are obtainable. 

More specifically, the following probes or primers having the sequence 
defined in SEQ ID NOs.: 543, 556-574, 636-655, 658-661, 664, 681-683, 
694, 696, 697, 699, 700, 708, 812-815, 911-917, 919-922, 935-938, 1203- 
1207, 1212-1213, 1221-1229, 1605-1606, 1974-1984, 1999-2000, 2282- 
2285 or any variant of at least 12 nucleotides capable of hybridizing with the 
targeted microorganism(s) and these sequences and a diagnostic method 
using the same are provided. 

Further, probes or primers having specific and ubiquitous properties for the 
detection and identification of any one of an algal, archaeal, bacterial, fungal 
and parasitital species, genus, family and group are also designed and 
derived from the same methods and repertories. 

More specifically, are provided definite probes or primers having specific and 
ubiquitous properties for the detection and identification of microorganisms. 

Indeed, a general method is provided for detecting the presence in a test 
sample of any microorganism that is an alga, archaeum, bacterium, fungus or 
parasite, which comprises: 

a) putting in contact any test sample tuf or atpD or recA 
sequences and nucleic acid primers and/or probes, said primers and/or 
probes having been selected to be sufficiently complementary to hybridize to 
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one or more tuf or atpD or recA sequences that are specific to said 
microorganism: 

b) allowing the primers and/or probes and any test sample tuf or 
atpD or recA sequences to hybridize under specified conditions such as said 
primers and/or probes hybridize to the tuf or afpD or recA sequences of said 
microorganism jind does not de[ectably_ hybridize to tuf or afpDju recA 
sequences from other microorganisms; and, 

c) testing for hybridization of said primers and/or probes to any 
test sample tuf or atpD or recA sequences. 

In the latter, step c) is based on a nucleic acid target amplification method, or 
on a signal amplification method. 

The terms "sufficiently complementary" cover perfect and imperfect 
complementarity. 

In addition to the universal or the specific detection and/or identification of 
microorganisms, the simultaneous detection of antimicrobial agent resistance 
gene or of a toxin gene is provided in compositions of matter as well as in 
diagnostic methods. Such detection is brought by using probes or primers 
having at least 12 nucleotides in length capable of hybridizing with an 
antimicrobial agent resistance gene and/or toxin genera definite set thereof 
being particularly provided. 

Of course, any propriatory nucleic acid and nucleotide sequence derived 
therefrom, and any variant of at least 12 nucleotides capable of a selective 
hybridization with the following nucleic acids are within the scope of this 
invention as well as derived recombinant vectors and hosts: 

SEQ ID NOs.: 1-73, 75-241, 399-457, 498-529, 612-618, 621-624, 
675, 677, 717-736, 779-792, 840-855, 865, 868-888, 897-910, 932, 967-989 
992, 1266-1297, 1518-1526, 1561-1575, 1578-1580, 1662-1664, 1666-1667, 
1669-1670, 1673-1683, 1685-1689, 1786-1843, 1874-1881, 1956-1960, 
2183-2185, 2187-2188, 2193-2201, 2214-2249, 2255-2272, which are all tuf 
sequences; 
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SEO ID NOs.: 242-270, 272-398, 458-497, 530-538, 663, 667, 673- 
676, 678-680, 737-778, 827-832, 834-839, 856-862, 866-867, 889-896, 929- 
931, 941-966, 1245-1254, 1256-1265, 1527, 1576-1577, 1600-1604,1638- 
1647, 1649-1660, 1671, 1684, 1844-1848, 1849-1865, 2189-2192, which are 
all atpD sequences; 

SEQ ID NOs.: 990-991, 1003, 1288-1289, 17H 1756-1763, 1866- 
1873 and 2202-2212, which are all recA sequences; and 

SEQ ID NOs.: 1004-1075, 1255, 1607-1608, 1648, 1764-1785, 2013- 
2014, 2056-2064, 2273-2280, which are antimicrobial agent resistance or 
toxin gene sequences found to be suitable for the detection and identification 
of microbial species., 

To complement the following repertories, another one comprising hexA 
nucleic acids and derived sequences have been construed through 
amplification of nucleic acids of any streptococcal species with any 
combination of primers SEO ID NOs.: 1179, 1181, 1182 and 1184 to 1191. 
From this particular repertory, primers and/or probes for detecting 
Streptococcus pneumoniae have been designed and obtained. Particularly, a 
nucleic acid sequence of at least 12 nucleotides capable of hybridizing with 
Streptococcus pneumoniae and with any one of SEQ ID NOs.: 1184 to 1 187 
orwith SEQ ID NOs.: 1179, 1180, 1181 or 1182 are provided. 

The remarkable sequence diversity of nucleic acids that encode proteins also 
provides diversity of peptide sequences which constitute another repertory 
that is also within the scope of this invention. From the protein and nucleic 
acid sequence repertories is derived a use therefrom for the design of a 
therapeutic agent effective against a target microorganism, for example, an 
antibiotic, a vaccine or a genie therapeutic agent. 

Due to the constant evolution in the diagnostic methods, here is finally 
provided a method for the identification of a microorganism in a test sample, 
comprising the steps of: 

a) obtaining a nucleic acid sequence from a tuf, fus, afpD, and/or 
recA genes of said microorganisms, and 
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b) comparing said nucleic acid sequence with the nucleic acid 
sequences of a bank as defined in claim 5, said repertory comprising a 
nucleic acid sequence obtained from the nucleic acids of said 
microorganism, whereby said microorganism is identify when there is a 
match between the sequences. 



In this method, any way by which the specified given sequence is obtained is 
contemplated, and this sequence is simply compared to the sequences of a 
bank or a repertory. If the comparison results in a match, e.g. if bank 
comprises the nucleic acid sequence of interest, the identification of the 
microorganism is provided. 
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DETAILED DESCRIPTION OF THE INVENTION 

HIGHLY CONSERVED GENES AND THEIR USE TO GENERATE SPECIES- 
SPECIFIC, GENUS-SPECIFIC FAMILY-SPECIFIC GROUP-SPECIFIC AND 
UNIVERSAL NUCLEIC ACID PROBES AND AMPLIFICATION PRIMERS TO 
RAPIDLY DETECT AND IDENTIFY ALGAL, ARCHAEAL. BACTERIAL. 
FUNGAL AND PARASITICAL MICROORGANISMS FROM CLINICAL 
SPECIMENS FOR DIAGNOSIS 

The present inventors reasoned that comparing the published Haemophilus 
influenzae and Mycoplasma genitalium genomes and searching for conserved 
genes could provide targets to develop useful diagnostic primers and probes. This 
sequence comparison is highly informative as these two bacteria are distantly 
related and most genes present in the minimal genome of M. genitalium are likely 
to be present in every bacterium. Therefore genes conserved between these two 
bacteria are likely to be conserved in all other bacteria. 

Following the genomic comparison, it was found that several protein -coding 
genes were conserved in evolution. Highly conserved proteins included the 
translation elongation factors G (EF-G) and Tu (EF-Tu) and the (3 subunit of F()Fi 
type ATP-synthase, and to a lesser extent, the RecA recombinase. These four 
proteins coding genes were selected amongst the 20 most conserved genes on the 
basis that they all possess at least two highly conserved regions suitable for the 
design of universal amplification and sequencing primers. Moreover, within the 
fragment amplified by these primers, highly conserved and more variable regions 
are also present hence suggesting it might be possible to rapidly obtain sequence 
information from various microbial species to design universal as well as species-, 
genus-, family-, or group-specific primers and probes of potential use for the 
detection and identification and/or quantification of microorganisms. 
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Translation elongation factors are members of a family of GTP-binding 
proteins which intervene in the interactions of tRNA molecules with the ribosome 
machinery during essential steps of protein synthesis. The role of elongation factor 
Tu is to facilitate the binding of aminoacylated tRNA molecules to the A site of the 
ribosome. The eukaryotic, archaeal (archaebacterial) and algal homolog of EF-Tu 

__isxalled elongation-factor -1 alpha-(EF-lot). All protein synthesis factors'ongiriated 

from a common ancestor via gene duplications and fusions (Cousineau et aL, 1997, 
J. Mol. Evol. 45:661-670). In particular, elongation factor G (EF-G), although 
having a functional role in promoting the translocation of aminoacyl-tRNA 
molecules from the A site to the P site of the ribosome, shares sequence 
homologies with EF-Tu and is thought to have arisen from the duplication and 
fusion of an ancestor of the EF-Tu gene. 

In addition, EF-Tu is known to be the target for antibiotics belonging to the 
elfamycin's group as well as to other structural classes (Anborgh and Parmeggiani, 
1991, EMBO J. 10:779-784; Luiten et a/., 1992, European patent application serial 
No. EP 0 466 251 A 1). EF-G for its part, is the target of the antibiotic fusidic acid. 
In addition to its crucial activities in translation, EF-Tu has chaperone-like 
functions in protein folding, protection against heat denaturation of proteins and 
interactions with unfolded proteins (Caldas et al y 1998, J. BioL Chem 273:1 1478- 
11482). Interestingly, a form of the EF-Tu protein has been identified as a 
dominant component of the periplasm of Neisseria gonorrhoeae (Porcella et aL, 
1996, Microbiology 142:2481-2489), hence suggesting that at least in some 
bacterial species, EF-Tu might be an antigen with vaccine potential. 

FoFl type ATP-synthase belongs to a superfamily of proton-translocating 

ATPases divided in three major families: P, V and F (Nelson and Taiz, 1989, TIBS 
14:113-1 16). P- ATPases (or E1-E2 type) operate via a phosphorylated intermediate 
. and are not evolutionarily related to the other two families. V-ATPases (or V()Vi 
type) are present on the vacuolar and other endomembranes of eukaryotes, on the 
plasma membrane of archaea (archaebacteria) and algae, and also on the plasma 
membrane of some eubacteria especially species belonging to the order 
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Spirochaetales as well as to the Chlamydiaceae and Deinococcaceae families. F- 
ATPases (or F()Fi type) are found on the plasma membrane of most eubacteria, on 
the inner membrane of mitochondria and on the thylakoid membrane of 
chloroplasts. They function mainly in ATP synthesis. They are large multimeric 
enzymes sharing numerous structural and functional features with the V-ATPases. 
"Find ~V : type~~ATPases "have diverged fronTa common" ancestor in aiT event 
preceding the appearance of eukaryotes. The (3 subunit of the F-ATPases is the 
catalytic subunit and it possesses low but significant sequence homologies with the 
catalytic A subunit of V-ATPases. 

The translation elongation factors EF-Tu, EF-G and EF-la, and the catalytic 
subunit of F or V-types ATP-synthase, are highly conserved proteins sometimes 
used for phylogenetic analysis and their genes are also known to be highly 
conserved (Iwabe etal, 1989, Proc. Natl. Acad. Sci. USA 86:9355-9359, Gogarten 
et aU 1989, Proc. Natl. Acad. Sci. USA 86:6661-6665, Ludwig et al y 1993, 
Antonie van Leeuwenhoek 64:285-305). A recent BLAST (Altschul et aU 1997, J. 
Mol. Biol. 215:403-410) search performed by the present inventors on the 
GenBank, European Molecular Biology Laboratory (EMBL), DNA Database of 
Japan (DDBJ) and specific genome project databases indicated that throughout 
bacteria, the EF-Tu and the (3 subunit of F()Fi type ATP-synthase genes may be 
more conserved than other genes that are well conserved between H. influenzae 
and M. genitalium. 

The RecA recombinase is a multifunctional protein encoded by the reck 
gene. It plays a central role in homologous recombination, it is critical for the 
repair of DNA damage and it is involved in the regulation of the SOS system by 
promoting the proteolytic digestion of the LexA repressor. It is highly conserved in 
bacteria and could serve as a useful genetic marker to reconstruct bacterial 
phylogeny (Miller and Kokjohn, 1990, Annu. Rev, Microbiol. 44:365-394). 
Although RecA possesses some highly conserved sequence segments that we used 
to design universal primers aimed at sequencing the reck fragments, it is clearly 
not as well conserved EF-G, EF-Tu and |3 subunit of F()Fl type ATP-synthase. 
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Hence, RecA may not be optimal for universal detection of bacteria with high 
sensitivity but it was chosen because preliminary data indicated that EF-G, EF-Tu 
and p subunit of F()Fl type ATP-synthase may sometimes be too closely related to 
find specific primer pairs that could discriminate between certain very closely 
related species and genera. While RecA, EF-G, EF-Tu and p subunit of F()Fi type 

ATP-synthasegenes, possesses highly conserved regions suitable for the design of 

universal sequencing primers, the less conserved region between primers should be 
divergent enough to allow species-specific and genus-specific primers in those 
cases. 

Thus, as targets to design primers and probes for the genetic detection of 
microorganisms, the present inventors have focused on the genes encoding these 
four proteins: tuf the gene for elongation factor Tu (EF-Tu); fus, the gene for the 
elongation factor G (EF-G); atpD, the gene for (3 subunit of F()Fl type ATP- 
synthase; and recA y the gene encoding the RecA recombinase. In several bacterial 
genomes tuf is often found in two highly similar duplicated copies named tuf A and 
tufB (Filer and Furano, 1981, J. Bacterid. 148:1006-1011, Sela et al, 1989, J. 
Bacteriol. 171:581-584). In some particular cases, more divergent copies of the tuf 
genes can exist in some bacterial species such as some actinomycetes (Luiten et al 
European patent application publication No. EP 0 446 251 Al; Vijgenboom et al, 
1994, Microbiology 140:983-998) and, as revealed as part of this invention, in 
several enterococcal species. In several bacterial species, tuf is organized in an 
operon with its homolog gene for the elongation factor G (EF-G) encoded by the 
fusA gene (Figure 3). This operon is often named the str operon. The tuffus, atpD 
and recA genes were chosen as they are well conserved in evolution and have 
highly conserved stretches as well as more variable segments. Moreover, these four 
genes have eukaryotic orthologs which are described in the present invention as 
targets to identify fungi and parasites. The eukaryotic homolog of elongation factor 
Tu is called elongation factor 1-alpha (EF-lot) (gene name: tef tefh efh efl or 
EF-1). In fungi, the gene for EF-la occurs sometimes in two or more highly 
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similar duplicated copies (often named tefl, tefl, tefl...). In addition, eukaryotes 
have a copy of elongation factor Tu which is originating from their organelle 
genome ancestry (gene name: tufl, tuJM or tufA). For the purpose of the current 
invention, the genes for these four functionally and evolutionarily linked elon- 
gation factors (bacterial EF-Tu and EF-G, eukaryotic EF-la, and organellar EF- 
Tu) will hereafter be "designated as~«tuf nucleic acids and/or sequence 
eukaryotic (mitochondrial) FoFl type ATP-synthase beta subunit gene is named 
atp2 in yeast. For the purpose of the current invention, the genes of catalytic sub- 
unit of either F or V-type ATP-synthase will hereafter be designated as «atpD 
nucleic acids and/or sequences)). The eukaryotic homologs of RecA are distributed 
in two families, typified by the Rad51 and Dmcl proteins. Archaeal homologs of 
RecA are called RadA. For the purpose of the current invention, the genes corres- 
ponding to the latter proteins will hereafter be designated as «recA nucleic acids 
and/or sequences)). 

In the description of this invention, the terms «nucleic acids)) and 
«sequences)) might be used interchangeably. However, «nucleic acids)) are 
chemical entities while «sequences)) are the pieces of information derived from 
(inherent to) these «nucleic acids)). Both nucleic acids and sequences are equiva- 
lent^ valuable sources of information for the matter pertaining to this invention. 

Analysis of multiple sequence alignments of tuf and atpD sequences 
permitted the design of oligonucleotide primers (and probes) capable of amplifying 
(or hybridizing to) segments off«/(and/or Jus) and atpD genes from a wide variety 
of bacterial species (see Examples 1 to 4, 24 and 26, and Table 7). Sequencing and 
amplification primer pairs for tuf nucleic acids and/or sequences are listed in 
Annex I and hybridization probes are listed in Annexes III and XLVII. Sequencing 
and amplification primer pairs for atpD nucleic acids and/or sequences are listed in 
Annex II. Analysis of the main subdivisions of tuf and atpD sequences (see Figures 
1 and 2) permitted to design sequencing primers amplifying specifically each of 
these subdivisions. It should be noted that these sequencing primers could also be 
used as universal primers. However, since some of these sequencing primers 

30 



WO 01/23604 PCT/CAOO/01150 

include several variable sequence (degenerated) positions, their sensitivity could be 
lower than that of universal primers developed for diagnostic purposes. Further 
subdivisions could be done on the basis of the various phyla where these genes are 
encountered. 

Similarly, analysis of multiple sequence alignments of reck sequences present 
in the" publicdatabasespermitted the "design of oligonucleot^ 
amplifying segments of reck genes from a wide variety of bacterial species. 
Sequencing and amplification primer pairs for reck sequences are listed in Annex 
XXI. The main subdivisions of reck nucleic acids and/or sequences comprise reck, 
radk y rad51 and dmcl. Further subdivisions could be done on the basis of the 
various phyla where these genes are encountered. 

The present inventor's strategy is to get as much sequence data information 
from the four conserved genes (tuf, fus, atpD and reck). This ensemble of 
sequence data forming a repertory (with subrepertories corresponding to each 
target gene and their main sequence subdivisions) and then using the sequence 
information of the sequence repertory (or subrepertories) to design primer pairs 
that could permit either universal detection of algae or archaea or bacteria or fungi 
or parasites, detection of a family or group of microorganism (e.g. 
Enterobacteriaceae), detection of a genus (e.g. Streptococcus) or finally a specific 
species (e.g. Staphylococcus aureus). It should be noted that for the purpose of the 
present invention a group of microorganisms is defined depending on the needs of 
the particular diagnostic test. It does not need to respect a particular taxonomical 
grouping or phylum. See Example 12 where primers were designed to amplify a 
group a bacteria consisting of the 17 major bacterial species encountered as 
contaminants of platelet concentrates. Also remark that in that Example, the 
primers are not only able to sensitively and rapidly detect at least the 17 important 
bacterial species, but could also detect other species as well, as shown in Table 14. 
In these circumstances the primers shown in Example 12 are considered universal 
for platelet-contaminating bacteria. To develop an assay specific for the latter, one 
or more primers or probes specific to each species could be designed. Another 
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example of primers and/or probes for group detection is given by the Pseudomonad 
group primers. These primers were designed based upon alignment of tuf 
sequences from real Pseudomonas species as well as from former Pseudomonas 
species such as Stenotrophomonas maltophilia. The resulting primers are able to 
amplify all Pseudomonas species tested as well as several species belonging to 
different genera, hence asjbeing specific for ji group including Pseudomonas . and - 
other species, we defined that group as Pseudomonads, as several members were 
former Pseudomonas. 

For certain applications, it may be possible to develop a universal, group, 
family or genus-specific reaction and to proceed to species identification using 
sequence information within the amplicon to design species-specific internal 
probes or primers, or alternatively, to proceed directly by sequencing the amplicon. 
The various strategies will be discussed further below. 

The ensembles formed by public and proprietary tuf, atpD and reck nucleic 
acids and/or sequences are used in a novel fashion so they constitute three 
databases containing useful information for the identification of microorganisms. 

Sequence repertories of other gene targets were also built to solve some 
specific identification problems especially for microbial species genetically very 
similar to each other such as E. coli and Shigella (see Example 23). Based on tuf, 
atpD and reck sequences, Streptococcus pneumoniae is very difficult to 
differentiate from the closely related species S. oralis and 5. mitis. Therefore, we 
elected to built a sequence repertory from hexA sequences (Example 19), a gene 
much more variable than our highly conserved tuf , atpD and recA nucleic acids 
and/or sequences. 

For the detection of mutations associated with antibiotic resistance genes, we 
also built repertories to distinguish between point mutations reflecting only gene 
diversity and point mutations involved in resistance. This was done for pbpla, 
pbp2b and pbp2x genes of penicillin-resistant and sensitive Streptoccoccus 
pneumoniae (Example 18) and also for gyrA and parC gene fragments of various 
bacterial species for which quinolone resistance is important to monitor. 
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Oligonucleotide primers and probes design and synthesis 

The tuf,fus, atpD and recA DNA fragments sequenced by us and/or selected 
from public databases (GenBank and EMBL) were used to design oligonucleotides 
primers and probes for diagnostic purposes. Multiple sequence alignments were 
made using subsets of the tuf or atpD or recA sequences repertory. Subsets were 
chosen to encompass as much as possible of the targetted microorganism(s) DNA 
sequence data and also include sequence data from phylogenetically related 
microorganisms from which the targetted microorganism(s) should be 
distinguished. Regions suitable for primers and probes should be conserved for the 
targetted microorganism(s) and divergent for the microorganisms from which the 
targetted microorganism(s) should be distinguished. The large amount of tuf or 
atpD or recA sequences data in our repertory permits to reduce trial and errors in 
obtaining specific and ubiquitous primers and probes. We also relied on the 
corresponding peptide sequences of tuf Jus, atpD and recA nucleic acids and/or 
sequences to facilitate the identification of regions suitable for primers and probes 
design. As part of the design rules, all oligonucleotides (probes for hybridization 
and primers for DNA amplification by PCR) were evaluated for their suitability for 
hybridization or PCR amplification by computer analysis using standard programs 
(i.e. the Genetics Computer Group (GCG) programs and the primer analysis 
software Oligo™ 5.0). The potential suitability of the PCR primer pairs was also 
evaluated prior to the synthesis by verifying the absence of unwanted features such 
as long stretches of one nucleotide and a high proportion of G or C residues at the 
3' end (Persing et al, 1993, Diagnostic Molecular Microbiology: Principles and 
Applications, American Society for Microbiology, Washington, D.C.). 
Oligonucleotide probes and amplification primers were synthesized using an 
automated DNA synthesizer (Perkin-Elmer Corp., Applied Biosystems Division). 

The oligonucleotide sequence of primers or probes may be derived from 
either strand of the duplex DNA. The primers or probes may consist of the bases 
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A, G, C, or T or analogs and they may be degenerated at one or more chosen 
nucleotide position(s). The primers or probes may be of any suitable length and 
may be selected anywhere within the DNA sequences from proprietary fragments 
or from selected database sequences which are suitable for (i) the universal 
detection of algae or archaea or bacteria or fungi or parasites, (ii) the species- 
~i specific "detection Mi identification" of "^y miaoOTgaSiismr including but not 
limited to: Abiotrophia adiacens, Bacteroides fragilis> Bordetella pertussis, 
Candida albicans, Candida dubliniensis, Candida glabrata, Candida 
guilliermondii, Candida krusei, Candida lusitaniae, Candida parapsilosis, 
Candida tropicalis, Candida zeylanoides, Campylobacter jejuni and C. coli, 
Chlamydia pneumoniae, Chlamydia trachomatis, Cryptococcus neoformans, 
Cryptosporidium parvum, Enterococcus faecalis, Enterococcus faecium, 
Enterococcus gallinarum, Escherichia coli, Haemophilus influenzae, Legionella 
pneumophila, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Pseudomonas 
aeruginosa, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Staphylococcus hominis, Staphylococcus saprophyticus, 
Streptococcus agalactiae, Streptococcus pneumoniae, Trypanosoma brucei, 
Trypanosoma cruzi, (Hi) the genus-specific detection of Bordetella species, 
Candida species, Clostridium species, Corynebacterium species, Cryptococcus 
species, Entamoeba species, Enterococcus species, Gemella species, Giardia 
species, Legionella species, Leishmania species, Staphylococcus species, 
Streptococcus species, Trypanosoma species, (iv) the family-specific detection of 
Enterobacteriaceae family members, Mycobacteriaceae family members, 
Trypanosomatidae family members, (v) the detection of Enterococcus 
casseliflavus-flavescens-gallinarum group, Enterococcus, Gemella and 
Abiotrophia adiacens group, Pseudomonads extended group, Platelet- 
contaminating bacteria group, (vi) the detection of clinically important 
antimicrobial agents resistance genes listed in Table 5, (vii) the detection of 
clinically important toxin genes listed in Table 6. 
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Variants for a given target microbial gene are naturally occurring and are 
attributable to sequence variation within that gene during evolution (Watson et ah, 
1987, Molecular Biology of the Gene, 4 th ed., The Benjamin/Cummings Publishing 
Company, Menlo Park, CA; Lewin, 1989, Genes IV, John Wiley & Sons, New 
York, NY). For example, different strains of the same microbial species may have 
a single or more nucleotide variadon(s) at th^oligonucleotide ^hybridization site. 
The person skilled in the art is well aware of the existence of variant algal, 
archaeal, bacterial, fungal or parasitical DNA nucleic acids and/or sequences for a 
specific gene and that the frequency of sequence variations depends on the 
selective pressure during evolution on a given gene product. The detection of a 
variant sequence for a region between two PCR primers may be demonstrated by 
sequencing the amplification product. In order to show the presence of sequence 
variants at the primer hybridization site, one has to amplify a larger DNA target 
with PCR primers outside that hybridization site. Sequencing of this larger 
fragment will allow the detection of sequence variation at this site. A similar 
strategy may be applied to show variants at the hybridization site of a probe. 
Insofar as the divergence of the target nucleic acids and/or sequences or a part 
thereof does not affect the specificity and ubiquity of the amplification primers or 
probes, variant microbial DNA is under the scope of this invention. Variants of the 
selected primers or probes may also be used to amplify or hybridize to a variant 
DNA. 

Sequencing of tuf nucleic acids and/or sequences from a variety of archaeal, 
bacterial, fungal and parasitical species 

The nucleotide sequence of a portion of tuf nucleic acids and/or sequences 
was determined for a variety of archaeal, bacterial, fungal and parasitical species. 
The amplification primers (SEQ ID NOs. 664 and 697), which amplify a tuf gent 
portion of approximately 890 bp, were used along with newly designed sequencing 
primer pairs (See Annex I for the sequencing primers for tuf nucleic acids and/or 
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sequences). Most primer pairs can amplify different copies oituf genes (tu/A and 
tufB). This is not surprising since it is known that for several bacterial species these 
two genes are nearly identical. For example, the entire tuf A and tufB genes from E. 
coli differ at only 13 nucleotide positions (Neidhardtef ah, 1996, Escherichia coli 
and Salmonella: Cellular and Molecular Biology, 2 nd ed., American Society for 
-Microbiology -Press,- Washington^ D.Ct); Similarly y some fungi are known to have 
two nearly identical copies oituf nucleic acids and/or sequences (EF-la). These 
amplification primers are degenerated at several nucleotide positions and contain 
inosines in order to allow the amplification of a wide range of tuf nucleic acids 
and/or sequences. The strategy used to select these amplification primers is similar 
to that illustrated in Annex I for the selection of universal primers. The tuf 
sequencing primers even sometimes amplified highly divergent copies oftuf genes 
(tufC) as illustrated in the case of some enterococcal species (SEQ ID NOs.: 73, 
75, 76, 614 to 618, 621 and 987 to 989). To prove this, we have determined the 
enterococcal tuf nucleic acids and/or sequences from PCR amplicons cloned into a 
plasmid vector. Using the sequence data from the cloned amplicons, we designed 
new sequencing primers specific to the divergent (tufC) copy of enterococci(SEQ 
ID NOs.: 658-659 and 661) and then sequenced directly ihetufC amplicons. The 
amplification primers (SEQ ID NOs.: 543, 556, 557, 643-645, 660, 664, 694, 696 
and 697) could be used to amplify theft*/" nucleic acids and/or sequences from any 
bacterial species. The amplification primers (SEQ ID NOs.: 558, 559, 560, 653, 
654, 655, 813, 815, 1974-1984, 1999-2003) could be used to amplify the/w/ (EF- 
la) genes from any fungal and/or parasitical species. The amplification primers 
SEQ ID NOs. 1221-1228 could be used to amplify bacterial tuf nucleic acids 
and/or sequences of the EF-G subdivision (fusA) (Figure 3). The amplification 
primers SEQ ID NOs. 1224, and 1227-1229 could be used to amplify bacterial^/ 
nucleic acids and/or sequences comprising the end of EF-G (fusA) and the 
beginning of EF-Tu (tuf), including the intergenic region, as shown in Figure 3. 
Most tuf fragments to be sequenced were amplified using the following 
amplification protocol: One ul of cell suspension (or of purified genomic DNA 
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0.1-100 ng//il) was transferred directly to 19 /xl of a PCR reaction mixture. Each 
PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 9.0), 0.1% Triton X- 
100, 2.5 mM MgCl 2 , 1 /xM of each of the 2 primers, 200 of each of the four 
dNTPs, 0.5 unit of Taq DNA polymerase (Promega Corp., Madison, WI). PCR 
reactions were subjected to cycling using a PTC-200 thermal cycler (MJ Research 
Inc.,_Watertown,-Mass.) as follows: 3 min at-94-96 °G followed by 30^45cycles of 
1 min at 95 °C for the denaturation step, 1 min at 50-55 °C for the annealing step 
and 1 min at 72 °C for the extension step. Subsequently, twenty microliters of the 
PCR-amplified mixture were resolved by electrophoresis in a 1.5% agarose gel. 
The amplicons were then visualized by staining with methylene blue (Flores et aU 
1992, Biotechniques, 13:203-205). The size of the amplification products was 
estimated by comparison with a 100-bp molecular weight ladder. The band 
corresponding to the specific amplification product was excised from the agarose 
gel and purified using the QIAquick™ gel extraction kit (QIAGEN Inc., 
Chatsworth, CA). The gel-purified DNA fragment was then used directly in the 
sequencing protocol. Both strands of the tuf genes amplification product were 
sequenced by the dideoxynucleotide chain termination sequencing method by 
using an Applied Biosystems automated DNA sequencer (model 377) with their 
Big Dye™ Terminator Cycle Sequencing Ready Reaction Kit (Applied 
Biosystems, Foster City, CA). The sequencing reactions were performed by using 
the same amplification primers and 10 ng/100 bp of the gel-purified amplicon per 
reaction. For the sequencing of long amplicons such as those of eukaryotic tuf (EF- 
la) nucleic acids and/or sequences, we designed internal sequencing primers 
(SEQ ID NOs.: 654, 655 and 813) to be able to obtain sequence data on both 
strands for most of the fragment length. In order to ensure that the determined 
sequence did not contain errors attributable to the sequencing of PCR artefacts, we 
have sequenced two preparations of the gel-purified tuf amplification product 
originating from two independent PCR amplifications. For most target microbial 
species, the sequences determined for both amplicon preparations were identical. 
In case of discrepancies, amplicons from a third independent PCR amplification 
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were sequenced. Furthermore, the sequences of both strands were 100% 
complementary thereby confirming the high accuracy of the determined sequence. 
The tuf nucleic acids and/or sequences determined using the above strategy are 
described in the Sequence Listing. Table 7 gives the originating microbial species 
and the source for each tuf sequence in the Sequence Listing. 

-The -alignment of- the -tuf -sequences determined -by us or- selected from 

databases revealed clearly that the length of the sequenced portion of the tuf genes 
is variable. There may be insertions or deletions of several amino acids. In 
addition, in several fungi introns were observed. Intron nucleic acids and/or 
sequences are part of tuf nucleic acids and/or sequences and could be useful in the 
design of species-specific primers and probes. This explains why the size of the 
sequenced tuf amplification products was variable from one fungal species to 
another. Consequently, the nucleotide positions indicated on top of each of 
Annexes IV to XX, XXIII to XXXI, XXXVIII and XLII do not correspond for 
sequences having insertions or deletions. 

It should also be noted that the various tuf nucleic acids and/or sequences 
determined by us occasionally contain base ambiguities. These degenerated 
nucleotides correspond to sequence variations between tufA and tufB genes (or 
copies of the EF-G subdivision of tuf nucleic acids and/or sequences, or copies of 
EF- la subdivision of tuf nucleic acids and/or sequences for fungi and parasites) 
because the amplification primers amplify both tuf genes. These nucleotide 
variations were not attributable to nucleotide misincorporations by the Taq DNA 
polymerase because the sequence of both strands was identical and also because 
the sequences determined with both preparations of the gel-purified ta/amplicons 
obtained from two independent PCR amplifications were identical. 

The selection of amplification primers from tuf nucleic acids and/or sequences 

The tuf sequences determined by us or selected from public databases were 
used to select PCR primers for universal detection of bacteria, as well as for genus- 
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specific, species-specific family-specific or group-specific detection and 
identification. The strategy used to select these PCR primers was based on the 
analysis of multiple sequence alignments of various to/sequences. For more details 
about the selection of PCR primers from tuf sequences please refer to Examples 5, 
7-14, 17, 22, 24, 28, 30-31, 33, 36, and _38-40,jand to Annexes _VMX, XI-XIX and 
XXV. 

Sequencing oiatoD and recA nucleic acids and/or sequences from a variety of 
archaeal, bacterial, fungal and parasitical species 

The method used to obtain atpD and recA nucleic acids and/or sequences is 
similar to that described above for /«/nucleic acids and/or sequences. 

The selection of amplification primers from atpD or recA nucleic acids and/or 
sequences 

The comparison of the nucleotide sequence for the atpD or reck genes from 
various archaeal, bacterial, fungal and parasitical species allowed the selection of 
PCR primers (refer to Examples 6, 13, 29, 34 and 37, and to Annexes IV, V, X, 
and XX). 

DNA amplification 

For DNA amplification by the widely used PCR (polymerase chain reaction) 
method, primer pairs were derived from proprietary DNA fragments or from 
database sequences. Prior to synthesis, the potential primer pairs were analyzed by 
using the Oligo™ 5.0 software to verify that they were good candidates for PCR 
amplification. 

During DNA amplification by PCR, two oligonucleotide primers binding 
respectively to each strand of the heat-denatured target DNA from the microbial 
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genome are used to amplify exponentially in vitro the target DNA by successive 
thermal cycles allowing denaturation of the DNA, annealing of the primers and 
synthesis of new targets at each cycle (Persing et aU 1993, Diagnostic Molecular 
Microbiology; Principles and Applications, American Society for Microbiology, 
Washington, D.C.). 

Briefly, -the PCR-protocols were-as follows: -Treated-clinical-specimens or 

standardized bacterial or fungal or parasitical suspensions (see below) or purified 
genomic DNA from bacteria, fungi or parasites were amplified in a 20 /d PCR 
reaction mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 
9.0), 2.5 mM MgCl 2 , 0.4 /xM of each primer, 200 /iM of each of the four dNTPs 
and 0.5 unit of Tag DNA polymerase (Promega) combined with the 
TaqStart™antibody (Clontech Laboratories Inc., Palo Alto, CA). The TaqStart™ 
antibody, which is a neutralizing monoclonal antibody to Tag DNA polymerase, 
was added to all PCR reactions to enhance the specificity and the sensitivity of the 
amplifications (Kellogg et ah, 1994, Biotechniques 16:1134-1137). The treatment 
of the clinical specimens varies with the type of specimen tested, since the 
composition and the sensitivity level required are different for each specimen type. 
It consists in a rapid protocol to lyse the microbial cells and eliminate or neutralize 
PCR inhibitors. For amplification from bacterial or fungal or parasitical cultures or 
from purified genomic DNA, the samples were added directly to the PCR 
amplification mixture without any pre-treatment step. An internal control was 
derived from sequences not found in the target microorganisms or in the human 
genome. The internal control was integrated into all amplification reactions to 
verify the efficiency of the PCR assays and to ensure that significant PCR 
inhibition was absent. Alternatively, an internal control derived from rRNA was 
also useful to monitor the efficiency of microbial lysis protocols. 

PCR reactions were then subjected to thermal cycling (3 min at 94-96°C 
followed by 30 cycles of 1 second at 95 °C for the denaturation step and 30 seconds 
at 50-65°C for the annealing-extension step) using a PTC-200 thermal cycler (MJ 
Research Inc.). The number of cycles performed for the PCR assays varies 
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according to the sensitivity level required. For example, the sensitivity level 
required for microbial detection directly from clinical specimens is higher for 
blood specimens than for urine specimens because the concentration of 
microorganisms associated with a septicemia can be much lower than that 
associated with a urinary tract infection. Consequently, more sensitive PCR assays 

having more thermal cycles are probably required-for- direct-detection from blood 

specimens. Similarly, PCR assays performed directly from bacterial or fungal or 
parasitical cultures may be less sensitive than PCR assays performed directly from 
clinical specimens because the number of target organisms is normally much lower 
in clinical specimens than in microbial cultures. 

The person skilled in the art of DNA amplification knows the existence of 
other rapid amplification procedures such as ligase chain reaction (LCR), 
transcription-mediated amplification (TMA), self-sustained sequence replication 
(3SR), nucleic acid sequence-based amplification (NASBA), strand displacement 
amplification (SDA), branched DNA (bDNA), cycling probe technology (CPT), 
solid phase amplification (SPA), rolling circle amplification technology (RCA), 
solid phase RCA, anchored SDA and nuclease dependent signal amplification 
(NDSA) (Lee et al, 1997, Nucleic Acid Amplification Technologies: Application 
to Disease Diagnosis, Eaton Publishing, Boston, MA; Persing et al, 1993, 
Diagnostic Molecular Microbiology: Principles and Applications, American 
Society for Microbiology, Washington, D.C; Westin et al> 2000, Nat. BiotechnoL 
18:199-204). The scope of this invention is not limited to the use of amplification 
by PCR, but rather includes the use of any rapid nucleic acid amplification method 
or any other procedure which may be used to increase the sensitivity and/or the 
rapidity of nucleic acid-based diagnostic tests. The scope of the present invention 
also covers the use of any nucleic acids amplification and detection technology 
including real-time or post-amplification detection technologies, any amplification 
technology combined with detection, any hybridization nucleic acid chips or arrays 
technologies, any amplification chips or combination of amplification and 
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hybridization chips technologies. Detection and identification by any sequencing 
method is also under the scope of the present invention. 

Any oligonucleotide suitable for the amplification of nucleic acids by 
approaches other than PCR or for DNA hybridization which are derived from the 
species-specific, genus-specific and universal DNA fragments as well as from 
~~~ selected antimicrobial agents resistance or toxin gene sequences" included in this 
document are also under the scope of this invention. 

Detection of amplification products 

Classically, detection of amplification is performed by standard ethidium 
bromide-stained agarose gel electrophoresis. It is clear that other methods for the 
detection of specific amplification products, which may be faster and more 
practical for routine diagnosis, may be used. Such methods may be based on the 
detection of fluorescence after or during amplification. One simple method for 
monitoring amplified DNA is to measure its rate of formation by measuring the 
increase in fluorescence of intercalating agents such as ethidium bromide or 
SYBR® Green I (Molecular Probes). If more specific detection is required, 
fluorescence-based technologies can monitor the appearance of a specific product 
during the reaction. The use of dual-labeled fluorogenic probes such as in the 
TaqMan™ system (Applied Biosystems) which utilizes the 5-3' exonuclease 
activity of the Taq polymerase is a good example (Livak K.J. et al 1995, PCR 
Methods Appl. 4:357-362). TaqMan™ can be performed during amplification and 
this "real-time" detection can be done in a single closed tube hence eliminating 
post-PCR sample handling and consequently preventing the risk of amplicon 
carryover. Several other fluorescence-based detection methods can be performed in 
real-time. Fluorescence resonance energy transfer (FRET) is the principle behind 
the use of adjacent hybridization probes (Wittwer, C.T. et al. 1997. BioTechniques 
22:130-138), molecular beacons (Tyagi S. and Kramer F.R. 1996. Nature 
Biotechnology 14:303-308) and scorpions (Whitcomb et al. 1999. Nature 
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Biotechnology 17:804-807). Adjacent hybridization probes are designed to be 
internal to the amplification primers. The 3' end of one probe is labelled with a 
donor fluorophore while the 5' end of an adjacent probe is labelled with an 
acceptor fluorophore. When the two probes are specifically hybridized in closed 
proximity (spaced by 1 to 5 nucleotides) the donor fluorophore which has been 
excited by an^xternal light source emi^lightthat is absorbed by a second acceptor 
that emit more fluorescence and yields a FRET signal. Molecular beacons possess 
a stem-and-loop structure where the loop is the probe and at the bottom of the stem 
a fluorescent moiety is at one end while a quenching moiety is at the other end. 
The beacons undergo a fluorogenic conformational change when they hybridize to 
their targets hence separating the fluorochrome from its quencher. The FRET 
principle is also used in an air thermal cycler with a built-in fluorometer (Wittwer, 
C.T. et al 1997. BioTechniques 22:130-138). The amplification and detection are 
extremely rapid as reactions are performed in capillaries: it takes only 18 min to 
complete 45 cycles. Those techniques are suitable especially in the case where few 
pathogens are searched for. Boehringer-Roche Inc. sells the LightCycler™, and 
Cepheid makes the SmartCycler. These two apparatus are capable of rapid cycle 
PCR combined with fluorescent SYBR® Green I or FRET detection. We recently 
demonstrated in our laboratory, real-time detection of 10 CFU in less than 40 
minutes using adjacent hybridization probes on the LightCycler™. Methods based 
on the detection of fluorescence are particularly promising for utilization in routine 
diagnosis as they are very rapid, quantitative and can be automated. 

Microbial pathogens detection and identification may also be performed by 
solid support or liquid hybridization using species-specific internal DNA probes 
hybridizing to an amplification product. Such probes may be generated from any 
sequence from our repertory and designed to specifically hybridize to DNA 
amplification products which are objects of the present invention. Alternatively, 
the internal probes for species or genus or family or group detection and 
identification may be derived from the amplicons produced by a universal, family-, 
group-, genus- or species-specific amplification assay(s). The oligonucleotide 
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probes may be labeled with biotin or with digoxigenin or with any other reporter 
molecule (for more details see below the section on hybrid capture). Hybrization 
on a solid support is amendable to miniaturization. 

At present the oligonucleotide nucleic acid microarray technology is 
appealing. Currently, available low to medium density arrays (Heller et al. y An 
integrated microelectronics hybridization system _ for genoirdc research^ and 
diagnostic applications. In: Harrison, D.J., and van den Berg, A., 1998, Micro total 
analysis systems 98, Kluwer Academic Publisher, Dordrecht.) could specifically 
capture fluorescent-labelled amplicons. Detection methods for hybridization are 
not limited to fluorescence; potentiometry, colorimetry and plasmon resonance are 
some examples of alternative detection methods. In addition to detection by 
hybridization, nucleic acid microarrays could be used to perform rapid sequencing 
by hybridization. Mass spectrometry could also be applicable for rapid 
identification of the amplicon or even for sequencing of the amplification products 
(Chiu and Cantor, 1999, Clinical Chemistry 45:1578; Berkenkamp et al, 1998, 
Science 281:260). 

For the future of our assay format, we also consider the major challenge of 
molecular diagnostics tools, Le.: integration of the major steps including sample 
preparation, genetic amplification, detection, data analysis and presentation 
(Anderson et ah, Advances in integrated genetic analysis. In: Harrison, D.J., and 
van den Berg, A., 1998, Micro total analysis systems 98, Kluwer Academic 
Publisher, Dordrecht.). 

To ensure PCR efficiency, glycerol, dimethyl sulfoxide (DMSO) or other 
related solvents can be used to increase the sensitivity of the PCR and to overcome 
problems associated with the amplification of a target DNA having a high GC 
content or forming strong secondary structures (Dieffenbach and Dveksler, 1995, 
PCR Primer: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Plainview, New York). The concentration ranges for glycerol and DMSO are 5- 
15% (v/v) and 3-10% (v/v), respectively. For the PCR reaction mixture, the 
concentration ranges for the amplification primers and MgCl 2 are 0.1-1.5 /xM and 

44 



WO 01/23604 PCT/CAOO/01150 

1.0-10.0 mM, respectively. Modifications of the standard PCR protocol using 
external and nested primers (i.e. nested PCR) or using more than one primer pair 
(i.e. multiplex PCR) may also be used (Persing et al, 1993, Diagnostic Molecular 
Microbiology: Principles and Applications, American Society for Microbiology, 
Washington, D.C.). For more details about the PCR protocols and amplicon 
detection methods, see Examples. 

Hybrid capture and chemiluminescence detection of amplification products 

Hybridization and detection of amplicons by chemiluminescence were 
adapted from Nikiforov et al (1994, PCR Methods and Applications 3:285-291 
and 1995, Anal. Biochem. 227:201-209 ) and from the DIG™ system protocol of 
Boehringer Mannheim Briefly, 50 \i\ of a 25 picomoles solution of capture probe 
diluted in EDC { l-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride} 
are immobilized in each well of 96-wells plates (Microlite™ 2, Dynex) by 
incubation overnight at room temperature. The next day, the plates are incubated 
with a solution of 1% BSA diluted into TNTw (10 mM Tris-HCl, pH 7.5; 150 mM 
NaCl; 0.05% Tween™ 20) for 1 hour at 37 °C. The plates are then washed on a 
Wellwash Ascent™ (Labsystems) with TNTw followed by Washing Buffer (100 
mM maleic acid pH7.5; 150 mM NaCl; 0.3% Tween™ 20). 

The amplicons were labelled with DIG-ll-dUTP during PCR using the PCR 
DIG Labelling Mix from Boehringer Mannheim according to the manufacturer's 
instructions. Hybridization of the amplicons to the capture probes is performed in 
triplicate at stringent temperature (generally, probes are designed to allow 
hybrization at 55 °C, the stringent temperature) for 30 minutes in 1.5 M NaCl; 10 
mM EDTA. It is followed by two washes in 2 X SSC; 0.1% SDS, then by four 
washes in 0.1X SSC; 0.1% SDS at the stringent temperature (55 °C). Detection 
with 1,2 dioxetane chemiluminescent alkaline phosphatase substrates like CSPD® 
(Tropix Inc.) is performed according to the manufacturer's instructions but with 
shorter incubations times and a different antibody concentration. The plates are 
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agitated at each step, the blocking incubation is performed for only 5 minutes, the 
anti-DIG-APl is used at a 1:1000 dilution, the incubation with antibody lasts 15 
minutes, the plates are washed twice for only 5 minutes. Finally, after a 2 minutes 
incubation into the detection buffer, the plates are incubated 5 minutes with 
CSPD® at room temperature followed by a 10 minutes incubation at 37 °C without 
agitation.. Luminous sjgnaLdetection _is_ performed on j Dynex.Micrptiter Plate _ 
Lurninometer using RLU (Relative Light Units). 

Specificity, ubiquity and sensitivity tests for oligonucleotide primers and probes 

The specificity of oligonucleotide primers and probes was tested by 
amplification of DNA or by hybridization with bacterial or fungal or parasitical 
species selected from a panel comprising closely related species and species 
sharing the same anatomo-pathological site (see Annexes and Examples). All of 
the bacterial, fungal and parasitical species tested were likely to be pathogens 
associated with infections or potential contaminants which can be isolated from 
clinical specimens. Each target DNA could be released from microbial cells using 
standard chemical and/or physical treatments to lyse the cells (Sambrook et al y 
1989, Molecular Cloning: A Laboratory Manual, 2 nd ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY) or alternatively, genomic DNA 
purified with the GNOME™ DNA kit (BiolOl, Vista, CA) was used. 
Subsequently, the DNA was subjected to amplification with the primer pairs. 
Specific primers or probes amplified only the target microbial species, genus, 
family or group. 

Oligonucleotides primers found to amplify specifically the target species, 
genus, family or group were subsequently tested for their ubiquity by amplification 
(i.e. ubiquitous primers amplified efficiently most or all isolates of the target 
species or genus or family or group). Finally, the sensitivity of the primers or 
probes was determined by using 10-fold or 2-fold dilutions of purified genomic 
DNA from the targeted microorganism. For most assays, sensitivity levels in the 
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range of 1-100 copies were obtained. The specificity, ubiquity and sensitivity of 
the PCR assays using the selected amplification primer pairs were tested either 
directly from cultures of microbial species or from purified microbial genomic 
DNA. 

Probes were tested in hybrid capture assays as described above. An 
_^gonucleotide probe w^ considered specific only when it hybjridjzed splelyjo, 
DNA from the species or genus or family or group from which it was selected. 
Oligonucleotide probes found to be specific were subsequently tested for their 
ubiquity (i.e. ubiquitous probes detected efficiently most or all isolates of the target 
species or genus or family or group) by hybridization to microbial DNAs from 
different clinical isolates of the species or genus or family or group of interest 
including ATCC reference strains. Similarly, oligonucleotide primers and probes 
could be derived from antimicrobial agents resistance or toxin genes which are 
objects of the present invention. 

Reference strains 

The reference strains used to build proprietary tuf, atpD and recA sequence 
data subrepertories, as well as to test the amplification and hybridization assays 
were obtained from (i) the American Type Culture Collection (ATCC), (ii) the 
Laboratoire de sante publique du Quebec (LSPQ), (iii) the Centers for Disease 
Control and Prevention (CDC), (iv) the National Culture Type Collection (NCTC) 
and (v) several other reference laboratories throughout the world. The identity of 
our reference strains was confirmed by phenotypic testing and reconfirmed by 
analysis of tuf, atpD and recA sequences (see Example 13). 

Antimicrobial agents resistance genes 

Antimicrobial resistance complicates treatment and often leads to therapeutic 
failures. Furthermore, overuse of antibiotics inevitably leads to the emergence of 
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microbial resistance. Our goal is to provide clinicians, in approximately one hour, 
the needed information to prescribe optimal treatments. Besides the rapid 
identification of negative clinical specimens with DNA-based tests for universal 
algal, archaeal, bacterial, fungal or parasitical detection and the identification of 
the presence of a specific pathogen in the positive specimens with species- and/or 
^genus- and/of family- an tests7 clinicians also need 

timely information about the ability of the microbial pathogen to resist antibiotic 
treatments. We feel that the most efficient strategy to evaluate rapidly microbial 
resistance to antimicrobials is to detect directly from the clinical specimens the 
most common and clinically important antimicrobial agents resistance genes (i.e. 
DNA-based tests for the specific detection of antimicrobial agents resistance 
genes). Since the sequence from the most important and common antimicrobial 
agents resistance genes are available from public databases, our strategy is to use 
the sequence from a portion or from the entire resistance gene to design specific 
oligonucleotide primers or probes which will be used as a basis for the 
development of sensitive and rapid DNA-based tests. The list of each of the 
antimicrobial agents resistance genes selected on the basis of their clinical 
relevance (i.e. high incidence and importance) is given in Table 5; descriptions of 
the designed amplification primers and internal probes are given in Annexes 
XXXIV-XXXVII, XXXIX, XLV, and L-LI. Our approach is unique because the 
antimicrobial agents resistance genes detection and the microbial detection and 
identification can be performed simultaneously, or independently, or sequentially 
in multiplex or parallel or sequential assays under uniform PCR amplification 
conditions. These amplifications can also be done separately. 

Toxin genes 

Toxin identification is often very important to prescribe optimal treatments. 
Besides the rapid identification of negative clinical specimens with DNA-based 
tests for universal bacterial detection and the identification of the presence of a 
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specific pathogen in the positive specimens with species- and/or genus- and/or 
family- and/or group-specific DNA-based tests, clinicians sometimes need timely 
information about the ability of certain bacterial pathogens to produce toxins. 
Since the sequence from the most important and common bacterial toxin genes are 
available from public databases, our strategy is to use the sequence from a portion 

— or-from-the entire toxin gene to-design-specific oligonucleotide primers or -probes 

which will be used as a basis for the development of sensitive and rapid DNA- 
based tests. The list of each of the bacterial toxin genes selected on the basis of 
their clinical relevance (i.e. high incidence and importance) is given in Table 6; 
descriptions of the designed amplification primers and internal probes are given in 
Annexes XXII, XXXII and XXXIII. Our approach is unique because the toxin 
genes detection and the bacterial detection and identification can be performed 
simultaneously, or independently, or sequentially, in multiplex or parallel or 
sequential assays under uniform PCR amplification conditions. These 
amplifications can also be done separately. 

Universal bacterial detection 

In the routine microbiology laboratory, a high percentage of clinical 
specimens sent for bacterial identification are negative by culture. Testing clinical 
samples with universal amplification primers or universal probes to detect the 
presence of bacteria prior to specific identification and screening out the numerous 
negative specimens is thus useful as it reduces costs and may rapidly orient the 
clinical management of the patients. Several amplification primers and probes were 
therefore synthesized from highly conserved portions of bacterial sequences from 
the tuf y atpD and recA nucleic acids and/or sequences. The universal primers 
selection was based on a multiple sequence alignment constructed with sequences 
from our repertory. 

All computer analysis of amino acid and nucleotide sequences were 
performed by using the GCG programs. Subsequently, optimal PCR primers for 
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the universal amplification of bacteria were selected with the help of the Oligo™ 
program. The selected primers are degenerated at several nucleotide positions and 
contain several inosines in order to allow the amplification of all clinically relevant 
bacterial species. Inosine is a nucleotide analog able to specifically bind to any of 
the four nucleotides A, C, G or T. Degenerated oligonucleotides consist of an 
oligonucleotide mix having two or more of the four nucleotides A, C, G or T at the 
site of mismatches. The inclusion of inosine and/or of base ambiguities in the 
amplification primers allow mismatch tolerance thereby permitting the 
amplification of a wider array of target nucleotide sequences (Dieffenbach and 
Dveksler, 1995 PCR Primer: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Plainview, NY). 

The amplification conditions with the universal primers are very similar to 
those used for the species- and genus-specific amplification assays except that the 
annealing temperature is slightly lower. The original universal PCR assay 
described in our assigned WO98/20157 (SEQ ID NOs. 23-24 of the latter 
application) was specific and nearly ubiquitous for the detection of bacteria. The 
specificity for bacteria was verified by amplifying genomic DNA isolated from the 
12 fungal species as well as genomic DNA from Leishmania donovani, 
Saccharomyces cerevisiae and human lymphocytes. None of the above eukaryotic 
DNA preparations could be amplified by the universal assay, thereby suggesting 
that this test is specific for bacteria. The ubiquity of the universal assay was 
verified by amplifying genomic DNAs from 1 16 reference strains which represent 
95 of the most clinically relevant bacterial species. These species have been 
selected from the bacterial species listed in Table 4. We found that at least 104 of 
these strains could be amplified. However, the assay could be improved since 
bacterial species which could not be amplified with the original ta/ nucleic acids 
and/or sequences-based assay included species belonging to the following genera: 
Corynebacterium (11 species) and Stenotrophomonas (1 species). Sequencing of 
the tuf genes from these bacterial species and others has been performed in the 
scope of the present invention in order to improve the universal assay. This 
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sequencing data has been used to select new universal primers which may be more 
ubiquitous and more sensitive. Also, we improved our primer and probes design 
strategy by taking into consideration the phylogeny observed in analysing our 
repertory of tuf, atpD and recA sequences. Data from each of the 3 main 
subrepertories (tuf, atpD and recA) was subjected to a basic phylogenic analysis 

using-the Pileup command -from-version-10-of - the GCG-package- (Genetics — 

Computer Group, inc.). This analysis indicated the main branches or phyla 
reflecting the relationships between sequences. Instead of trying to design primers 
or probes able to hybridize to all phyla, we designed primers or probes able to 
hybridize to the main phyla while trying to use the largest phylum possible. This 
strategy should allow less degenerated primers hence improving sensitivity and by 
combining primers in a mutiplex assay, improve ubiquity. Universal primers SEQ 
ID NOs. 643-645 based on tuf sequences have been designed to amplify most 
pathogenic bacteria except Actinomyceteae, Clostridiaceae and the Cytophaga, 
Flexibacter and Bacteroides phylum (pathogenic bacteria of this phylum include 
mostly Bacteroides, Porphyromonas and Prevotella species). Primers to fill these 
gaps have been designed for Actinomyceteae (SEQ ID NOs. 646-648), 
Clostridiaceae (SEQ ID NOs. 796-797, 808-811), and the Cytophaga, Flexibacter 
and Bacteroides phylum (SEQ ID NOs. 649-651), also derived from to/ nucleic 
acids and/or sequences. These primers sets could be used alone or in conjuction to 
render the universal assay more ubiquitous. 

Universal primers derived from atpD sequences include SEQ ID NOs. 562- 
565. Combination of these primers does not amplify human DNA but should 
amplify almost all pathogenic bacterial species except proteobacteria belonging to 
the epsilon subdivision (Campylobacter and Helicobacter), the bacteria from the 
Cytophaga, Flexibacter and Bacteroides group and some actinomycetes and 
corynebacteria. By analysing atpD sequences from the latter species, primers and 
probes to specifically fill these gaps could be designed and used in conjuction with 
primers SEQ ID NOs. 562-565, also derived from atpD nucleic acids and/or 
sequences. 
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In addition, universality of the assay could be expanded by mixing atpD 
sequences-derived primers with tuf sequences-derived primers. Ultimately, even 
reck sequences-derived primers could be added to fill some gaps in the universal 
assay. 

It is important to note that the 95 bacterial species selected to test the ubiquity 
of theTiniversal assay include all of themost" clinically Yele^t'bacterial" species - 
associated with a variety of human infections acquired in the community or in 
hospitals (nosocomial infections). The most clinically important bacterial and 
fungal pathogens are listed in Tables 1 and 2. 

Amino acid sequences derived from tuf, atpD and recA nucleic acids and/or 
sequences 

The amino acid sequences translated from the repertory of tuf, atpD and reck 
nucleic acids and/or sequences are also an object of the present invention. The 
amino acid sequence data will be particularly useful for homology modeling of 
three-dimensional (3D) structure of the elongation factor Tu, elongation factor G, 
elongation factor la, ATPase subunit beta and RecA recombinase. For all these 
proteins, at least one structure model has been published using X-ray diffraction 
data from crystals. Based on those structural informations it is possible to use 
computer sofware to build 3D model structures for any other protein having 
peptide sequence homologies with the known structure (Greer, 1991, Methods in 
Enzymology, 202:239-252; Taylor, 1994, Trends Biotechnol., 12(5):154-158; Sali, 
1995, Cuit. Opin. Biotechnol. 6:437-451; Sanchez and Sali, 1997, Curr. Opin. 
Struct. Biol. 7:206-214; Fischer and Eisenberg, 1999, Curr. Opin. Struct. Biol. 
9:208-211; Guex et a/., 1999, Trends Biochem. Sci. 24: 364-367). Model 
structures of target proteins are used for the design or to predict the behavior of 
ligands and inhibitors such as antibiotics. Since EF-Tu and EF-G are already 
known as antibiotic targets (see above) and since the beta subunit of ATPase and 
RecA recombinase are essential to the survival of the microbial cells in natural 
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conditions of infection, all four proteins could be considered antibiotic targets. 
Sequence data, especially the new data generated by us could be very useful to 
assist the creation of new antibiotic molecules with desired spectrum of activity. In 
addition, model structures could be used to improve protein function for 
commercial purposes such as improving antibiotic production by microbial strains 
or increasing-biomass 
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The following detailed embodiments and appended drawings are provided as 
illustrative examples of his invention, with no intention to limit the scope thereof. 



DESCRIPTION OF THE DRAWINGS 



Figures 1 and 2 illustrate the principal subdivisions of the tuf and aipD sequences 
repertories, respectively. For the design of primers and probes, depending on the 
needs, one may want to use the complete data set illustrated on the top of the 
pyramid or use only a subset illustrated by the different branching points. Smaller 
subdivisions, representing groups, families, genus and species, could even be made 
to extend to the bottom of the pyramid. Because the tuf and atpD sequences are 
highly conserved and evolved with each species, the design of primers and probes 
does not need to include all the sequences within the database or its subdivisions. 
As illustrated in Annexes IV to XX, XXIII to XXXI, XXXVIII and XLII, 
depending on the use, sequences from a limited number of species can be carefully 
selected to represent: i) only the main phylogenetic branches from which the 
intended probes and primers need to be differentiating, and ii) only the species for 
which they need to be matching. However, for ubiquity purposes, and especially 
for primers and probes identifying large groups of species (genus, family, group or 
universal, or sequencing primers), the more data is included into the sequence 
analysis, the better the probes and primers will be suitable for each particular 
intended use. Similarly, for specificity purposes, a larger data set (or repertory) 
ensures optimal primers and probes design by reducing the chance of employing 
nonspecific oligonucleotides. 

Figure 3 illustrates the approach used to design specific amplification primers from 
fusA as well as from the region between the end of fusA and the beginning of tufm 
the streptomycin (str) operon (referred to as the /ksA-fw/intergenic spacer in Table 
7). 

Figures 4 to 6 are illustrations to Example 42, whereas Figures 7 to 10 illustrate 
Example 43. Figures 1 1 and 12 illustrate Example 44. 
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FIGURE LEGENDS 

Figure 3. Schematic organization of universal amplification primers (SEQ ID NOs. 
1221-1229) in the str operon. Amplicon sizes are given in bases pairs. Drawing not 
to scale, as the fusA-tuf intergenic spacer size varies depending on the bacterial 
species. Indicated amplicon lengths are for E. coli. 

Figure 4. Abridged multiple amino acid sequence alignment of the partial tuf gene 
products from selected species illustrated using the program Alscript. Residues 
highly conserved in bacteria are boxed in grey and gaps are represented with dots. 
Residues in reverse print are unique to the enterococcal tufB as well as to 
streptococcal and lactococcal ta/gene products. Numbering is based on E. coli EF- 
Tu and secondary structure elements of £. coli EF-Tu are represented by cylinders 
(cc-helices) and arrows (|3-strands). 

Figure 5. Distance matrix tree of bacterial EF-Tu based on amino acid sequence 
homology. The tree was constructed by the neighbor-joining method. The tree was 
rooted using archeal and eukaryotic EF-loc genes as the outgroup. The scale bar 
represents 5% changes in amino acid sequence, as determined by taking the sum of 
all of the horizontal lines connecting two species. 

Figure 6. Southern hybridization of BgRVXbal digested genomic DNAs of some 
enterococci (except for E. casseliflavus and E. gallinarum whose genomic DNA 
was digested with BamUUPvuIl) using the tuf A gene fragment of E. faecium as 
probes. The sizes of hybridizing fragments are shown in kilobases. Strains tested 
are listed in Table 16. 
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Figure 7. Pantoea and Tatumella species specific signature indel in atpD genes. 
The nucleotide positions given are for E. coli atpD sequence (GenBank accession 
no. V00267). Numbering starts from the first base of the initiation codon. 

Figure 8: Trees based on sequence data from tuf (left side) and atpD (right side). 
_The .phylogenetic- analysis^ was. performed -using the-NeighborJoining -method - 
calculated using the Kimura two-parameter method. The value on each branch 
indicates the occurence (%) of the branching order in 750 bootstrapped trees. 

Figure 9: Phylogenetic tree of members of the family Enterobacteriaceae based on 
tuf (a), atpD (b), and 16S rDNA (c) genes. Trees were generated by neighbor- 
joining method calculated using the Kimura two-parameter method. The value on 
each branch is the percentage of bootstrap replications supporting the branch. 750 
bootstrap replications were calculated. 

Figure 10: Plot of tuf distances versus 16S rDNA distances (a), atpD distances 
versus 16S rDNA distances (b), and atpD distances versus tuf distances (c). 
Symbols: O , distances between pairs of strains belonging to the same species; 
distances between E. coli strains and Shigella strains; distances between pairs 
belonging to the same genus; ■, distances between pairs belonging to different 
genera; A, distances between pairs belonging to different families. 

EXAMPLES AND ANNEXES 

For sake of clarity, here is a list of Examples and Annexes: 

Example 1: Sequencing of bacterial atpD (F-type and V-type) gene fragments. 
Example 2: Sequencing of eukaryotic atpD (F-type and V-type) gene 
fragments. 

Example 3: Sequencing of eukaryotic /w/(EF- 1) gene fragments. 
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Example 4: Sequencing of eukaryotic tuf (organelle origin, M) gene fragments. 
Example 5: Specific detection and identification of Streptococcus agalactiae 
using tuf sequences. 

Example 6: Specific detection and identification of Streptococcus agalactiae 

using atpD sequences. 
"Example 7: ~ "Devdopment~of a"PCR" assay for- detection and identification-of- 

staphylococci at genus and species levels. 
Example 8: Differentiating between the two closely related yeast species 

Candida albicans and Candida dubliniensis. 
Example 9: Specific detection and identification of Entamoeba histolytica. 
Example 10: Sensitive detection and identification of Chlamydia trachomatis. 
Example 11: Genus-specific detection and identification of enterococci. 
Example 12: Detection and identification of the major bacterial platelets 

contaminants using tuf sequences with a multiplex PCR test. 
Example 13: The resolving power of the tuf and atpD sequences databases is 

comparable to the biochemical methods for bacterial identification. 
Example 14: Detection of group B streptococci from clinical specimens. 
Example 15: Simultaneous detection and identification of Streptococcus 

pyogenes and its pyrogenic exotoxin A. 
Example 16: Real-time detection and identification of Shiga toxin-producing 

bacteria. 

Example 17: Development of a PCR assay for the detection and identification of 
staphylococci at genus and species levels and its associated mecA 
gene. 

Example 18: Sequencing of pbpla, pbplb and pbp2x genes of Streptoccoccus 
pneumoniae. 

Example 19: Sequencing of hexA genes of Streptococcus species. 
Example 20: Development of a multiplex PCR assay for the detection of 
Streptococcus pneumoniae and its penicillin resistance genes. 
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Example 21: Sequencing of the vancomycin resistance vanA, vanCl, vanC2 and 
vanC3 genes. 

Example 22: Development of a PCR assay for the detection and identification of 
enterococci at genus and species levels and its associated resistance 
genes vanA and vanB. 

Example 23: "Development of ~ar multiplex- -PGR— assay- for- detection - and 

identification of vancomycin-resistant Enterococcus faecalis, 
Enterococcus faecium, Enterococcus gallinarum, Enterococcus 
casseliflavuSy and Enterococcus flavescens. 

Example 24: Universal amplification involving the EF-G (fusA) subdivision of 
tuf sequences. 

Example 25: DNA fragment isolation from Staphylococcus saprophytics by 

arbitrarily primed PCR. 
Example 26: Sequencing of prokaryotic ta/gene fragments. 
Example 27: Sequencing of procaryotic recA gene fragments. 
Example 28: Specific detection and identification of Escherichia coli/Shigella sp. 

using tuf sequences. 

Example 29: Specific detection and identification of Klebsiella pneumoniae 

using atpD sequences. 
Example 30: Specific detection and identification of Acinetobacter baumanii 

using fa/sequences. 

Example 31: Specific detection and identification of Neisseria gonorrhoeae 

using m/sequences. 
Example 32: Sequencing of bacterial gyrA and parC gene fragments. 
Example 33: Development of a PCR assay for the specific detection and 

identification of Staphylococcus aureus and its quinolone resistance 

genes gyrA and parC. 
Example 34: Development of a PCR assay for the detection and identification of 

Klebsiella pneumoniae and its quinolone resistance genes gyrA and 

parC. 
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Example 35: Development of a PCR assay for the detection and identification of 
Streptococcus pneumoniae and its quinolone resistance genes gyrA 
and parC. 

Example 36: Detection of extended-spectrum TEM-type P-lactamases in 
Escherichia coll 

- Example -37*— Detection —of- -extended-spectmm~SH-V-type™-p4actamases -in - 
Klebsiella pneumoniae. 
Example 38: Development of a PCR assay for the detection and identification of 
Neisseria gonorrhoeae and its associated tetracycline resistance 
gene tetM. 

Example 39: Development of a PCR assay for the detection and identification of 
Shigella sp. and their associated trimethoprim resistance gene 
dhfrla. 

Example 40: Development of a PCR assay for the detection and identification of 
Acinetobacter baumanii and its associated aminoglycoside 
resistance gene aph(3 f )-VIa. 

Example 41: Specific detection and identification of Bacteroides fragilis using 
atpD (V-type) sequences. 

Example 42: Evidence for horizontal gene transfer in the evolution of the 
elongation factor Tu in Enterococci. 

Example 43: Elongation factor Tu (tuf) and the F-ATPase beta-subunit (atpD) as 
phylogenetic tools for species of the family Enter obacteriaceae. 

Example 44: Testing new pairs of PCR primers selected from two species- 
specific genomic DNA fragments which are objects of US patent 
6,001,564, 

Example 45: Testing modified versions of PCR primers derived from the 
sequence of several primers which are objects of US patent 
6,001,564. 
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The various Annexes show the strategies used for the selection of a variety of 
DNA amplification primers, nucleic acid hybridization probes and molecular 
beacon internal probes: 

(i) Annex I shows the amplification primers used for nucleic acid 
amplification-from /w/sequencesr 

(ii) Annex II shows the amplification primers used for nucleic acid 
amplification from atpD sequences. 

(iii) Annex III shows the internal hybridization probes for detection of tuf 
sequences. 

(iv) Annex IV illustrates the strategy used for the selection of the 
amplification primers specific for atpD sequences of the F-type. 

(v) Annex V illustrates the strategy used for the selection of the amplification 
primers specific for atpD sequences of the V-type. 

(vi) Annex VI illustrates the strategy used for the selection of the 
amplification primers specific for the tuf sequences of organelle lineage 
(M, the letter M is used to indicate that in most cases, the organelle is the 
mitochondria). 

(vii) Annex VII illustrates the strategy used for the selection of the 
amplification primers specific for the ta/sequences of eukaryotes (EF-1). 

(viii) Annex VIII illustrates the strategy for the selection of Streptococcus 
agalactiae-specific amplification primers from tuf sequences. 

(ix) Annex IX illustrates the strategy for the selection of Streptococcus 
agalactiae-specific hybridization probes from tuf sequences. 

(x) Annex X illustrates the strategy for the selection of Streptococcus 
agalactiae-specific amplification primers from atpD sequences. 

(xi) Annex XI illustrates the strategy for the selection from tuf sequences of 
Candida albicans! dubliniensis-sptcific amplification primers, Candida 
a/fr/aws-specific hybridization probe and Candida dubliniensis-sptcific 
hybridization probe. 
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(xii) Annex XII illustrates the strategy for the selection of Staphylococcus- 
specific amplification primers from tuf sequences. 

(xiii) Annex XIII illustrates the strategy.for the selection of the Staphylococcus- 
specific hybridization probe from tuf sequences. 

(xiv) Annex XIV illustrates the strategy for the selection of Staphylococcus 
saprophyticuszsptcific and Staphylococcus haemolyticus-sptcifio — 

hybridization probes from tuf sequences. 

(xv) Annex XV illustrates the strategy for the selection of Staphylococcus 
aureus-specific and Staphylococcus epidermidis-sptcific hybridization 
probes from ta/ sequences. 

(xvi) Annex XVI illustrates the strategy for the selection of the Staphylococcus 
/lam/ms-specific hybridization probe from tuf sequences. 

(xvii) Annex XVII illustrates the strategy for the selection of the Enterococcus- 
specific amplification primers from tuf sequences. 

(xviii) Annex XVIII illustrates the strategy for the selection of the Enterococcus 
faecalis-spzcific hybridization probe, of the Enterococcus faecium- 
specific hybridization probe and of the Enterococcus casseliflavus- 
flavescens-gallinarum group-specific hybridization probe from tuf 
sequences. 

(xix) Annex XIX illustrates the strategy for the selection of primers from tuf 
sequences for the identification of platelets contaminants. 

(xx) Annex XX illustrates the strategy for the selection of the universal 
amplification primers from atpD sequences. 

(xxi) Annex XXI shows the amplification primers used for nucleic acid 
amplification from reck sequences. 

(xxii) Annex XXII shows the specific and ubiquitous primers for nucleic acid 
amplification from speA sequences. 

(xxiii) Annex XXIII illustrates the first strategy for the selection of 
Streptococcus pyogenes-sptcific amplification primers from speA 
sequences. 
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(xxiv) Annex XXIV illustrates the second strategy for the selection of 
Streptococcus pyogenes-spzcific amplification primers from speA 
sequences. 

(xxv) Annex XXV illustrates the strategy for the selection of Streptococcus 
pyogenes-spzcific amplification primers from tuf sequences. 

(xxvi) -Annex- XXVI illustrates the- strategy for the- selection of-stxy-specific 

amplification primers and hybridization probe. 

(xxvii) Annex XXVII illustrates the strategy for the selection of .^-specific 
amplification primers and hybridization probe. 

(xxviii) Annex XXVIII illustrates the strategy for the selection of vanA-specific 
amplification primers from van sequences. 

(xxix) Annex XXIX illustrates the strategy for the selection of vanfl-specific 
amplification primers from van sequences. 

(xxx) Annex XXX illustrates the strategy for the selection of vanC-specific 
amplification primers from vanC sequences. 

(xxxi) Annex XXXI illustrates the strategy for the selection of Streptococcus 
pneumoniae-specific amplification primers and hybridization probes from 
pbpla sequences. 

(xxxii) Annex XXXII shows the specific and ubiquitous primers for nucleic acid 
amplification from toxin gene sequences. 

(xxxiii) Annex XXXIII shows the molecular beacon internal hybridization probes 
for specific detection of toxin sequences. 

(xxxiv) Annex XXXIV shows the specific and ubiquitous primers for nucleic acid 
amplification from van sequences. 

(xxxv) Annex XXXV shows the internal hybridization probes for specific 
detection of van sequences. 

(xxxvi) Annex XXXVI shows the specific and ubiquitous primers for nucleic acid 
amplification from pbp sequences. 

(xxxvii) Annex XXXVII shows the internal hybridization probes for specific 
detection of pbp sequences. 
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(xxxviii) Annex XXXVIII illustrates the strategy for the selection of vanAB- 
specific amplification primers and vanA- and vanB- specific hybridization 
probes from van sequences. 

(xxxix) Annex XXXIX shows the internal hybridization probe for specific 
detection of meek. 

(xl) Annex -XL~ shows-the .specific, and -ubiquitous primers for nucleic acid- 
amplification from hexA sequences, 
(xli) Annex XLI shows the internal hybridization probe for specific detection 
of hexA. 

(xlii) Annex XLII illustrates the strategy for the selection of Streptococcus 

pneumoniae species-specific amplification primers and hybridization 

probe from hexA sequences, 
(xliii) Annex XLIII shows the specific and ubiquitous primers for nucleic acid 

amplification from pep sequences, 
(xliv) Annex XLIV shows specific and ubiquitous primers for nucleic acid 

amplification of S. saprophyticus sequences of unknown coding potential, 
(xlv) Annex XLV shows the molecular beacon internal hybridization probes for 

specific detection of antimicrobial agents resistance gene sequences, 
(xlvi) Annex XLVI shows the molecular beacon internal hybridization probe for 

specific detection of S. aureus gene sequences of unknown coding 

potential. 

(xlvii) Annex XLVII shows the molecular beacon hybridization internal probe 

for specific detection of tuf sequences, 
(xlviii) Annex XLVIII shows the molecular beacon internal hybridization probes 

for specific detection of ddl and mtl sequences, 
(xlix) Annex XLIX shows the internal hybridization probe for specific detection 

of S. aureus sequences of unknown coding potential. 
(1) Annex L shows the amplification primers used for nucleic acid 

amplification from antimicrobial agents resistance genes sequences. 
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(li) Annex LI shows the internal hybridization probes for specific detection of 

antimicrobial agents resistance genes sequences, 
(lii) Annex LII shows the molecular beacon internal hybridization probes for 

specific detection of atpD sequences, 
(liii) Annex LIU shows the internal hybridization probes for specific detection 

of ~atpD sequences. 1 " 

0iv) Annex LIVI shows the internal hybridization probes for specific detection 

of ddl and mtl sequences. 

As shown in these Annexes, the selected amplification primers may contain 
inosines and/or base ambiguities. Inosine is a nucleotide analog able to specifically 
bind to any of the four nucleotides A, C, G or T. Alternatively, degenerated 
oligonucleotides which consist of an oligonucleotide mix having two or more of 
the four nucleotides A, C, G or T at the site of mismatches were used. The 
inclusion of inosine and/or of degeneracies in the amplification primers allows 
mismatch tolerance thereby permitting the amplification of a wider array of target 
nucleotide sequences (Dieffenbach and Dveksler, 1995 PCR Primer: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, Plainview, New York). 

EXAMPLES 

EXAMPLE 1: 

Sequencing of bacterial atpD (F-type and V-type) gene fragments . As shown in 
Annex IV, the comparison of publicly available atpD (F-type) sequences from a 
variety of bacterial species revealed conserved regions allowing the design of PCR 
primers able to amplify atpD sequences (F-type) from a wide range of bacterial 
species. Using primers pairs SEQ ID NOs. 566 and 567, 566 and 814, 568 and 567, 
570 and 567, 572 and 567, 569 and 567, 571 and 567, 700 and 567, it was possible 
to Amplify and sequence atpD sequences SEQ ID NOs. 242-270, 272-398, 673- 
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674, 737-767, 866-867, 942-955, 1245-1254, 1256-1265, 1527, 1576, 1577, 1600- 
1604, 1640-1646, 1649, 1652, 1655, 1657, 1659-1660, 1671, 1844-1845, and 
1849-1865. 

Similarly, Annex V shows the strategy to design the PCR primers able to amplify 
_a//?D-sequences-of-the V--type from-a wide range-of archaeal and-bacterial- species. — 
Using primers SEQ ID NOs. 681-683, it was possible to amplify and sequence 
atpD sequences SEQ ID NOs. 827-832, 929-931, 958 and 966. As the gene was 
difficult to amplify for several species, additional amplification primers were 
designed inside the original amplicon (SEQ ID NOs. 1203-1207) in order to obtain 
sequence information for these species. Other primers (SEQ ID NO. 1212, 1213, 
2282-2285) were also designed to amplify regions of the atpD gene (V-type) in 
archaebacteria. 

EXAMPLE 2: 

Sequencing of eukarvotic atpD (F-tvpe and V-tvpe) gene fragments . The 
comparison of publicly available atpD (F-type) sequences from a variety of fungal 
and parasitical species revealed conserved regions allowing the design of PCR 
primers able to amplify atpD sequences from a wide range of fungal and parasitical 
species. Using primers pairs SEQ ID NOs. 568 and 573, 574 and 573, 574 and 708, 
and 566 and 567, it was possible to amplify and sequence atpD sequences SEQ ID 
NOs. 458-497, 530-538, 663, 667, 676, 678-680, 768-778, 856-862, 889-896, 941, 
1638-1639, 1647, 1650-1651, 1653-1654, 1656, 1658, 1684, 1846-1848, and 2189- 
2192. 

In the same manner, the primers described in Annex V (SEQ ID NOs. 681-683) 
could amplify the atpD (V-type) gene from various fungal and parasitical species. 
This strategy allowed to obtain SEQ ID NOs. 834-839, 956-957, and 959-965. 
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Sequencing of eukarvotic tuf(EF-l) gene fragments. As shown in Annex VII, the 
comparison of publicly available tuf(EF-l) sequences from a variety of fungal and 
parasitical species revealed conserved regions allowing the design of PCR primers 

-able-to-amplifym/sequences-from-a-wide range of -fungal -and-parasitical species: 

Using primers pairs SEQ ID NOs. 558 and 559, 813 and 559, 558 and 815, 560 
and 559, 653 and 559, 558 and 655, and 654 and 559, 1999 and 2000, 2001 and 
2003, 2002 and 2003, it was possible to amplify and sequence tuf sequences SEQ 
ID NOs. 399-457, 509-529, 622-624, 677, 779-790, 840-842, 865, 897-903, 1266- 
1287, 1561-1571 and 1685. 

EXAMPLE 4: 

Sequencing of eukarvotic tuf (organelle origin. M) gene fragments. As shown in 
Annex VI, the comparison of publicly available tuf (organelle origin, M) sequences 
from a variety of fungal and parasitical organelles revealed conserved regions 
allowing the design of PCR primers able to amplify tuf sequences of several 
organelles belonging to a wide range fungal and parasitical species. Using primers 
pairs SEQ ID NOs. 664 and 652, 664 and 561, 91 1 and 914, 912 and 914, 913 and 
915, 916 and 561, 664 and 917, it was possible to amplify and sequence tuf 
sequences SEQ ID NOs. 498-508, 791-792, 843-855, 904-910, 1664, 1666-1667, 
1669-1670, 1673-1683, 1686-1689, 1874-1876, 1879, 1956-1960, and 2193-2199. 

EXAMPLE 5: 

Specific detection and identification of Streptococcus agalactiae using tuf 
sequences . As shown in Annex VIII, the comparison of tuf sequences from a 
variety of bacterial species allowed the selection of PCR primers specific for S. 
agalactiae. The strategy used to design the PCR primers was based on the analysis 
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of a multiple sequence alignment of various tuf sequences. The multiple sequence 
alignment includes the tuf sequences of four bacterial strains from the target 
species as well as tuf sequences from other species and bacterial genera, especially 
representatives of closely related species. A careful analysis of this alignment 
allowed the selection of oligonucleotide sequences which are conserved within the 
target species but which discriminate sequences from other species and genera, 
especially from the closely related species, thereby permitting the species-specific, 
ubiquitous and sensitive detection and identification of the target bacterial species. 

The chosen primer pair, oligos SEQ ID NO. 549 and SEQ ID NO. 550, gives an 
amplification product of 252 bp. Standard PCR was carried out using 0.4 /xM of 
each primer, 2.5 mM MgCl 2 , BSA 0.05 mM, IX Taq Buffer (Promega), dNTP 0.2 
mM (Pharmacia), 0,5 U Taq DNA polymerase (Promega) coupled with TaqStart™ 
antibody (Clontech Laboratories Inc., Palo Alto), 1 /d of genomic DNA sample in 
a final volume of 20 fil using a PTC-200 thermocycler (MJ Research Inc.). The 
optimal cycling conditions for maximum sensitivity and specificity were 3 minutes 
at 95 °C for initial denaturation, then forty cycles of two steps consisting of 1 
second at 95 °C and 30 seconds at 62 °C, followed by terminal extension at 72 °C 
for 2 minutes. Detection of the PCR products was made by electrophoresis in 
agarose gels (2 %) containing 0.25 /xg/ml of ethidium bromide. 

Specificity of the assay was tested by adding into the PCR reactions, 0.1 ng of 
genomic DNA from each of the bacterial species listed in Table 8. Efficient 
amplification was observed only for the 5 S. agalactiae strains listed. Of the other 
bacterial species, including 32 species representative of the vaginal flora and 27 
other streptococcal species, only S. acidominimus yielded amplification. The signal 
with 0.1 ng of S. acidominimus genomic DNA was weak and the detection limit for 
this species was 10 pg (corresponding to more than 4000 genome copies) while the 
detection limit for S. agalactiae was 2.5 fg (corresponding to one genome copy) of 
genomic DNA. 
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To increase the specificity of the assay, internal probes were designed for FRET 
(Fluorescence Resonance Energy Transfer) detection using the LightCycler™ 
(Idaho Technology). As illustrated in Annex IX, a multiple sequence alignment of 
streptococcal tuf sequence fragments corresponding to the 252 bp region amplified 
by primers^EQ ID Na was used for die design of 

internal probes TSagHF436 (SEQ ID NO. 582) and TSagHF465 (SEQ ID NO. 
583). The region of the amplicon selected for internal probes contained sequences 
unique and specific to S. agalactiae. SEQ ID NO. 583, the more specific probe, is 
labelled with fluorescein in 3', while SEQ ID NO. 582, the less discriminant probe, 
is labelled with CY5 in 5' and blocked in 3' with a phosphate group. However, 
since the FRET signal is only emitted if both probes are adjacently hybridized on 
the same target amplicon, detection is highly specific. 

Real-time detection of PCR products using the LightCycler™ was carried out 
using 0.4 fiM of each primer (SEQ ID NO. 549-550), 0.2 fiM of each probe (SEQ 
ID NO. 582-583), 2.5 mM MgCl 2 , BSA 450 /ig/ml, IX PC2 Buffer (AB Peptides, 
St-Louis, MO), dNTP 0.2 mM (Pharmacia), 0.5 U KlenTaql™ DNA polymerase 
(AB Peptides) coupled with TaqStart™ antibody (Clontech Laboratories Inc., Palo 
Alto), 0.7 jxl of genomic DNA sample in a final volume of 7 fil using a 
LightCycler thermocycler (Idaho Technology). The optimal cycling conditions for 
maximum sensitivity and specificity were 3 minutes at 94 °C for initial 
denaturation, then forty cycles of three steps consisting of 0 second (this setting 
meaning the LightCycler will reach the target temperature and stay at it for its 
minimal amount of time) at 94 °C, 10 seconds at 64 °C, 20 seconds at 72 °C. 
Amplification was monitored during each annealing steps using the fluorescence 
ratio. The streptococcal species having close sequence homologies with the tuf 
sequence of 5. agalactiae (S: acidominimus, S. anginosus, S. bovis, S. dysgalactiae, 
S. equi, S. ferus, S. gordonii, 5. intermedins, S. parasanguis, S. parauberis, S. 
salivarius, 5. sanguis, S. suis) as well as S. agalactiae were tested in the 
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LightCycler with 0.07 ng of genomic DNA per reaction. Only S. agalactiae 
yielded an amplification signal, hence demonstrating that the assay is species- 
specific. With the LightCycler™ assay using the internal FRET probes, the 
detection limit for S. agalactiae was 1-2 genome copies of genomic DNA. 

-EXAMPLE 6: : 

Specific detection and identification of Streptococcus agalactiae using atpD 
sequences . As shown in Annex X, the comparison of atpD sequences from a 
variety of bacterial species allowed the selection of PCR primers specific for S. 
agalactiae. The primer design strategy is similar to the strategy described in the 
preceding Example except that atpD sequences were used in the alignment. 

Four primers were selected, ASag42 (SEQ ID NO. 627), ASag52 (SEQ ID NO. 
628), ASag206 (SEQ ID NO. 625) and ASag371 (SEQ ID NO. 626). The 
following combinations of these four primers give four amplicons; SEQ ID NO. 
627 + SEQ ID NO. 625 = 190 bp, SEQ ID NO. 628 + SEQ ID NO. 625 = 180 bp, 
SEQ ID NO. 627 + SEQ ID NO. 626 = 355 bp, and SEQ ID NO. 628 + SEQ ID 
NO. 626 = 345 bp. 

Standard PCR was carried out on PTC-200 thermocyclers (MJ Research Inc) using 
0.4 /iM of each primers pair, 2.5 mM MgCl 2 , BSA 0.05 mM, IX taq Buffer 
(Promega), dNTP 0.2 mM (Pharmacia), 0.5 U Taq DNA polymerase (Promega) 
coupled with TaqStart™ antibody (Clontech Laboratories Inc., Palo Alto), 1 fi\ of 
genomic DNA sample in a final volume of 20 fiL. The optimal cycling conditions 
for maximum sensitivity and specificity were adjusted for each primer pair. Three 
minutes at 95 °C for initial denaturation, then forty cycles of two steps consisting 
of 1 second at 95 °C and 30 seconds at the optimal annealing temperature specified 
below were followed by terminal extension at 72 °C for 2 minutes. Detection of 
the PCR products was made by electrophoresis in agarose gels (2 %) containing 
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0.25 /xg/ml of ethidium bromide. Since atpD sequences are relatively more specific 
than tuf sequences, only the most closely related species namely, the steptococcal 
species listed in Table 9, were tested. 

All four primer pairs only amplified the six S. agalactiae strains. With an 
- annealing temperature of 63°C7 the primer pair SEQTD-NO. 627+-SEQ ID-NO.- - 
625 had a sensitivity of 1-5 fg (equivalent to 1-2 genome copies). At 55 °C, the 
primer pair SEQ ID NO. 628 + SEQ ID NO. 625 had a sensitivity of 2.5 fg 
(equivalent to 1 genome copy). At 60 °C, the primer pair SEQ ID NO, 627 + SEQ 
ID NO. 626 had a sensitivity of 10 fg (equivalent to 4 genome copies). At 58 °C, 
the primer pair SEQ ID NO. 628 + SEQ ID NO. 626 had a sensitivity of 2.5-5 fg 
(equivalent to 1-2 genome copies). This proves that all four primer pairs can detect 
5. agalactiae with high specificity and sensitivity. Together with Example 5, this 
example demonstrates that both tuf and atpD sequences are suitable and flexible 
targets for the identification of microorganisms at the species level. The fact that 4 
different primer pairs based on atpD sequences led to efficient and specific 
amplification of S. agalactiae demonstrates that the challenge is to find target 
genes suitable for diagnostic purposes, rather than finding primer pairs from these 
target sequences. 

EXAMPLE 7: 

Development of a PCR assay for detection and identification of staphylococci at 
genus and species levels . 

Materials and Methods 

Bacterial strains. The specificity of the PCR assay was verified by using a panel 
of ATCC (America Type Culture Collection) and DSMZ (Deutsche Sammlung 
von Mikroorganismen und Zellkulturen GmbH; German Collection of 



70 



WO 01/23604 PCT/CA00/01150 

Microorganisms and Cell Cultures) reference strains consisting of 33 gram- 
negative and 47 gram-positive bacterial species (Table 12). In addition, 295 
clinical isolates representing 11 different species of staphylococci from the 
microbiology laboratory of the Centre Hospitalier Universitaire de Quebec, 
Pavilion Centre Hospitalier de l'Universit<§ Laval (CHUL) (Ste-Foy, Quebec, 

Canada) _were- also_ tested, to _further„ validate, the. StaphylococcuszsptcifiC- PCR 

assay. These strains were all identified by using (i) conventional methods or (ii) the 
automated MicroScan Autoscan-4 system equipped with the Positive BP Combo 
Panel Type 6 (Dade Diagnostics, Mississauga, Ontario, Canada). Bacterial strains 
from frozen stocks kept at -80 °C in brain heart infusion (BHI) broth containing 
10% glycerol were cultured on sheep blood agar or in BHI broth (Quelab 
Laboratories Inc, Montreal, Quebec, Canada). 

PCR primers and internal probes. Based on multiple sequence alignments, 
regions of the tuf gene unique to staphylococci were identified. Staphylococcus- 
specific PCR primers TStaG422 (SEQ ID NO. 553) and TStaG765 (SEQ ID NO. 
575) were derived from these regions (Annex XII). These PCR primers are displaced 
by two nucleotide positions compared to original Staphylococcus-sptcific PCR 
primers described in our patent publication WO98/20157 (SEQ ID NOs. 17 and 20 in 
the said patent publication). These modifications were done to ensure specificity and 
ubiquity of the primer pair, in the light of new tuf sequence data revealed in the 
present patent application for several additional staphylococcal species and strains. 

Similarly, sequence alignment analysis were performed to design genus and 
species-specific internal probes (see Annexes XIII to XVI). Two internal probes 
specific for Staphylococcus (SEQ ID NOs. 605-606), five specific for 5. aureus (SEQ 
ID NOs. 584-588), five specific for S. epidermidis (SEQ ID NO. 589-593), two 
specific for S. haemolyticus (SEQ ID NOs, 594-595), three specific for S. hominis 
(SEQ ID NOs. 596-598), four specific for S. saprophyticus (SEQ ID NOs. 599-601 
and 695), and two specific for coagulase-negative Staphylococcus species including 
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S. epidermidis, S. hominis, 5. saprophytics, S. auricularis, S. capitis, S> 
haemolyticus, S. lugdunensis, S. simulans, 5. cohnii and S. warned (SEQ ID NOs. 
1175-1176) were designed. The range of mismatches between the Staphylococcus- 
specific 371 -bp amplicon and each of the 20-mer species-specific internal probes was 
from 1 to 5, in the middle of the probe when possible. No mismatches were present 
in the two Staphylococcus-sptcific probes for The 11 "species Yn^yzedV S.liufeus, ~S7 
auricularis, S. capitis, S. cohnii, S. epidermidis, S. haemolyticus, S. hominis, S. 
lugdunensis, 5. saprophytics, S. simulans and 5. warneri. In order to verify the 
intra-specific sequence conservation of the nucleotide sequence, sequences were 
obtained for the 371 -bp amplicon from five unrelated ATCC and clinical strains for 
each of the species S. aureus, 5. epidermidis, S. haemolyticus, S. hominis and S. 
saprophytics. The Oligo™ (version 5.0) primer analysis software (National 
Biosciences, Plymouth, Minn.) was used to confirm the absence of self- 
complementary regions within and between the primers or probes. When required, 
the primers contained inosines or degenerated nucleotides at one or more variable 
positions. Oligonucleotide primers and probes were synthesized on a model 394 
DNA synthesizer (Applied Biosystems, Mississauga, Ontario, Canada). Detection of 
the hybridization was performed with the DIG-labeled dUTP incorporated during 
amplification with the Staphylococcus-spzcific PCR assay, and the hybridization 
signal was detected with a luminometer (Dynex Technologies) as described above in 
the section on luminescent detection of amplification products. Annexes XIII to XVI 
illustrate the strategy for the selection of several internal probes. 

PCR amplification. For all bacterial species, amplification was performed from 
purified genomic DNA or from a bacterial suspension whose turbidity was adjusted 
to that of a 0.5 McFarland standard, which corresponds to approximately 1.5 x 10 8 
bacteria per ml. One nanogram of genomic DNA or 1 of the standardized bacterial 
suspension was transferred directly to a 19 |il PCR mixture. Each PCR reaction 
contained 50 mM KC1, 10 mM Tris-HCl (pH 9.0), 0.1% Triton X-100, 2.5 mM 
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MgCl2, 0.2 |iM (each) of the two Staphylococcus genus-specific primers (SEQ ID 
NOs. 553 and 575), 200 \M (each) of the four deoxynucleoside triphosphates 
(Pharmacia Biotech), 3.3 (xg/|xl bovine serum albumin (BSA) (Sigma-Aldrich Canada 
Ltd, Oakville, Ontario, Canada), and 0.5 U Taq polymerase (Promega) coupled with 
JTagStart™ Antibody (Clontech). The PCR amplification was performed as follows: 
3 min. at 94 °C for initial denaturation, then forty cycles of two steps consisting of 1 
second at 95 °C and 30 seconds at 55 °C, plus a terminal extension at 72 °C for 2 
minutes. Detection of the PCR products was made by electrophoresis in agarose gels 
(2 %) containing 0.25 /ig/ml of ethidium bromide. Visualization of the PCR products 
was made under UV at 254 nm. 

For determination of the sensitivities of the PCR assays, two-fold dilutions of 
purified genomic DNA were used to determine the minimal number of genome 
copies which can be detected. 

Results 

Amplifications with the Staphylococcus genus-specific PCR assay. The 
specificity of the assay was assessed by performing 30-cycle and 40-cycle PCR 
amplifications with the panel of gram-positive (47 species from 8 genera) and gram- 
negative (33 species from 22 genera) bacterial species listed in Table 12. The PCR 
assay was able to detect efficiently 27 of 27 staphylococcal species tested in both 30- 
cycle and 40-cycle regimens. For 30-cycle PCR, all bacterial species tested other 
than staphylococci were negative. For 40-cycle PCR, Enterococcus faecalis and 
Macrococcus caseolyticus were slightly positive for the Stdphylococcus-specific 
PCR assay. The other species tested remained negative. Ubiquity tests performed on 
a collection of 295 clinical isolates provided by the microbiology laboratory of the 
Centre Hospitalier Universitaire de Quebec, Pavilion Centre Hospitalier de 
l'Universite' Laval (CHUL), including Staphylococcus aureus (n=34), S. auricularis 
(n=2), S. capitis (n=19), S. cohnii (n=5), S. epidermidis (n=18), S. haemolyticus 
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(n=21), S. hominis (n=73), S. lugdunensis (n=17), S. saprophytics (n=6), S, 
simulans (n=3), S. warneri (n=32) and Staphylococcus sp. (n=65), showed a unifonr 
amplification signal with the 30-cycle PCR assays and a perfect relation between the 
genotype and classical identification schemes. 

The sensitivity of the Staphylococcus-sptcific assay with 30-cycle and 40-cyck 
PCR protocols was determined by using purified genomic DNA from the 11 
staphylococcal species previously mentioned. For PCR with 30 cycles, a detection 
limit of 50 copies of genomic DNA was consistently obtained. In order to enhance 
the sensitivity of the assay, the number of cycles was increased. For 40-cycle PCR 
assays, the detection limit was lowered to a range of 5-10 genome copies, depending 
on the staphylococcal species tested. 

Hybridization between the Staphylococcus-sveciric 371-bp amplicon and 
species-specific or genus-specific internal probes. Interspecies polymorphism 
was sufficient to generate species-specific internal probes for each of the principal 
species involved in human diseases (5. aureus, S. epidermidis, S. haemolyticus, S. 
hominis and S. saprophytics). In order to verify the intra-species sequence 
conservation of the nucleotide sequence, sequence comparisons were performed on 
the 371-bp amplicon from five unrelated ATCC and clinical strains for each of the 
5 principal staphylococcal species: S. aureus, 5. epidermidis, S. haemolyticus t S. 
hominis and S. saprophytics. Results showed a high level of conservation of 
nucleotide sequence between different unrelated strains from the same species. 
This sequence information allowed the development of staphylococcal species 
identification assays using species-specific internal probes hybridizing to the 371- 
bp amplicon. These assays are specific and ubiquitous for those five 
staphylococcal species. In addition to the species-specific internal probes, the 
genus-specific internals probes were able to recognize all or most Staphylococcus 
species tested. 
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EXAMPLE 8: 

Differentiating between the two closely related yeast species Candida albicans and 
Candida dubliniensis . It is often useful for the clinician to be able to differentiate 
^between twovery-closely related species-otmicroorganisms. Candida albicans is 
the most important cause of invasive human mycose. In recent years, a very closely 
related species, Candida dubliniensis, was isolated in immunosuppressed patients. 
These two species are difficult to distinguish by classic biochemical methods. This 
example demonstrates the use of tuf sequences to differentiate Candida albicans 
and Candida dubliniensis. PCR primers SEQ ID NOs. 11-12, from previous patent 
publication WO98/20157, were selected for their ability to specifically amplify a 
tuf (elongation factor 1 alpha type) fragment from both species (see Annex XI for 
primer positions). Within this tuf fragment, a region differentiating C albicans and 
C. dubliniensis by two nucleotides was selected and used to design two internal 
probes (see Annex XI for probe design, SEQ ID NOs. 577 and 578) specific for 
each species. Amplification of genomic DNA from C. albicans and C. dubliniensis 
was carried out using DIG-ll-dUTP as described above in the section on 
chemiluminescent detection of amplification products. Internal probes SEQ ID 
NOs. 577 and 578 were immobilized on the bottom of individual microtiter plates 
and hybridization was carried out as described above in the above section on 
chemiluminescent detection of amplification products. Luminometer data showed 
that the amplicon from C. albicans hybridized only to probe SEQ ID NO. 577 
while the amplicon from C. dubliniensis hybridized only to probe SEQ ID NO. 
578, thereby demonstrating that each probe was species-specific. 

EXAMPLE 9: 

Specific identification of Entamoeba histolytica . Upon analysis of tuf (elongation 
factor 1 alpha) sequence data, it was possible to find four regions where 
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Entamoeba histolytica sequences remained conserved while other parasitical and 
eukaryotic species have diverged. Primers TEntG38 (SEQ ID NO. 703), TEntG442 
(SEQ ID NO. 704), TEntG534 (SEQ ID NO. 705), and TEntG768 (SEQ ID NO. 
706) were designed so that SEQ ID NO. 703 could be paired with the three other 
primers. On PTC-200 thermocyclers (MJ Research), the cycling conditions for 
initial" sensitivity "and "spetificity" testing were ~3~ min.- at- 94 -°G for- initial- - 
denaturation, then forty cycles of two steps consisting of 1 second at 95 °C and 30 
seconds at 55 °C, followed by terminal extension at 72 °C for 2 minutes. Detection 
of the PCR products was made by electrophoresis in agarose gels (2 %) containing 
0.25 /xg/ml of ethidium bromide. The three primer pairs could detect the equivalent 
of less than 200 E. histolytica genome copies. Specificity was tested using 0.5 ng 
of purified genomic DNA from a panel of microorganisms including Babesia 
boviSj Babesia microtti, Candida albicans, Crithidia fasciculata, Leishmania 
major, Leishmania hertigi and Neospora caninum. Only E. histolytica DNA could 
be amplified, thereby suggesting that the assay was species-specific. 

EXAMPLE 10: 

Sensitive identification of Chlamydia trachomatis . Upon analysis of tuf sequence 
data, it was possible to find two regions where Chlamydia trachomatis sequences 
remained conserved while other species have diverged. Primers Ctr82 (SEQ ID 
NO. 554) and Ctr249 (SEQ ID NO. 555) were designed. With the PTC-200 
thermocyclers (MJ Research), the optimal cycling conditions for maximum 
sensitivity and specificity were determined to be 3 min. at 94 °C for initial 
denaturation, then forty cycles of two steps consisting of 1 second at 95 °C and 30 
seconds at 60 °C, followed by terminal extension at 72 °C for 2 minutes. Detection 
of the PCR products was made by electrophoresis in agarose gels (2 %) containing 
0.25 /ig/ml of ethidium bromide. The assay could detect the equivalent of 8 C 
trachomatis genome copies. Specificity was tested with 0.1 ng of purified genomic 
DNA from a panel of microorganisms including 22 species commonly encountered 
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in the vaginal flora (Bacillus subtilis, Bacteroides fragilis, Candida albicans, 
Clostridium difficile, Corynebacterium cervicis, Corynebacterium urealyticum, 
Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Fusobacterium 
nucleatum, Gardnerella vaginalis, Haemophilus influenzae, Klebsiella oxytoca, 
^Lactobacillus^ acidophilus, Peptococcus niger, Peptostreptococcus prevotii, 
Porphyromonas asaccharolytica, Prevotella melaninogenica, Propioriibacterium~ 
acnes, Staphylococcus aureus, Streptococcus acidominimus, and Streptococcus 
agalactiae). Only C. trachomatis DNA could be amplified, thereby suggesting that 
the assay was species-specific. 

EXAMPLE 11: 

Genus-specific detection and identification of enterococci . Upon analysis of tuf 
sequence data and comparison with the repertory of tuf sequences, it was possible 
to find two regions where Enterococcus sequences remained conserved while other 
genera have diverged (Annex XVII). Primer pair Encg313dF and Encg599c (SEQ 
ID NOs. 1137 and 1136) was tested for its specificity by using purified genomic 
DNA from a panel of bacteria listed in Table 10. Using the PTC-200 thermocycler 
(MJ Research), the optimal cycling conditions for maximum sensitivity and 
specificity were determined to be 3 min. at 94 °C for initial denaturation, then forty 
cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 55 °C, 
followed by terminal extension at 72 °C for 2 minutes. Detection of the PCR 
products was made by electrophoresis in agarose gels (2 %) containing 0.25 ftg/ml 
of ethidium bromide. Visualization of the PCR products was made under UV at 
254 nm. The 18 enterococcal species listed in Table 10 were all amplified 
efficiently. The only other species amplified were Abiotrophia adiacens, Gemella 
haemolysans and Gemella morbillorum, three gram-positive species. Sensitivity 
tested with several strains of £. casseliflavus, E. faecium, E. faecalis, £. flavescens 
and E. gallinarum and with one strain of each other Enterococcus species listed in 
Table 10 ranged from 1 to 10 copies of genomic DNA. The sequence variation 
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within the 308-bp amplicon was sufficient so that internal probes could be used to 
speciate the amplicon and differenciate enterococci from Abiotrophia adiacens, 
Gemella haemolysans and Gemella morbillorum, thereby allowing to achieve 
excellent specificity. Species-specific internal probes were generated for each of 
the clinically important species, E. faecalis (SEQ ID NO. 1 174), E. faecium (SEQ 
ID ~N07 602); and" the group including- E.-casseliflavus^ E^flavescens and E. 
gallinarum (SEQ ID NO. 1122) (Annex XVIII). The species-specific internal 
probes were able to differentiate their respective Enterococcus species from all 
other Enterococcus species. These assays are sensitive, specific and ubiquitous for 
those five Enterococcus species. 

EXAMPLE 12: 

Identification of the major bacterial platelets contaminants using tuf sequences 
with a multiplex PCR test. Blood platelets preparations need to be monitored for 
bacterial contaminations. The tuf sequences of 17 important bacterial contaminants 
of platelets were aligned. As shown in Annex XIX, analysis of these sequences 
allowed the design of PCR primers. Since in the case of contamination of platelet 
concentrates, detecting all species (not just the more frequently encountered ones) 
is desirable, perfect specificity of primers was not an issue in the design. However, 
sensitivity is important. That is why, to avoid having to put too much degeneracy, 
only the most frequent contaminants were included in primer design, knowing that 
the selected primers would anyway be able to amplify more species than the 17 
used in the design because they target highly conserved regions of tuf sequences. 
Oligonucleotide sequences which are conserved in these 17 major bacterial 
contaminants of platelet concentrates were chosen (oligos Tplaq 769 and Tplaq 
991, respectively SEQ ID NOs. 636 and 637) thereby permitting the detection of 
these bacterial species. However, sensitivity was slightly deficient with 
staphylococci. To ensure maximal sensitivity in the detection of all the more 
frequent bacterial contaminants, a multiplex assay also including oligonucleotide 
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primers targetting the Staphylococcus genera (oligos Stag 422, SEQ ID NO. 553; 
and Stag 765, SEQ ID NO. 575) was developed. The bacterial species detected 
with the assay are listed in Table 14. 

- -The -primer_pairs,_ oligos JEQ JD NO. _636 and SEQ ID NO. 637 that give an 
amplification product of 245 pb, and oligos SEQ ID NO. 553 and SEQ ID NCX 575 
that give an amplification product of 368 pb, were used simultaneously in the 
multiplex PCR assay. Detection of these PCR products was made on the 
LightCycler thermocycler (Idaho Technology) using SYBR® Green I (Molecular 
Probe Inc.). SYBR® Green I is a fluorescent dye that binds specifically to double- 
stranded DNA. 

Huorogenic detection of PCR products with the LightCycler was carried out using 
1.0 nM of both Tplaq primers (SEQ ID NOs. 636-637) and 0.4 of both TStaG 
primers (SEQ ID NOs. 553 and 575), 2.5 mM MgCl 2 , BSA 7.5 fiM . dNTP 0.2 
mM (Pharmacia), 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 0.5 U Taq DNA 
polymerase (Boerhinger Mannheim) coupled with TaqStart™ antibody (Clontech), 
and 0.07 ng of genomic DNA sample in a final volume of 7 fi\. The optimal 
cycling conditions for maximum sensitivity and specificity were 1 minute at 94 °C 
for initial denaturation, then forty-five cycles of three steps consisting of 0 second 
at 95 °C, 5 seconds at 60 °C and 9 seconds at 72 °C. Amplification was monitored 
during each elongation cycle by measuring the level of SYBR® Green 1. However, 
real analysis takes place after PCR. Melting curves are done for each sample and 
transformation of the melting peak allows determination of Tm. Thus primer-dimer 
and specific PCR product are discriminated. With this assay, all prominent 
bacterial contaminants of platelet concentrates listed in Annex XIX and Table 14 
were detected. Sensitivity tests were performed on the 9 most frequent bacterial 
contaminants of platelets. The detection limit was less than 20 genome copies for 
E cloacae, B. cereus, S. choleraesuis and S. marcescens\ less than 15 genome 
copies for P. aeruginosa; and 2 to 3 copies were detected for S. aureus, S. 
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epidermidis, E. coli and K. pneumoniae. Further refinements of assay conditions 
should increase sensitivity levels. 



EXAMPLE 13: 



The resolving power of the tuf and atpD sequences databases is comparable to the" 
biochemical methods for bacterial identification , The present gold standard for 
bacterial identification is mainly based on key morphological traits and batteries of 
biochemical tests. Here we demonstrate that the use of tuf and atpD sequences 
combined with simple phylogenetic analysis of databases formed by these 
sequences is comparable to the gold standard. In the process of acquiring data for 
the tuf sequences, we sequenced the tuf gene of a strain that was given to us 
labelled as Staphylococcus hominis ATCC 35982. That tuf sequence (SEQ ID NO. 
192) was incorporated into the tuf sequences database and subjected to a basic 
phylogenic analysis using the Pileup command from version 10 of the GCG 
package (Genetics Computer Group). This analysis indicated that SEQ ID NO. 192 
is not associated with other £ hominis strains but rather with the S. warneri strains. 
The ATCC 35982 strain was sent to the reference laboratory of the Laboratoire de 
sante publique du Quebec (LSPQ). They used the classic identification scheme for 
staphylococci (Kloos and Schleifer, 1975., J. Clin. Microbiol. 1:82-88). Their 
results shown that although the colonial morphology could correspond to S. 
hominis, the more precise biochemical assays did not. These assays included 
discriminant mannitol, mannose and ribose acidification tests as well as rapid and 
dense growth in deep thioglycolate agar. The LSPQ report identified strain ATCC 
35982 as 5. warneri which confirms our database analysis. The same thing 
happened for S. warneri (SEQ ID NO. 187) which had initially been identified as 
S. haemolyticus by a routine clinical laboratory using a low resolving power 
automated system (MicroScan, AutoScan-4™). Again, the tuf and LSPQ analysis 
agreed on its identification as S. warneri. In numerous other instances, in the 
course of acquiring tuf and atpD sequence data from various species and genera, 
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analysis of our tuf and/or atpD sequence databases permitted the exact 
identification of mislabelled or erroneously identified strains. These results clearly 
demonstrate the usefulness and the high resolving power of our sequence-based 
identification assays using the tuf and atpD sequences databases. 

EXAMPLE 14: 

Detection of group B streptococci from clinical specimens. 

Introduction 

Streptococcus agalactiae, the group B streptococcus (GBS), is responsible for 
a severe illness affecting neonate infants. The bacterium is passed from the healthy 
earner mother to the baby during delivery. To prevent this infection, it is 
recommended to treat expectant mothers susceptible of carrying GBS in their 
vaginal/anal flora. Carrier status is often a transient condition and rigorous 
monitoring requires cultures and classic bacterial identification weeks before 
delivery. To improve the detection and identification of GBS we developped a 
rapid, specific and sensitive PGR test fast enough to be performed right at delivery. 

Materials and Methods 

GBS clinical specimens. A total of 66 duplicate vaginal/anal swabs were 
collected from 41 consenting pregnant women admitted for delivery at the Centre 
Hospitalier Universitaire de Quebec, Pavilion Saint-Fra^ois d' Assise following 
the CDC recommendations. The samples were obtained either before or after 
rupture of membranes. The swab samples were tested at the Centre de Recherche 
en Infectiologie de 1'Universite Laval within 24 hours of collection. Upon receipt, 
one swab was cut and then the tip of the swab was added to GNS selective broth 
for identification of group B streptococci (GBS) by the standard culture methods 
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recommended by the CDC. The other swab was processed following the 
instruction of the IDI DNA extraction kit (Infectio Diagnotics (IDI) Inc.) prior to 
PCR amplification. 

Oligonucleotides. PCR primers, Tsag340 (SEQ ID NO. 549) and Tsag552 
-(SEQJD_NO._ 150) complementary to the regions of the tuf gene unique for GBS 
were designed based upon a multiple sequence alignment using our repertory of tuf 
sequences. Oligo primer analysis software (version 5.0) (National Biosciences) 
was used to analyse primers annealing temperature, secondary structure potential 
as well as mispriming and dimerization potential. The primers were synthesized 
using a model 391 DNA synthesizer (Applied Biosystems). 

A pair of fluorescently labeled adjacent hybridization probes Sag465-F (SEQ 
ID NO. 583) and Sag436-C (SEQ ID NO. 582) were synthesized and purified by 
Operon Technologies. They were designed to meet the recommendations of the 
manufacturer (Idaho Technology) and based upon multiple sequence alignment 
analysis using our repertory of tuf sequences to be specific and ubiquitous for 
GBS. These adjacent probes, which are separated by one nucleotide, allow 
fluorescence resonance energy transfer (FRET), generating an increased 
fluorescence signal when both hybridized simultaneously to their target sequences. 
The probe SEQ ID NO. 583 was labeled with FITC in 3 prime while SEQ ID NO. 
582 was labeled with Cy5 in 5 prime. The Cy5-labeled probes contained a 3- 
blocking phosphate group to prevent extension of the probes during the PCR 
reactions. 

PCR amplification. Conventional amplifications were performed either from 
2 pi of a purified genomic DNA preparation or cell lysates of vaginal/anal 
specimens. The 20 (i\ PCR mixture contained 0.4 pM of each GBS-specific primer 
(SEQ ID NOs. 549-550), 200 pM of each deoxyribonucleotide (Pharmacia 
Biotech), 10 mM Tris-HCl (pH 9.0), 50 mM KC1, 0.1% Triton X-100, 2.5 mM 
MgCl 2 , 3.3 mg/ml bovine serum albumin (BSA) (Sigma), and 0.5 U of Taq 
polymerase (Promega) combined with the TaqStart™ antibody (Clontech). The 
TaqStart™ antibody, which is a neutralizing monoclonal antibody of Taq DNA 
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polymerase, was added to all PCR reactions to enhance the efficiency of the 
amplification. The PCR mixtures were subjected to thermal cycling (3 min at 95 
°C and then 40 cycles of 1 s at 95 °C, and 30 s at 62 °C with a 2-min final 
extension at 72 °C) with a PTC-200 DNA Engine thermocycler (MJ research). The 
-PCR-amplified.reaction mixture was resolved by agarose gel electrophoresis. 

The LightCycler™ PCR amplifications were performed with 1 /if of a 
purified genomic DNA preparation or cell lysates of vaginal/anal specimens. The 
lO/il amplification mixture consisted of 0.4 /*M each GBS-specific primer (SEQ 
ID NOs. 549-550), 200 fiM each dNTP, 0.2 (iM each fluorescently labeled probe 
(SEQ ID NOs. 582-583), 300 /ig/ml BSA (Sigma), and 1 /tl of lOx PC2 buffer 
(containing 50 mM Tris-HCl (pH 9.1), 16 mM ammonium sulfate, 3.5 mM Mg 2+ , 
and 150 fig/ml BSA) and 0.5 U KlenTaql™ (AB Peptides) coupled with 
TaqStart™ antibody (Clontech). KlenTaql™ is a highly active and more heat- 
stable DNA polymerase without 5'-exonuclease activity. This prevents hydrolysis 
of hybridized probes by the 5' to 3' exonuclease activity. A volume of 7 \l\ of the 
PCR mixture was transferred into a composite capillary tube (Idaho Technology). 
The tubes were then centrifuged to move the reaction mixture to the tips of the 
capillaries and then cleaned with optical-grade methanol. Subsequently the 
capillaries were loaded into the carousel of a LC32 LightCycler™ (Idaho 
Technology), an instrument that combines rapid-cycle PCR with fluorescence 
analysis for continuous monitoring during amplification. The PCR reaction 
mixtures were subjected to a denaturation step at 94 °C for 3 min followed by 45 
cycles of 0 s at 94 °C, 20 s at 64 °C and 10 s at 72 °C with a temperature transition 
rate of 20 °C/s. Fluorescence signals were obtained at each cycle by sequentially 
positioning each capillary on the carousel at the focus of optical elements affiliated 
to the built-in fluorimeter for 100 milliseconds. Complete amplification and 
analysis required about 35 min. 

Specificity and sensitivity tests. The specificity of the conventional and 
LightCycler™ PCR assays was verified by using purified genomic DNA (0.1 
ng/reaction) from a battery of ATCC reference strains representing 35 clinically 
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relevant gram-positive species (Abiotrophia defectiva ATCC 49176, 
Bifidobacterium breve ATCC 15700, Clostridium difficile ATCC 9689, 
Corynebacterium urealyticum ATCC 43042, Enterococcus casseliflavus ATCC 
25788, Enterococcus durans ATCC 19432, Enterococcus faecalis ATCC 29212, 
-Enterococcus i faecium ATCC 19434, Enterococcus gallinarum ATCC 49573, 
Enterococcus rajfinosus ATCC 49427, Lactobacillus Yeulen " 7LTCC~23273r~ 
Lactococcus lactis ATCC 19435, Listeria monocytogenes ATCC 15313, 
Peptococcus niger ATCC 27731, Peptostreptococcus anaerobius ATCC 27337, 
Peptostreptococcus prevotii ATCC 9321, Staphylococcus aureus ATCC 25923, 
Staphylococcus epidermidis ATCC 14990, Staphylococcus haemolyticus ATCC 
29970, Staphylococcus saprophytics ATCC 15305, Streptococcus agalactiae 
ATCC 27591, Streptococcus anginosus ATCC 33397, Streptococcus bovis ATCC 
33317, Streptococcus constellatus ATCC 27823, Streptococcus dysgalactiae 
ATCC 43078, Streptococcus gordonii ATCC 10558, Streptococcus mitis ATCC 
33399, Streptococcus mutans ATCC 25175, Streptococcus oralis ATCC 35037, 
Streptococcus parauberis ATCC 6631, Streptococcus pneumoniae ATCC 6303, 
Streptococcus pyogenes ATCC 19615, Streptococcus salivarius ATCC 7073, 
Streptococcus sanguinis ATCC 10556, Streptococcus uberis ATCC 19436). These 
microbial species included 15 species of streptococci and many members of the 
normal vaginal and anal floras. In addition, 40 GBS isolates of human origin, 
whose identification was confirmed by a latex agglutination test (Streptex, Murex), 
were also used to evaluate the ubiquity of the assay. 

For determination of the sensitivities (i.e., the minimal number of genome 
copies that could be detected) for conventional and LightCycler™ PCR assays, 
serial 10-fold or 2-fold dilutions of purified genomic DNA from 5 GBS ATCC 
strains were used. 

Results 

Evaluation of the GBS-specific conventional and LightCycler™ PCR 
assays. The specificity of the two assays demonstrated that only DNAs from GBS 
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strains could be amplified. Both PCR assays did not amplify DNAs from any other 
bacterial species tested including 14 streptococcal species other than GBS as well 
as phylogenetically related species belonging to the genera Enterococcus, 
Peptostreptococcus and Lactococcus. Important members of the vaginal or anal 
-_flora,_ including coagulase-negative staphylococci, Lactobacillus sp., and 
Bacteriodes sp. were also negative with the GBS"-specific~PCR "assay; The 
LightCycler™ PCR assays detected only GBS DNA by producing an increased 
fluorescence signal which was interpreted as a positive PCR result. Both PCR 
methods were able to amplify all of 40 GBS clinical isolates, showing a perfect 
correlation with the phenotypic identification methods. 

The sensitivity of the assay was determined by using purified genomic DNA 
from the 5 ATCC strains of GBS. The detection limit for all of these 5 strains was 
one genome copy of GBS. The detection limit of the assay with the LightCycler™ 
was 3.5 fg of genomic DNA (corresponding to 1-2 genome copies of GBS). These 
results confirmed the high sensitivity of our GBS-specific PCR assay. 

Direct Detection of GBS from vaginal/anal specimens. Among 66 
vaginal/anal specimens tested, 1 1 were positive for GBS by both culture and PCR. 
There was one sample positive by culture only. The sensitivity of both PCR 
methods with vaginal/anal specimens for identifying colonization status in 
pregnant women at delivery was 91.7% when compared to culture results. The 
specificity and positive predictive values were both 100% and the negative 
predictive value was 97.8%. The time for obtaining results was approximately 45 
min for LightCycler™ PCR, approximately 100 min for conventional PCR and 48 
hours for culture. 

Conclusion 

We have developed two PCR assays (conventional and LightCycler™) for the 
detection of GBS, which are specific (i.e., no amplification of DNA from a variety 
of bacterial species other than GBS) and sensitive (i.e., able to detect around 1 
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genome copy for several reference ATCC strains of GBS). Both PCR assays are 
able to detect GBS directly from vaginal/anal specimens in a very short turnaround 
time. Using the real-time PCR assay on LightCycler™, we can detect GBS 
carriage in pregnant women at delivery within 45 minutes. 



EXAMPLE 15: 

Simultaneous detection and identification of Streptococcus pyogenes and its 
pvrogenic exotoxin A. The rapid detection of Streptococcus pyogenes and of its 
pyrogenic exotoxin A is of clinical importance. We developed a multiplex assay 
which permits the detection of strains of 5. pyogenes carrying the pyrogenic toxin 
A gene, which is associated with scarlet fever and other pathologies. In order to 
specifically detect S. pyogenes, nucleotide sequences of the pyrrolidone carboxylyl 
peptidase (pep) gene were aligned to design PCR primers Spy291 (SEQ ID NO. 
1211) and Spy473 (SEQ ID NO. 1210). Next, we designed primers for the specific 
detection of the pyrogenic exotoxin A. Nucleotide sequences of the speA gene, 
carried on the bacteriophage T12, were aligned as shown in Annex XXIII to design 
PCR primers Spytx814 (SEQ ID NO. 994) and Spytx 927 (SEQ ID NO. 995). 

The primer pairs: oligos SEQ ID NOs. 1210-1211, yielding an amplification 
product of 207 bp, and oligos SEQ ID NOs. 994-995, yielding an amplification 
product of 1 35 bp, were used in a multiplex PCR assay. 

PCR amplification was carried out using 0.4 /iM of both pairs of primers, 2.5 mM 
MgCl 2 , BSA 0.05 pM , dNTP 0.2 /xM (Pharmacia), lOmM Tris-HCl (pH 9.0), 
0.1% Triton X-100, 2.5 mM MgCl 2 , 0.5 U Taq DNA polymerase (Promega) 
coupled with TaqStart™ antibody (Clontech Laboratories Inc.), and 1 /tl of 
genomic DNA sample in a final volume of 20 fi\. PCR amplification was 
performed using a PTC-200 thermal cycler (MJ Research). The optimal cycling 
conditions for maximum specificity and sensitivity were 3 minutes at 94 °C for 



86 



( 

WO 01/23604 PCT/CAOO/01150 

initial denaturation, then forty cycles of two steps consisting of 1 second at 95 °C 
and 30 seconds at 63 °C, followed by a final step of 2 minutes at 72 °C. Detection 
of the PCR products was made by electrophoresis in agarose gels (2 %) containing 
0.25 fig/ml of ethidium bromide. Visualization of the PCR products was made 
- under UV-at-25-4nm. 

The detection limit was less than 5 genome copies for both S. pyogenes and its 
pyrogenic exotoxin A. The assay was specific for pyrogenic exotoxin A-producing 
S. pyogenes: strains of the 27 other species of Streptococcus tested, as well as 20 
strains of various gram-positive and gram-negative bacterial species were all 
negative. 

A similar approach was used to design an alternative set of jpeA-specific primers 
(SEQ ID NOs. 996 to 998, see Annex XXIV). In addition, another set of primers 
based on the /w/gene (SEQ ID NOs. 999 to 1001, see Annex XXV) could be used 
to specifically detect Streptococcus pyogenes. 

EXAMPLE 16: 

Real-time detection and identification of Shiga toxin-producing bacteria. Shiga 
toxin-producing Escherichia coli and Shigella dysenteriae cause bloody diarrhea. 
Currently, identification relies mainly on the phenotypic identification of S. 
dysenteriae and E. coli serotype 0157:H7. However, other serotypes of E. coli are 
increasingly found to be producers of type 1 and/or type 2 Shiga toxins. Two pairs 
of PCR primers targeting highly conserved regions present in each of the Shiga 
toxin genes stxj and stx 2 were designed to amplify all variants of those genes (see 
Annexes XXVI and XXVII). The first primer pair, oligonucleotides 1SLT224 
(SEQ ID NO. 1081) and 1SLT385 (SEQ ID NO. 1080), yields an amplification 
product of 186 bp from the stxj gene. For this amplicon, the ISLTBl-Fam (SEQ 
ID NO. 1084) molecular beacon was designed for the specific detection of stxj 
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using the fluorescent label 6-carboxy-fluorescein. The lSltSl-FAM (SEQ ID NO. 
2012) molecular scorpion was also designed as an alternate way for the specific 
detection of stxj. A second pair of PCR primers, oligonucleotides 2SLT537 (SEQ 
ID NO. 1078) and 2SLT678b (SEQ ID NO. 1079), yields an amplification product 
of r Wbp ffonrthe stx 2 gene.-Molecular beacon 2SLTB1-Tet (SEQ ID NO. 1085) 
was designed for the specific detection of stx 2 using the fluorescent label 5- 
tetrachloro-fluorescein. Both primer pairs were combined in a multiplex PCR 
assay. 

PCR amplification was carried out using 0.8 /iM of primer pair SEQ ID NOs. 
1080-1081, 0.5 iiM of primer pair SEQ ID NOs. 1078-1079, 0.3 /zM of each 
molecular beacon, 8 mM MgCl 2 , 490 ng/mL BSA, 0.2 mM dNTPs (Pharmacia), 
50 mM Tris-HCl, 16 mM NH 4 S0 4 , IX TaqMaster (Eppendorf), 2.5 U KlenTaql 
DNA polymerase (AB Peptides) coupled with TaqStart™ antibody (Clontech 
Laboratories Inc.), and 1 /zl of genomic DNA sample in a final volume of 25 ill. 
PCR amplification was performed using a SmartCycler thermal cycler (Cepheid). 
The optimal cycling conditions for maximum sensitivity and specificity were 60 
seconds at 95 °C for initial denaturation, then 45 cycles of three steps consisting of 
10 seconds at 95 °C, 15 seconds at 56 °C and 5 seconds at 72 °C. Detection of the 
PCR products was made in real-time by measuring the fluorescent signal emitted 
by the molecular beacon when it hybridizes to its target at the end of the annealing 
step at 56 °C. 

The detection limit was the equivalent of less than 5 genome copies. The assay was 
specific for the detection of both toxins, as demonstrated by the perfect correlation 
between PCR results and the phenotypic characterization performed using 
antibodies specific for each Shiga toxin type. The assay was successfully 
performed on several Shiga toxin-producing strains isolated from various 
geographic areas of the world, including 10 0157:H7 E. coli, 5 non-0157:H7 E. 
coli and 4 S. dysenteriae. 
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EXAMPLE 17: 

Development of a PCR assay for the detection and identification of staphylococci 
^ at-genus and- species levels ancHts associated mecA gene . The Staphylococcus- 
specific PCR primers described in Example 7 (SEQlD NOs~~553 arfd~575) were- 
used in multiplex with the mecA-specific PCR primers and the S. aureus-spzcific 
primers described in our assigned US patent no. 5,994,066 (SEQ ID NOs. 261 and 
262 for mecA and SEQ ID NOs. 152 and 153 for S.aureus in the said patent). 
Sequence alignment analysis of 10 publicly available mecA gene sequences 
allowed to design an internal probe specific to mecA (SEQ ID NO. 1177). An 
internal probe was also designed for the 5. awreitf-specific amplicon (SEQ ID NO 
1234). PCR amplification and agarose gel electrophoresis of the amplified products 
were performed as described in Example 7, with the exception that 0.4 /iM (each) 
of the two Staphylococcus-spzcific primers (SEQ ID NOs. 553 and 575) and 0.4 
fiM (each) of the mecA-specific primers and 0.4 /xM (each) of the 5. aureus- 
specific primers were used in the PCR mixture. The specificity of the multiplex 
assay with 40-cycle PCR protocols was verified by using purified genomic DNA 
from five methicillin-resistant and fifteen methicillin- sensitive staphylococcal 
strains. The sensitivity of the multiplex assay with 40-cycle PCR protocols was 
determined by using purified genomic DNA from twenty-three methicillin- 
resistant and twenty-eight methicillin-sensitive staphylococcal strains. The 
detection limit was 2 to 10 genome copies of genomic DNA, depending on the 
staphylococcal species tested. Furthermore, the mecA-specific internal probe, the 5. 
aurewj-specific internal probe and the coagulase-negative staphylococci-specific 
internal probe (described in Example 7) were able to recognize twenty-three 
methicillin-resistant staphylococcal strains and twenty-eight methicillin-sensitive 
staphylococcal strains with high sensitivity and specificity. 
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The format of the assay is not limited to the one described above. A person skilled 
in the art could adapt the assay for different formats such as PCR with real-time 
detection using molecular beacon probes. Molecular beacon probes designed to be 
used in this assay include, but are not limited to, SEQ ID NO. 1232 for detection of 
~~the 5-awr^-specific..amplicon,^EQ ID NO. 1233 for detection of coagulase- 
negative staphylococci and SEQ ID NO. 1231 for detection of meek. 

Alternatively, a multiplex PCR assay containing the Staphylococcus-specific PCR 
primers described in Example 7 (SEQ ID NOs. 553 and 575) and the meek- 
specific PCR primers described in our assigned US patent no. 5,994,066 (SEQ ID 
NOs. 261 and 262 in the said patent) were developed. PCR amplification and 
agarose gel electrophoresis of the amplified products were performed as described 
in Example 7, with the exception that 0.4 ptM (each) of the Staphylococcus-spzcific 
primers (SEQ ID NOs. 553 and 575) and 0.4 (M (each) of the meck-spzcific 
primers described in our assigned US patent no. 5,994,066 (SEQ ID NOs. 261 and 
262 in the said patent) were used in the PCR mixture. The sensitivity of the 
multiplex assay with 40-cycle PCR protocols was determined by using purified 
genomic DNA from two methicillin-resistant and five methicillin-sensitive 
staphylococcal strains. The detection limit was 2 to 5 copies of genomic DNA, 
depending on the staphylococcal species tested. The specificity of the multiplex 
PCR assay coupled with capture-probe hybridization was tested with two strains of 
methicillin-resistant S. aureus, two strains of methicillin-sensitive S. aureus and 
seven strains of methicillin-sensitive coagulase-negative staphylococci. The mecA- 
specific internal probe (SEQ ID NO. 1177) and the S. aurewj-specific internal 
probe (SEQ ID NO. 587) described in Example 7 were able to recognize all the 
strains with high specificity showing a perfect correlation with susceptibility to 
methicillin. The sensitivity of the PCR assay coupled with capture-probe 
hybridization was tested with one strain of methicillin-resistant S. aureus. The 
detection limit was around 10 copies of genomic DNA. 
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EXAMPLE 18: 
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1019-1033, and pbp2x sequences SEQ ID nus. 
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(SEQ ID NOs. 1 127-1 128, 1 144-1 145, 1 148-1 149) were also designed and used to 
complete the sequencing of pbpla, pbp2b and pbp2x amplification products. The 
described primers (SEQ ID NOs. 1125 and 1126, SEQ ID NOs. 1142 and 1143, 
SEQ ID NOs. 1146 and 1147, SEQ ID NOs. 1127-1128, 1144-1145, 1148-1149) 
represent a powerful tool for generating new pbp sequences for design of primers 
and/or probes for detectionof (3-lactam resistance inSrpneumoniae. 

EXAMPLE 19: 

Sequencing of hexA genes of Streptococcus species . The hexA sequence of S. 
pneumoniae described in our assigned US patent no. 5,994,066 (SEQ ID NO. 31 in 
the said patent, SEQ ID NO. 1 1 83 in the present application) allowed the design of 
a PCR primer (SEQ ID NO. 1 182) which was used with primer Spnl401 described 
in our assigned US patent no. 5,994,066 (SEQ ID NO. 156 in the said patent, SEQ 
ID NO. 1 179 in the present application) to generate a database for hexA sequences 
that can be used to design primers and/or probes for the specific identification and 
detection of S. pneumoniae (Annex XLII). Using primers SEQ ID NO. 1179 and 
SEQ ID NO. 1182 (Annex XLII), it was possible to amplify and determine the 
hexA sequence from 5. pneumoniae (4 strains) (SEQ ID NOs. 1 184-1 187), 5. mitis 
(three strains) (SEQ ID NOs. 1 189-1 191) and S. oralis (SEQ ID NO. 1 1 88). 

EXAMPLE 20: 

Development of multiplex PCR assays coupled with capture probe hybridization 
for the detection and identification of Streptococcus pneumoniae and its penicillin 
resistance genes . 

Two different assays were developed to identify 5. pneumoniae and its 
susceptibility to penicillin. 
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Bacterial strains. The specificity of the multiplex PCR assay was verified by 
using a panel of ATCC (American Type Culture Collection) reference strains 
consisting of 33 gram-negative and 67 gram-positive bacterial species (Table 13). 
Iff addition, - a~total _ of"98 ~strains~of Sr pneumoniae, ~1"6 "strains of S: mitis ~and~3~ ~" 
strains of S. oralis from the American Type Culture Collection, the microbiology 
laboratory of the Centre Hospitalier Universitaire de Quebec, Pavilion Centre 
Hospitalier de l'Universite Laval (CHUL), (Ste-Foy, Quebec, Canada), the 
Laboratoire de sante publique du Quebec, (Sainte-Anne-de-Bellevue, Quebec, 
Canada), the Sunnybrook and Women's College Health Sciences Centre (Toronto, 
Canada), the Infectious Diseases Section, Department of Veterans Affairs Medical 
Center, (Houston, USA) were also tested to further validate the Streptococcus 
pneumoniae-specific PCR assay. The penicillin MICs (minimal inhibitory 
concentrations) were measured by the broth dilution method according to the 
recommended protocol of NCCLS. 

PCR primers and internal probes. The analysis of hexA sequences from a variety 
of streptococcal species from the publicly avalaible hexA sequence and from the 
database described in Example 19 (SEQ ID NOs. 1 184-1 191) allowed the selection 
of a PCR primer specific to S. pneumoniae, SEQ ID NO. 1181. This primer was 
used with the 5. pneumom'ae-specific primer SEQ ID NO. 1179 to generate an 
amplification product of 241 bp (Annex XLII). The PCR primer SEQ ID NO. 1 181 
is located 127 nucleotides downstream on the hexA sequence compared to the 
original 5. pneumoniae-specific PCR primer Spnl515 described in our assigned 
US patent no. 5,994,066 (SEQ ID NO. 157 in the said patent). These modifications 
were done to ensure the design of the S. pneumoniae-specific internal probe 
according to the new hexA sequences of several streptococcal species from the 
database described in Example 19 (SEQ ID NOs. 1 184-1 191). 
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The analysis of pbpla sequences from S. pneumoniae strains with various levels of 
penicillin resistance from public databases and from the database described in 
Example 18 allowed the identification of amino acid substitutions Ile-459 to Met 
and Ser-462 to Ala that occur in isolates with high-level penicillin resistance 
(MICs > l|ig/ml), and amino acid substitutions Ser-575 to Thr, Gln-576 to Gly and 
Phe-577 to Tyr that are common to all penicillin-resistant isolates with MICs > 
0.25 Hg/ml. As shown in Annex XXXI, PCR primer pair SEQ ID NOs. 1130 
and 1131 were designed to detect high-level penicillin resistance (MICs > lug/ml), 
whereas PCR primer pair SEQ ID NOs. 1129 and 1131 were designed to detect 
intermediate- and high-level penicillin resistance (MICs > 0.25|J.g/ml). 

The analysis of hexA sequences from the publicly avalaible hexA sequence and 
from the database described in Example 19 allowed the design of an internal probe 
specific to 5. pneumoniae (SEQ ID NO. 1180) (Annex XLII). The range of 
mismatches between the S. pneumoniae-specific 241 -bp amplicon was from 2 to 5, 
in the middle of the 19-bp probe. The analysis of pbpla sequences from public 
databases and from the database described in Example 18 allowed the design of 
five internal probes containing all possible mutations to detect the high-level 
penicillin resistance 383-bp amplicon (SEQ ID NOs. 1197, 1217-1220). 
Alternatively, two other internal probes (SEQ ID NOs. 2024-2025) can also be 
used to detect the high-level penicillin resistance 383-bp amplicon. Five internal 
probes containing all possible mutations to detect the 157-bp amplicon which 
includes intermediate- and high-level penicillin resistance were also designed 
(SEQ ID NOs. 1094, 1 192-1193, 1214 and 1216). Design and synthesis of primers 
and probes, and detection of the probe hybridization were performed as described 
in Example 7. Annex XXXI illustrates one of the internal probe for detection of the 
high-level penicillin resistance 383-bp amplicon (SEQ ID NO. 1 197) and one of 
the internal probe for detection of the intermediate- and high-level penicillin 
resistance 157-bp amplicon (SEQ ID NO. 1193). 
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PCR amplification. For all bacterial species, amplification was performed from 
purified genomic DNA using a PTC-200 thermocycler (MJ Research). 1 /d of 
genomic DNA at 0.1 ng//«il, or 1 (i\ of a bacterial lysate, was transferred to a 19 fii 
PCR mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (H 9.0), 
0.1% Triton X-100, 2.5 mM MgCl 2 , 0.1 /iM (each) of the S. pneumoniae-specific 
primers SEQID Na 1 179 and SEQID NO" 1 18 f, 02 JM of prime/SEQlD NOT 

1 129, 0.7 nM of primer SEQ ID NO. 1131, and 0.6 /iM of primer SEQ ID NO. 

1130, 0.05 mM bovine serum albumin (BSA), and 0.5 U Taq polymerase 
(Promega) coupled with TaqStart™ antibody. In order to generate Digoxigenin 
(DIG)-labeled amplicons for capture probe hybridization, 0.1X PCR DIG labeling 
four deoxynucleoside triphosphates mix (Boehringer Mannheim GmbH) was used 
for amplification. 

For determination of the sensitivitiy of the PCR assays, 10-fold dilutions of 
purified genomic DNA were used to determine the minimal number of genome 
copies which can be detected. 

Capture probe hybridization. The DIG-labeled amplicons were hybridized to the 
capture probes bound to 96-well plates. The plates were incubated with anti-DIG- 
alkaline phosphatase and the chemiluminescence was measured by using a 
luminometer (MLX, Dynex Technologies Inc.) after incubation with CSPD and 
recorded as Relative Light Unit (RLU). The RLU ratio of tested sample with and 
without captures probes was then calculated. A ratio > 2.0 was defined as a 
positive hybridization signal. All reactions were performed in duplicate. 

Results 

Amplifications with the multiplex PCR assay. The specificity of the assay was 
assessed by performing 40-cycle PCR amplifications with the panel of gram- 
positive (67 species from 12 genera) and gram-negative (33 species from 17 
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genera) bacterial species listed in Table 13. All bacterial species tested other than 
S. pneumoniae were negative except S. mitis and S. oralis. Ubiquity tests were 
performed using a collection of 98 S. pneumoniae strains including high-level 
penicillin resistance (n=53), intermediate resistance (n=12) and sensitive (n=33) 
strains. There was a perfect correlation between PCR and standard susceptibility 
testing for 33 penicillin-sensitive isolates. Among 12 S. pneumoniae isolates with 
intermediate penicillin resistance based on susceptibility testing, 11 had 
intermediate resistance based on PCR, but one S. pneumoniae isolate with 
penicillin MIC of 0.25 fig/ml showed a high-level penicillin resistance based on 
genotyping. Among 53 isolates with high-level penicillin resistance based on 
susceptibility testing, 51 had high-level penicillin resistance based on PCR but two 
isolates with penicillin MIC > 1 Jig/ml showed an intermediate penicillin resistance 
based on genotyping. In general, there was a good correlation between the 
genotype and classical culture method for bacterial identification and susceptibility 
testing. 

The sensitivity of the 5. pneumoniae-specific assay with 40-cycle PCR protocols 
was determined by using purified genomic DNA from 9 isolates of S. pneumoniae. 
The detection limit was around 10 copies of genomic DNA for all of them. 

Post-PCR hybridization with internal probes. The specificity of the multiplex 
PCR assay coupled with capture-probe hybridization was tested with 98 strains of 
S. pneumoniae , 16 strains of S. mitis and 3 strains of S. oralis. The internal probe 
specific to 5. pneumoniae (SEQ ID NO. 1180) detected all 98 S. pneunoniae 
strains but did not hybridize to the S. mitis and S. oralis amplicons. The five 
internal probes specific to the high-level resistance amplicon (SEQ ID NOs. 1197, 
1217-1220) detected all amplification patterns corresponding to high-level 
resistance. The two S. pneumoniae strains with penicillin MIC > 1 |Hg/ml that 
showed an intermediate penicillin resistance based on PCR amplification were also 
intermediate resistance based on probe hybridization. Similarly, among 12 strains 
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with intermediate-penicillin resistance based on susceptibility testing, 1 1 showed 
intermediate-penicillin resistance based on hybridization with the five internal 
probes specific to the intermediate and high-level resistance amplicon (SEQ ID 
NOs. 1094, 1192-1193, 1214 and 1216). The strain described above having a 
penicillin MIC of 0.25 ^g/ml which was high-level penicillin resistance based on 
PCFT an^lTfication "was alsoTugh4^ell^sist^ice Fased^^robe^ 
summary, the combination of the multiplex PCR and hybridization assays results 
in a highly specific test for the detection of penicillin-resistant Streptococcus 
pneumoniae. 

ASSAY II: 

Bacterial strains. The specificity of the multiplex PCR assay was verified by 
using the same strains as those used for the development of Assay I. The penicillin 
MICs (minimal inhibitory concentrations) were measured by the broth dilution 
method according to the recommended protocol of NCCLS. 

PCR primers and internal probes. The analysis of pbpla sequences from S. 
pneumoniae strains with various levels of penicillin resistance from public 
databases and from the database described in Example 18 allowed the design of 
two primers located in the constant region of pbpla. PCR primer pair (SEQ ID 
NOs. 2015 and 2016) was designed to amplify a 888-bp variable region of pbpla 
from all S. pneumoniae strains. A series of internal probes were designed for 
identification of the pbpla mutations associated with penicillin resistance in S. 
pneumoniae. For detection of high-level penicillin resistance (MICs > ljig/ml), 
three internal probes were designed (SEQ ID NOs. 2017-2019). Alternatively, ten 
other internal probes were designed that can also be used for detection of high- 
level resistance within the 888-bp pbpla amplicon: (1) three internal probes for 
identification of the amino acid substitutions Thr-371 to Ser or Ala within the 
motif S370TMK (SEQ ID NOs. 2031-2033); (2) two internal probes for detection 
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of the amino acid substitutions Ile-459 to Met and Ser-462 to Ala near the motif 
S428RN (SEQ ID NOs. 1135 and 2026); (3) two internal probes for identification 
of the amino acid substitutions Asn-443 to Asp (SEQ ID NOs. 1134 and 2027); 
and (4) three internal probes for detection of all sequence variations within another 
region (SEQ ID NOs. 2028-2030). For detection of high-level and intermediate 
penicillin - resistance (MICs~>~0:25 ~jlg/ml)7 four infemal _ probes were" desiped 
(SEQ ID NOs. 2020-2023). Alternatively, six other internal probes were designed 
for detection of the four consecutive amino acid substitutions T574SQF to 
A574TGY near the motif K557TG (SEQ ID NOs. 2034-2039) that can also be 
used for detection of intermediate- and high-level resistance within the 888-bp 
pbpla amplicon. 

PCR amplification. For all bacterial species, amplification was performed from 
purified genomic DNA using a PTC-200 thermocycler (MJ Research). 1 fi\ of 
genomic DNA at 0.1 ng//il, or 1 /il of a bacterial lysate, was transferred to a 19 fi\ 
PCR mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 
9.0), 0.1% Triton X-100, 2.5 mM MgCl 2 , 0.08 /*M (each) of the S. pneumoniae- 
specific primers SEQ ID NO. 1 179 and SEQ ID NO. 1 181, 0.4 /iM of the pbpla- 
specific primer SEQ ID NO. 2015, 1.2 /iM of pfcpia-specific primer SEQ ID NO. 
2016, 0.05 mM bovine serum albumin (BSA), and 0.5 U Taq polymerase 
(Promega) coupled with TaqStart™ antibody. In order to generate Digoxigenin 
(DIG)-labeled amplicons for capture probe hybridization, 0.1X PCR DIG labeling 
four deoxynucleoside triphosphates mix (Boehringer Mannheim GmbH) was used 
for amplification. 

For determination of the sensitivities of the PCR assays, 10-fold dilutions of 
purified genomic DNA were used to determine the minimal number of genome 
copies which can be detected. 
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Capture probe hybridization. The DIG-labeled amplicons were hybridized to the 
capture probes bound to 96-well plates as described for Assay I. 



Results 

Amplifications with the multiplex PCR assay; The~specificity of tbe~assay"was 
assessed by performing 40-cycle PCR amplifications with the panel of gram- 
positive (67 species from 12 genera) and gram-negative (33 species from 17 
genera) bacterial species listed in Table 13. All bacterial species tested other than 
S. pneumoniae were negative except S. mitis and S. oralis. Ubiquity tests were 
performed using a collection of 98 S. pneumoniae strains including high-level 

r 

penicillin resistance (n=53), intermediate resistance (n=12) and sensitive (n=33) 
strains. All the above S. pneumoniae strains produced the 888-bp amplicon 
corresponding to pbpla and the 241 -bp fragment corresponding to hexA. 

The sensitivity of the S. pneumoniae-specific assay with 40-cycle PCR protocols 
was determined by using purified genomic DNA from 9 isolates of S. pneumoniae. 
The detection limit was around 10 copies of genomic DNA for all of them. 

Post-PCR hybridization with internal probes. The specificity of the multiplex 
PCR assay coupled with capture-probe hybridization was tested with 98 strains of 
S. pneumoniae, 16 strains of 5". mitis and 3 strains of S. oralis. The internal probe 
specific to S. pneumoniae (SEQ ID NO. 1180) detected all 98 S. pneunoniae 
strains but did not hybridize to the S. mitis and S. oralis amplicons. The three 
internal probes (SEQ ID NOs 2017-2019) specific to high-level resistance detected 
all the 43 strains with high-level penicillin resistance based on susceptibility 
testing. Among 12 isolates with intermediate-penicillin resistance based on 
susceptibility testing, 11 showed intermediate-penicillin resistance based on 
hybridization with 4 internal probes (SEQ ID NOs. 2020-2023) and one strain 



99 



WO 01/23604 PCT/CAOO/01150 

having penicillin MIC of 0.25 |Ug/ml was misclassified as high-level penicillin 
resistance. In summary, the combination of the multiplex PCR and hybridization 
assays results in a highly specific test for the detection of penicillin-resistant 
Streptococcus pneumoniae. 

EXAMPfcE-21: ~ 

Sequencing of the vancomycin resistance vanA. vanCL vanC2 and vanC3 eenes . 
The publicly available sequences of the vanH-vanA-vanX-vanY locus of transposon 
Tnl546 from E. faecalis, vanCl sequence from one strain of E. gallinarum, vanC2 
and vanC3 sequences from a variety of E, casseliflavus and E. flavescens strains, 
respectively, allowed the design of PCR primers able to amplify the vanA, vanCJ, 
vanC2 and vanC3 sequences of several Enterococcus species. Using primer pairs 
van6877 and van9106 (SEQ ID NOs. 1150 and 1155), vanCl-122 and vanCl-1315 
(SEQ ID NOs. 1 1 10 and 1 109), and vanC2C3-l and vanC2C3-1064 (SEQ ID NOs. 
1108 and 1107), it was possible to amplify and determine vanA sequences SEQ ID 
NOs. 1049-1057, vanCl sequences SEQ ID NOs. 1058-1059, vanC2 sequences 
SEQ ID NOs. 1060-1063 and vanC3 sequences SEQ ID NOs. 1064-1066, 
respectively. Four other PCR primers (SEQ ID NOs. 1151-1154) were also 
designed and used to complete the sequencing of vanA amplification products. 

EXAMPLE 22: 

Development of a PCR assay for the detection and identification of enterococci at 
genus and species levels and its associated resistance genes vanA and vanB . The 
comparison of vanA and vanB sequences revealed conserved regions allowing the 
design of PCR primers specific to both vanA and vanB sequences (Annex 
XXXVIII). The PCR primer pair vanAB459 and vanAB830R (SEQ ID NOs. 1 1 12 
and 1111) was used in multiplex with the Enterococcus-specific primers 
Encg313dF and Encg599c (SEQ ID NOs. 1137 and 1136) described in Example 
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11. Sequence alignment analysis of vanA and vanB sequences revealed regions 
suitable for the design of internal probes specific to vanA (SEQ ID NO. 1170) and 
vanB (SEQ ID NO. 1171). PGR amplification and agarose gel electropheresis of 
the amplified products were performed as described in Example 11. The optimal 
cycling conditions for maximum sensitivity and specificity were found to be 3 min. 
— at 94 0 Crfollowed~by forty cycles of two steps consisting of 1 secondat 95~°C~and ~ 
30 seconds at 62°C, plus a terminal extension at 72 °C for 2 minutes. The 
specificity of the multiplex assay with 40-cycle PCR was verified by using 0.1 
nanogram of purified genomic DNA from a panel of bacteria listed in Table 10. 
The sensitivity of the multiplex assay with 40-cycle PCR was verified with three 
strains of E. casseliflavus, eight strains of E. gallinarum, two strains of E. 
flavescens, two vancomycin-resistant strains of E. faecalis and one vancomycin- 
sensiti ve strain of E. faecalis, three vancomycin-resistant strains of £. faecium, one 
vancomycin-sensitive strain of E. faecium and one strain of each of the other 
enterococcal species listed in Table 10. The detection limit was 1 to 10 copies of 
genomic DNA, depending on the enterococcal species tested. The vanA- and vanB- 
specific internal probes (SEQ ID NOs. 1170 and 1171), as well as the E. faecalis- 
and E. /aeriwm-specific internal probes (SEQ ID NOs. 1174 and 602) and the 
internal probe specific to the group including E. casseliflavus, E, gallinarum and E. 
flavescens (SEQ ID NO. 1122) described in Example 11, were able to recognize 
vancomycin-resistant enterococcal species with high sensitivity, specificity and 
ubiquity showing a perfect correlation between the genotypic and phenotypic 
analysis. 

The format of the assay is not limited to the one described above. A person skilled 
in the art could adapt the assay for different formats such as PCR with real-time 
detection using molecular beacon probes. Molecular beacon probes designed to be 
used in this assay include, but are not limited to, SEQ ID NO. 1236 for the 
detection of E. faecalis, SEQ ID NO. 1235 for the detection of E. faecium, SEQ ID 
NO. 1240 for the detection of vanA, and SEQ ID NO. 1241 for the detection of 
vanB. 
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EXAMPLE 23: 

Development of a multiplex PCR assay for detection and identification of 
vancomycin-resistant Enterococcus faecalis, Enterococcus faecium and the group 
including Enterococcus 2allinarum. Enterococcus casseliflavus, and Enterococcus 
flavescens . The analysis of vanA and vanB sequences revealed conserved regions 
allowing design of a PCR primer pair (SEQ ID NOs. 1089 and 1090) specific to 
vanA sequences (Annex XXVIII) and a PCR primer pair (SEQ ID NOs. 1095 and 
1096) specific to vanB sequences (Annex XXIX). The va/iA-specific PCR primer 
pair (SEQ ID NOs. 1089 and 1090) was used in multiplex with the van5-specific 
PCR primer pair described in our assigned US patent 5,994,066 (SEQ ID NOs. 
1095 and 1096 in the present patent and SEQ ID NOs. 231 and 232 in the said 
patent). The comparison of vanCl, vanC2 and vanC3 sequences revealed 
conserved regions allowing design of PCR primers (SEQ ID NOs. 1 101 and 1 102) 
able to generate a 158-bp amplicon specific to the group including E gallinarum, 
E. casseliflavus and E. flavescens (Annex XXX). The vanC-specific PCR primer 
pair (SEQ ID NOs. 1101 and 1102) was used in multiplex with the E. faecalis- 
specific PCR primer pair described in our assigned US patent 5,994,066 (SEQ ID 
NOs. 40 and 41 in the said patent) and with the E. /aecium-specific PCR primer 
pair described in our patent publication WO98/20157 (SEQ ID NOs. 1 and 2 in the 
said publication). For both multiplexes, the optimal cycling conditions for 
maximum sensitivity and specificity were found to be 3 min. at 94 °C, followed by 
forty cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 58 °C, 
plus a terminal extension at 72 °C for 2 minutes. Detection of the PCR products 
was made by electrophoresis in agarose gels (2 %) containing 0.25 /ig/ml of 
ethidium bromide. The vanA-specific PCR primer pair (SEQ ID NOs. 1089 and 
1090), the vanfl-specific primer pair (SEQ ID NOs. 1095 and 1096) and the vanC- 
specific primer pair (SEQ ID NOs. 1 101 and 1 102) were tested for their specificity 
by using 0.1 nanogram of purified genomic DNA from a panel of 5 vancomycin- 
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sensitive Enterococcus species, 3 vancomycin-resistant Enterococcus species, 13 
other gram-positive bacteria and one gram-negative bacterium. Specificity tests 
were performed with the E. /aecmm-specific PCR primer pair described in our 
patent publication WO98/20157 (SEQ ID NOs. 1 and 2 in the said publication) and 
with the E. faecalis-speciTic PCR primer pair described in our assigned US patent 
-5^994,066 (SEQ ID NOs. 40 and 41 -in-the said-patent)-on a-panel of 37- gram-- 
positive bacterial species. All Enterococcus strains were amplified with high 
specificity showing a perfect correlation between the genotypic and phenotypic 
analysis. The sensitivity of the assays was determined for several strains of E. 
gallinarum, E. casseliflavus, E. flavescens and vancomycin-resistant E. faecalis 
and E. faecium. Using each of the E. faecalis- and E. faecium-specific PCR primer 
pairs as well as vanA-, vanB- and wwC-specific PCR primers used alone or in 
multiplex as described above, the sensitivity ranged from 1 to 10 copies of 
genomic DNA. 

The format of the assay is not limited to the one described above. A person skilled 
in the art could adapt the assay for different formats such as PCR with real-time 
detection using molecular beacon probes. Molecular beacon probes designed to be 
used in this assay include, but are not limited to, SEQ ID NO. 1238 for the 
detection of E. faecalis, SEQ ID NO. 1237 for the detection of E. faecium, SEQ ID 
NO. 1239 for the detection of vanA, and SEQ ID NO. 1241 for the detection of 
vanB. 

Alternatively, another PCR assay was developed for the detection of vancomycin- 
resistant E. faecium and vancomycin-resistant E. faecalis. This assay included two 
multiplex: (1) the first multiplex contained the vanA-specific primer pair (SEQ ID 
NOs. 1090-1091) and the vanfl-specific PCR primer pair described in our assigned 
US patent 5,994,066 (SEQ ID NOs. 1095 and 1096 in the present patent and SEQ 
ID NOs. 231 and 232 in the said patent), and (2) the second multiplex contained 
the E. faecalis-sptcific PCR primer pair described in our assigned US patent 
5,994,066 (SEQ ID NOs. 40 and 41 in the said patent) and the E. /aeawm-specific 
PCR primer pair described in our patent publication W09 8/201 57 (SEQ ID NOs. 1 
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and 2 in the said publication). For both multiplexes, the optimal cycling conditions 
for maximum sensitivity and specificity were found to be 3 min. at 94 °C, followed 
by forty cycles of two steps consisting of 1 second at 95 9 C and 30 seconds at 58 
°C, plus a terminal extension at 72 °C for 2 minutes. Detection of the PCR 
products was made by electrophoresis in agarose gels (2 %) containing 0.25 ptg/wl 
of-ethidium bromide-The two multiplexes-were tested for-their-specificity by using- - 
0.1 nanogram of purified genomic DNA from a panel of two vancomycin-sensitive 
E. faecalis strains, two vancomycin-resistant E. faecalis strains, two vancomycin- 
sensitive E.faecium strains, two vancomycin-resistant E.faecium strains, 16 other 
enterococcal species and 31 other gram-positive bacterial species. All the E. 
faecium and £. faecalis strains were amplified with high specificty showing a 
perfect correlation between the genotypic analysis and the susceptibility to 
glycopeptide antibiotics (vancomycin and teicoplanin). The sensitivity of the assay 
was determined for two vancomycin-resistant E. faecalis strains and two 
vancomycin-resistant E. faecium strains. The detection limit was 5 copies of 
genomic DNA for all the strains. 

This multiplex PCR assay was coupled with capture-probe hybridization. Four 
internal probes were designed: one specific to the vanA amplicon (SEQ ID NO. 
2292), one specific to the vanB amplicon (SEQ ID NO. 2294), one specific to the 
£. faecalis amplicon (SEQ ID NO. 2291) and one specific to the E. faecium 
amplicon (SEQ ID NO. 2287). Each of the internal probes detected their specific 
amplicons with high specificity and sensitivity. 

EXAMPLE 24: 

Universal amplification involving the EF-G (fusA) subdivision of tuf sequences . As 
shown in Figure 3, primers SEQ ID NOs. 1228 and 1229 were designed to amplify 
the region between the end oifusA and the beginning of tuf genes in the str operon. 
Genomic DNAs from a panel of 35 strains were tested for PCR amplification with 
those primers. In the initial experiment, the following strains showed a positive 
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result: Abiotrophia adiacens ATCC 49175, Abiotrophia defectiva ATCC 49176, 
Bacillus subtilis ATCC 27370, Closridium difficile ATCC 9689, Enterococcus 
avium ATCC 14025, Enterococcus casseliflavus ATCC 25788, Enterococcus 
cecorum ATCC 43198, Enterococcus faecalis ATCC 29212, Enterococcus 
faecium ATCC 19434, Enterococcus flavescens ATCC 49996, Enterococcus 
^a/Jwar«minXX~49573 

ATCC 11775, Haemophilus influenzae ATCC 9006, Lactobacillus acidophilus 
ATCC 4356, Peptococcus niger ATCC 27731, Proteus mirabilis ATCC 25933, 
Staphylococcus aureus ATCC 43300, Staphylococcus auricularis ATCC 33753, 
Staphylococcus capitis ATCC 27840, Staphylococcus epidemidis ATCC 14990, 
Staphylococcus haemolyticus ATCC 29970, Staphylococcus hominis ATCC 
27844, Staphylococcus lugdunensis ATCC 43809, Staphylococcus saprophyticus 
ATCC 15305, Staphylococcus simulans ATCC 27848, and Staphylococcus 
warneri ATCC 27836. This primer pair could amplify additional bacterial species; 
however, there was no amplification for some species, suggesting that the PCR 
cycling conditions could be optimized or the primers modified. For example, SEQ 
ID NO. 1227 was designed to amplify a broader range of species. 
In addition to other possible primer combinations to amplify the region covering 
fusA and tuf, Figure 3 illustrates the positions of amplification primers SEQ ID 
NOs. 1221-1227 which could be used for universal amplification of fusA segments. 
All of the above mentioned primers (SEQ ID NOs. 1221-1229) could be useful for 
the universal and/or the specific detection of bacteria. 

Moreover, different combinations of primers SEQ ID NOs. 1221-1229, sometimes 
in combination with tuf sequencing primer SEQ ID NO. 697, were used to 
sequence portions of the str operon, including the intergenic region. In this manner, 
the following sequences were generated: SEQ ID NOs. 1518-1526, 1578-1580, 
1786-1821, 1822-1834, 1838-1843, 2184, 2187, 2188, 2214-2249, and 2255-2269. 

EXAMPLE 25: 
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DNA fragment isolation from Staphylococcus saprophvticus by arbitrarily primed 
PCR. DNA sequences of unknown coding potential for the species-specific 
detection and identification of Staphylococcus saprophyticus were obtained by the 
method of arbitrarily primed PCR (AP-PCR). 

AP-PCR is a method which can be used to generate specific DNA probes for 
Inicro^gariis^ eFaT.7 1993, M6Y^\^Ecb\ogy7r.Z43^250).A desmption 
of the AP-PCR protocol used to isolate a species-specific genomic DNA fragment 
from Staphylococcus saprophyticus follows. Twenty different oligonucleotide 
primers of 10 nucleotides in length (all included in the AP-PCR kit OPAD (Operon 
Technologies, Inc., Alameda, CA)) were tested systematically with DNAs from 5 
bacterial strains of Staphylococcus saprophyticus as well as with bacterial strains 
of 27 other staphylococcal (non-5, saprophyticus) species. For all bacterial species, 
amplification was performed directly from one pL (0.1 ng/pL) of purified genomic 
DNA. The 25 pL PCR reaction mixture contained 50 mM KC1, 10 mM Tris-HCl 
(pH 9.0), 0.1% Triton X-100, 2.5 mM MgCl 2 , 1.2 pM of only one of the 20 
different AP-PCR primers OPAD, 200 pM of each of the four dNTPs, 0.5 U of 
Taq DNA polymerase (Promega Corp., Madison, Wis.) coupled with TaqStart™ 
antibody (Clontech Laboratories Inc., Palo Alto, CA). PCR reactions were 
subjected to cycling using a MJ Research PTC-200 thermal cycler as follows: 3 
min at 96 °C followed by 42 cycles of 1 min at 94 °C for the denaturation step, 1 
min at 31 °C for the annealing step and 2 min at 72 °C for the extension step. A 
final extension step of 7 min at 72 °C followed the 42 cycles to ensure complete 
extension of PCR products. Subsequently, twenty microliters of the PCR-amplified 
mixture were resolved by electrophoresis on a 1.5 % agarose gel containing 0.25 
jLig/ml of ethidium bromide. The size of the amplification products was estimated 
by comparison with a 50-bp molecular weight ladder. 

Amplification patterns specific for Staphylococcus saprophyticus were observed 
with the AP-PCR primer OPAD-16 (sequence: 5 , -AACGGGCGTC-3 t ). 
Amplification with this primer consistently showed a band corresponding to a 
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DNA fragment of approximately 380 bp for all Staphylococcus saprophyticus 
strains tested but not for any of the other staphylococcal species tested. 
The band corresponding to the 380 bp amplicon, specific and ubiquitous for 5. 
saprophyticus based on AP-PCR, was excised from the agarose gel and purified 
using the QIAquick™ gel extraction kit (QIAGEN Inc.). The gel-purified DNA 
fragment was cloned into the T/A cloning site of the pCR 2.1™ plasmid vector 
(Invitrogen Inc.) using T4 DNA ligase (New England BioLabs). Recombinant 
plasmids were transformed into E. coli DH5a competent cells using standard 
procedures. All reactions were performed according to the manufacturer's 
instructions. Plasmid DNA isolation was done by the method of Birnboim and 
Doly (Nucleic Acid Res., 1979, 7:1513-1523) for small-scale preparations. All 
plasmid DNA preparations were digested with the EcoRI restriction endonuclease 
to ensure the presence of the approximately 380 bp AP-PCR insert into the 
plasmid. Subsequently, a large-scale and highly purified plasmid DNA preparation 
was performed from two selected clones shown to carry the AP-PCR insert by 
using the QIAGEN plasmid purification kit (midi format). These large-scale 
plasmid preparations were used for automated DNA sequencing. 
The 380 bp nucleotide sequence was determined for three strains of S. 
saprophyticus (SEQ ID NOs. 74, 1093, and 1198). Both strands of the AP-PCR 
insert from the two selected clones were sequenced by the dideoxynucleotide chain 
termination sequencing method with SP6 and T7 sequencing primers by using the 
Applied Biosystems automated DNA sequencer (model 373A) with their PRISM™ 
Sequenase R ™ Terminator Double-stranded DNA Sequencing Kit (Applied 
Biosystems, Foster City, CA). 

Optimal species-specific amplification primers (SEQ ID NOs. 1208 and 1209) 
have been selected from the sequenced AP-PCR Staphylococcus saprophyticus 
DNA fragments with the help of the primer analysis software Oligo™ 5.0 
(National Biosciences Inc.). The selected primers were tested in PCR assays to 
verify their specificity and ubiquity. Data obtained with DNA preparations from 
reference ATCC strains of 49 gram-positive and 31 gram-negative bacterial 
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species, including 28 different staphylococcal species, indicate that the selected 
primer pairs are specific for Staphylococcus saprophytics since no amplification 
signal has been observed with DNAs from the other staphylococcal or bacterial 
species tested. This assay was able to amplify efficiently DNA from all 60 strains 
of S. saprophyticus from various origins tested. The sensitivity level achieved for 
three S. saprophyticus reference ATCC strains was around 6 genome copies. 

EXAMPLE 26: 

Sequencing of prokarvotic tuf gene fragments. The comparison of publicly 
available tuf sequences from a variety of bacterial species revealed conserved 
regions, allowing the design of PCR primers able to amplify tuf sequences from a 
wide range of bacterial species. Using primer pair SEQ ID NOs. 664 and 697, it 
was possible to amplify and determine tuf sequences SEQ ID NOs.: 1-73, 75-241, 
607-618, 621, 662, 675, 717-736, 868-888, 932, 967-989, 992, 1002, 1572-1575, 
1662-1663, 1715-1733, 1835-1837, 1877-1878, 1880-1881, 2183, 2185, 2200, 
2201, and 2270-2272. 

EXAMPLE 27: 

Sequencing of procarvotic recA gene fragments. The comparison of publicly 
available reck sequences from a variety of bacterial species revealed conserved 
regions, allowing the design of PCR primers able to amplify recA sequences from 
a wide range of bacterial species. Using primer pairs SEQ ID NOs. 921-922 and 
1605-1606, it was possible to amplify and determine recA sequences SEQ ID 
NOs.: 990-991, 1003, 1288-1289, 1714, 1756-1763, 1866-1873 and 2202-2212. 

EXAMPLE 28: 
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Specific detection and identification of Escherichia coli/Shigella sp. using tuf 
sequences. The analysis of tuf sequences from a variety of bacterial species 
allowed the selection of PCR primers (SEQ ID NOs. 1661 and 1665) and of an 
internal probe (SEQ ID NO. 2168) specific to Escherichia coli/Shigella sp. The 
strategy used to design the PCR primers was based on the analysis of a multiple 
le^uence~lifi^riment of ^aiious" 7m/ se^encesfTheT mulFp]e~seque^e^ 
included the tuf sequences of Escherichia coli/Shigella sp. as well as tuf sequences 
from other species and bacterial genera, especially representatives of closely 
related species. A careful analysis of this alignment allowed the selection of 
oligonucleotide sequences which are conserved within the target species but which 
discriminate sequences from other species, especially from the closely related 
species, thereby permitting the species-specific and ubiquitous detection and 
identification of the target bacterial species. 

The chosen primer pair, oligos SEQ ID NOs. 1661 and 1665, gives an 
amplification product of 219 bp. Standard PCR was carried out using 0.4 /xM of 
each primer, 2.5 mM MgCl 2 , BSA 0.05 mM, 50 mM KC1, 10 mM Tris-HCl (pH 
9.0), 0.1 % Triton X-100, dNTPs 0.2 mM (Pharmacia), 0,5 U Taq DNA 
polymerase (Promega) coupled with TaqStart™ antibody (Clontech Laboratories 
Inc.), 1 il\ of genomic DNA sample in a final volume of 20 /-il using a PTC-200 
thermocycler (MJ Research). The optimal cycling conditions for maximum 
sensitivity and specificity were 3 minutes at 95 °C for initial denaturation, then 
forty cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 60 °C, 
followed by terminal extension at 72 °C for 2 minutes. Detection of the PCR 
products was made by electrophoresis in agarose gels (2 %) containing 0.25 /ig/ml 
of ethidium bromide. Visualization of the PCR products was made under UV at 
254 nm. 

Specificity of the assay was tested by adding to the PCR reactions 0.1 ng of 
genomic DNA from each of the following bacterial species: Escherichia coli (7 
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strains), Shigella sonnei, Shigella flexneri, Shigella dysenteriae, Salmonella 
typhimyurium, Salmonella typhi. Salmonella enteritidis, Tatumella ptyseos, 
Klebsiella pneumoniae (2 strains), Enterohacter aerogenes, Citrobacter farmed, 
Campylobacter jejuni, Serratia marcescens. Amplification was observed only for 
the Escherichia coli and Shigella sp. strains listed and Escherichia fergusoniL The 
sensitivity of the assay with^O-cycle PCR was verified with one strain of E. coli 
and three strains of Shigella sp. The detection limit for E. coli and Shigella sp. was 
1 to 10 copies of genomic DNA, depending on the strains tested. 

EXAMPLE 29: 

Specific detection and identification of Klebsiella pneumoniae using atpD 
sequences. The analysis of atpD sequences from a variety of bacterial species 
allowed the selection of PCR primers specific to K. pneumoniae. The primer 
design strategy is similar to the strategy described in Example 28 except that atpD 
sequences were used in the alignment. 

Two K. pneumoniae-specific primers were selected, (SEQ ID NOs. 1331 and 
1332) which give an amplification product of 115 bp. Standard PCR was carried 
out on PTC-200 thermocyclers (MJ Research) using 0.4 [iM of each primer as 
described in Example 28. The optimal cycling conditions for maximum sensitivity 
and specificity were as follow: three minutes at 95 °C for initial denaturation, then 
forty cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 55°C, 
followed by terminal extension at 72 °C for 2 minutes. 

Specificity of the assay was tested by adding to the PCR reactions 0.1 ng of 
genomic DNA from each of the following bacterial species: Klebsiella pneumoniae 
(2 strains), Klebsiella ornitholytica, Klebsiella oxytoca (2 strains), Klebsiella 
planticola, Klebsiella terrigena, Citrobacter freundii, Escherichia coli, Salmonella 
cholerasuis typhi, Serratia marcescens, Enterohacter aerogenes, Proteus vulgaris, 
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Kluyvera ascorbata, Kluyvera georgiana, Kluyvera cryocrescens and Yersinia 
enterolitica. Amplification was detected for the two K. pneumoniae strains, K 
planticola, K. terrigena and the three Kluyvera species tested. Analysis of the 
multiple alignment sequence of the atpD gene allowed the design of an internal 
probe SEQ ID NO. 2167 which can discrimate Klebsiella pneumoniae from other 
— Klebsiellasprznd-KluyverasprTht sensitivity of the assay with 40-cyele PGR was 
verified with one strain of K. pneumoniae. The detection limit for K pneumoniae 
was around 10 copies of genomic DNA. 

EXAMPLE 30: 

Specific detection and identification of Acinetobacter baumannii using atpD 
sequences. The analysis of atpD sequences from a variety of bacterial species 
allowed the selection of PCR primers specific to Acinetobacter baumannii The 
primer design strategy is similar to the strategy described in Example 28. 

Two A. baumanni i-specific primers were selected, SEQ ID NOs. 1690 and 1691, 
which give an amplification product of 233 bp. Standard PCR was carried out on 
PTC-200 thermocyclers (MJ Research) using 0.4 /xM of each primer as described 
in Example 28. The optimal cycling conditions for maximum sensitivity and 
specificity were as follow: three minutes at 95 °C for initial denaturation, then 
forty cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 60°C, 
followed by terminal extension at 72 °C for 2 minutes. 

Specificity of the assay was tested by adding to the PCR reactions 0.1 ng of 
genomic DNA from each of the following bacterial species: Acinetobacter 
baumannii (3 strains), Acinetobacter anitratus, Acinetobacter Iwbffi, Serratia 
marcescens, Enterobacter cloacae, Enterococcus faecalis, Pseudomonas 
aeruginosa, Psychrobacter phenylpyruvicus, Neisseria gonorrheoae, Haemophilus 
haemoliticus, Yersinia enterolitica, Proteus vulgaris, Eikenella corrodens, 
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Escherichia coll Amplification was detected only for A. baumannii, A anitratus 
and A. Iwoffi. The sensitivity of the assay with 40-cycle PCR was verified with two 
strains of A. baumannii. The detection limit for the two A. baumannii strains tested 
was 5 copies of genomic DNA. Analysis of the multiple alignment sequence of the 
atpD gene allowed the design of a A. fcaumanmi-specific internal probe (SEQ ID 
NO. 2169). 

EXAMPLE 31: 

Specific detection and identification of Neisseria gonorrhoeae using tuf sequences. 
The analysis of tuf sequences from a variety of bacterial species allowed the 
selection of PCR primers specific to Neisseria gonorrhoeae. The primer design 
strategy is similar to the strategy described in Example 28. 

Two N. gonorrhoeae-spzcific primers were selected, SEQ ID NOs. 551 and 552, 
which give an amplification product of 139 bp. PCR amplification was carried out 
on PTC-200 thermocyclers (MJ Research) using 0.4 fiM of each primer as 
described in Example 28. The optimal cycling conditions for maximum sensitivity 
and specificity were as follow: three minutes at 95 °C for initial denaturation, then 
forty cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 65°C, 
followed by terminal extension at 72 °C for 2 minutes. , 

Specificity of the assay was tested by adding into the PCR reactions, 0.1 ng of 
genomic DNA from each of the following bacterial species: Neisseria gonorrhoeae 
(19 strains), Neisseria meningitidis (2 strains), Neisseria lactamica, Neisseria 
flavescens, Neisseria animalis, Neisseria canis, Neisseria cuniculi, Neisseria 
elongata, Neisseria mucosa, Neisseria polysaccharea, Neisseria sicca, Neisseria 
subflava, Neisseria weaveri. Amplification was detected only for N. gonorrhoeae, 
N. sicca and N polysaccharea. The sensitivity of the assay with 40-cycle PCR was 
verified with two strains of N. gonorrhoeae. The detection limit for the N. 
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gonorrhoeae strains tested was 5 copies of genomic DNA. Analysis of the multiple 
alignment sequence of the tuf gene allowed the design of an internal probe, SEQ 
ID NO. 2166, which can discriminate N. gonorrhoeae from N. sicca and N. 
polysaccharea. 

~EXAMPLE"32i" 

Sequencing of bacterial svrA and parC gene fragments . Sequencing of bacterial 
gyrA and parC fragments. One of the major mechanism of resistance to quinolone 
in various bacterial species is mediated by target changes (DNA gyrase and/or 
topoisomerase IV). These enzymes control DNA topology and are vital for 
chromosome function and replication. Each of these enzymes is a tetramer 
composed of two subunits: GyrA and GyrB forming A 2 B 2 complex in DNA 
gyrase; and ParC and ParE forming and C2E2 complex in DNA topoisomerase IV. 
It has been shown that they are hotspots, called the quinolone-resitance- 
determining region (QRDR) for mutations within gyrA that encodes for the GyrA 
subunit of DNA gyrase and within parC that encodes the parC subunit of 
topoisomerase IV. 

In order to generate a database for gyrA and parC sequences that can be used for 
design of primers and/or probes for the specific detection of quinolone resistance 
in various bacterial species, gyrA and parC DNA fragments selected from public 
database (GenBanK and EMBL) from a variety of bacterial species were used to 
design oligonucleotide primers. 

Using primer pair SEQ ID NOs. 1297 and 1298, it was possible to amplify and 
determine gyrA sequences from Klebsiella oxytoca (SEQ ID NO. 1764), Klebsiella 
pneumoniae subsp. ozaneae (SEQ ID NO. 1765), Klebsiella planticola (SEQ ID 
NO. 1766), Klebsiella pneumoniae (SEQ ID NO. 1767), Klebsiella pneumoniae 
subsp. pneumoniae (two strains) (SEQ ID NOs. 1768-1769), Klebsiella 
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pneumoniae subsp. rhinoscleromatis (SEQ ID NO. 1770), Klebsiella terrigena 
(SEQ ID NO. 1771), Kluyvera ascorbata (SEQ ID NO. 2013), Kluyvera georgiana 
(SEQ ID NO. 2014) and Escherichia coli (4 strains) (SEQ ID NOs. 2277-2280). 
Using primer pair SEQ ID NOs. 1291 and 1292, it was possible to amplify and 
determine gyrA sequences from Legionella pneumophila subsp. pneumophila (SEQ 

ID NO: -lV2)rProteusmirabilis-{SEQ ID N0.1V3)-Providencid rettgeri-(SEQ 

ID NO. 1774), Proteus vulgaris (SEQ ID NO. 1775) and Yersinia enterolitica 
(SEQ ID NO. 1776). Using primer pair SEQ ID NOs. 1340 and 1341, it was 
possible to amplify and determine gyrA sequence from Staphylococcus aureus 
(SEQ ID NO. 1255). 

Using primers SEQ ID NOs. 1318 and 1319, it was possible to amplify and 
determine parC sequences from K. oxytoca (two strains) (SEQ ID NOs. 1777- 
1778), Klebsiella pneumoniae subsp. ozaenae (SEQ ID NO. 1779), Klebsiella 
planticola (SEQ ID NO. 1780), Klebsiella pneumoniae (SEQ ID NO. 1781), 
Klebsiella pneumoniae subsp. pneumoniae (two strains) (SEQ ID NOs. 1782- 
1783), Klebsiella pneumoniae subsp. rhinoscleromatis (SEQ ID NO. 1784) and 
Klebsiella terrigena (SEQ ID NO. 1785). 

EXAMPLE 33: 

Development of a PCR assay for the specific detection and identification of 
Staphylococcus aureus and its quinolone resistance genes gvrA and parC. The 
analysis of gyrA and parC sequences from a variety of bacterial species revealed 
conserved regions allowing the design of PCR primers specific to the quinolone- 
resistance-determining region (QRDR) of gyrA and parC from Staphylococcus 
aureus. PCR primer pair SEQ ID NOs. 1.340 and 1341 was designed to amplify the 
gyrA sequence of S. aureus, whereas PCR primer pair SEQ ID NOs. 1342 and 
1343 was designed to amplify S. aureus parC. The comparison of gyrA and parC 
sequences from S. aureus strains with various levels of quinolone resistance 
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allowed the identification of amino acid substitutions Ser-84 to Leu, Glu-88 to Gly 
or Lys in the GyrA subunit of DNA gyrase encoded by gyrA and amino acid 
changes Ser-80 to Phe or Tyr and Ala-116 to Glu in the PaiC subunit of 
topoisomerase IV encoded by parC These amino acid substitutions in GyrA and 
ParC subunits occur in isolates with intermediate- or high-level quinolone 
l^i^anc^ Internal ~pfobesT6fthe specific' detectibTTof wld : t)^)e" Srdufeus J>yrA 
(SEQ ID NO. 1940) and wild-type S. aureus parC (SEQ ID NO. 1941) as well as 
internal probes for the specific detection of each of the gyrA (SEQ ID NOs. 1333- 
1335) and parC mutations identified in quinolone-resistant S. aureus (SEQ ID 
NOs. 1336-1339) were designed. 

The gyrA- and parC-specific primer pairs (SEQ ID NOs. 1340-1341 and SEQ ID 
NOs. 1342-1343) were used in multiplex. PCR amplification was carried out on 
PTC-200 thermocyclers (MJ Research) using 0.3, 0.3, 0.6 and 0.6 /xM of each 
primers, respectively, as described in Example 28. The optimal cycling conditions 
for maximum sensitivity and specificity were 3 minutes at 95 °C for initial 
denaturation, then forty cycles of two steps consisting of 1 second at 95 °C and 30 
seconds at 62 °C, followed by terminal extension at 72 °C for 2 minutes. Detection 
of the PCR products was made by electrophoresis in agarose gels (2 %) containing 
0.25 /xg/ml of ethidium bromide. The specificity of the multiplex assay with 40- 
cycle PCR was verified by using 0.1 ng of purified genomic DNA from a panel of 
gram-positive bacteria. The list included the following: Abiotrophia adiacens, 
Abiotrophia defectiva, Bacillus cereus, Bacillus mycoides, Enterococcus faecalis 
(2 strains), Enterococcus flavescens, Gemella morbillorum, Lactococcus lactis, 
Listeria innocua, Listeria monocytogenes, Staphylococcus aureus (5 strains), 
Staphylococcus auricalis, Staphylococcus capitis subsp. urealyticus, 
Staphylococcus carnosus, Staphylococcus chromogenes, Staphylococcus 
epidermidis (3 strains), Staphylococcus gallinarum, Staphylococcus haemolyticus 
(2 strains), Staphylococcus hominis, Staphylococcus hominis subsp hominis, 
Staphylococcuslentus, Staphylococcus lugdunensis, Staphylococcus 
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saccharolyticus, Staphylococcus saprophytics (3 strains), Staphylococcus 
simulans, Staphylococcus warneri, Staphylococcus xylosus, Streptococcus 
agalactiae, Streptococcus pneumoniae. Strong amplification of both gyrA and 
parC genes was only detected for the S. aureus strains tested. The sensitivity of the 
multiplex assay with 40-cycle PCR was verified with one quinolone-sensitive and 
four quinolone-resistant strains of S. aureus. The detection limit was 2 to 10 copies 
of genomic DNA, depending on the strains tested. 

Detection of the hybridization with the internal probes was performed as described 
in Example 7. The internal probes specific to wild-type gyrA and parC of S. aureus 
and to the gyrA and parC variants of S. aureus were able to recognize two 
quinolone-resistant and one quinolone-sensitive S. aureus strains showing a perfect 
correlation with the susceptibility to quinolones. 

The complete assay for the specific detection of S. aureus and its susceptibility to 
quinolone contains the Staphylococcus-sptcific primers (SEQ ID NOs. 553 and 
575) described in Example 7 and the multiplex containing the 5. aureus gyrA- and 
parC-specific primer pairs (SEQ ID NOs. 1340-1341 and SEQ ID NOs. 1342- 
1343). Amplification is coupled with post-PCR hybridization with the internal 
probe specific to S. aureus (SEQ ID NO. 587) described in Example 7 and the 
internal probes specific to wild-type S. aureus gyrA and parC (SEQ ID NOs. 1940- 
1941) and to the 5. aureus gyrA and parC variants (SEQ ID NOs. 1333-1338). 

An assay was also developed for the detection of quinolone-resistant S. aureus 
using the SmartCycler (Cepheid). Real-time detection is based on the use of S. 
aureus parC-specific primers (SEQ ID NOs. 1342 and 1343) and the 
StaphylococcuS'S$ecific primers (SEQ ID NOs. 553 and 575) described in 
Example 7. Internal probes were designed for molecular beacon detection of the 
wild-type S. aureus parC (SEQ ID NO. 1939), for detection of the Ser-80 to Tyr or 
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Phe amino acid substitutions in the ParC subunit encoded by 5". aureus parC (SEQ 
ID NOs. 1938 and 1955) and for detection of 5. aureus (SEQ ID NO. 2282) . 

EXAMPLE 34: 

Development of a PCR assay for the detection and identification of Klebsiella 
pneumoniae and its quinolone resistance genes zvrA and parC. The analysis of 
gyrA and parC sequences from a variety of bacterial species from the public 
databases and from the database described in Example 32 revealed conserved 
regions allowing the design of PCR primers specific to the quinolone-resistance- 
determining region (QRDR) of gyrA and parC from K. pneumoniae. PCR primer 
pair SEQ ID NOs. 1936 and 1937, or pair SEQ ID NOs. 1937 and 1942, were 
designed to amplify the gyrA sequence of K. pneumoniae, whereas PCR primer 
pair SEQ ID NOs. 1934 and 1935 was designed to amplify AT. pneumoniae parC 
sequence. An alternative pair, SEQ ID NOs. 1935 and 1936, can also amplify K. 
pneumoniae parC. The comparison of gyrA and parC sequences from K. 
pneumoniae strains with various levels of quinolone resistance allowed the 
identification of amino acid substitutions Ser-83 to Tyr or Phe and Asp-87 to Gly 
or Ala and Asp-87 to Asn in the GyrA subunit of DNA gyrase encoded by gyrA 
and amino acid changes Ser-80 to lie or Arg and Glu-84 to Gly or Lys in the ParC 
subunit of topoisomerase IV encoded by parC These amino acid substitutions in 
the GyrA and ParC subunits occur in isolates with intermediate- or high-level 
quinolone resistance. Internal probes for the specific detection of wild-type AT. 
pneumoniae gyrA (SEQ ID NO. 1943) and wild-type K. pneumoniae parC (SEQ 
ID NO. 1944) as well as internal probes for the specific detection of each of the 
gyrA (SEQ ID NOs. 1945-1949) and parC mutations identified in quinolone- 
resistant K. pneumoniae (SEQ ID NOs. 1950-1953) were designed. 

Two multiplex using the K. pneumoniae gyrA- and parC-specific primer pairs were 
used: the first multiplex contained K. pneumoniae gyrA-specific primers (SEQ ID 
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NOs. 1937 and 1942) and K pneumoniae parC-specific primers (SEQ ID NOs. 
1934 and 1935) and the second multiplex contained K. pneumoniae gyrA/parC- 
specific primer (SEQ ID NOs. 1936), K. pneumoniae £yrA-specific primer (SEQ 
ID NO. 1937) and K pneumoniae parC-specific primer (SEQ ID NO. 1935). 
Standard PCR was carried out on PTC-200 thermocyclers (MJ Research) using for 
the first multiplex 0.6, 0.6, 0.4, 0.4 fiM of each primer, respectively, and for the 
second multiplex 0.8, 0.4, 0.4 fiM of each primer, respectively. PCR amplification 
and agarose gel electrophoresis of the amplified products were performed as 
described in Example 28. The optimal cycling conditions for maximum sensitivity 
and specificity were 3 minutes at 95 °C for initial denaturation, then forty cycles of 
two steps consisting of 1 second at 95 °C and 30 seconds at 62 °C, followed by 
terminal extension at 72 °C for 2 minutes. The specificity of the two multiplex 
assays with 40-cycle PCR was verified by using 0.1 ng of purified genomic DNA 
from a panel of gram-negative bacteria. The list included: Acinetobacter 
baumannii, Citrobacter freundii, Eikenella corrodens, Enterobacter aero genes, 
Enter obacter cancer ogenes, Enterobacter cloacae, Escherichia coli (10 strains), 
Haemophilus influenzae, Klebsiella pneumoniae, Klebsiella ornitholytica, 
Klebsiella oxytoca (2 strains), Klebsiella planticola, Klebsiella terrigena, Kluyvera 
ascorbata, Kluyvera cryocrescens, Kluyvera georgiana, Neisseria gonorrhoeae, 
Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Salmonella 
choleraesuis subsp. typhimurium, Salmonella enteritidis, Serratia liquefaciens, 
Serratia marcescens and Yersinia enterocolytica. For both multiplex, strong 
amplification of both gyrA and parC was observed only for the K pneumoniae 
strain tested. The sensitivity of the two multiplex assays with 40-cycle PCR was 
verified with one quinolone-sensitive strain of K pneumoniae. The detection limit 
was around 10 copies of genomic DNA. 

The complete assay for the specific detection of K. pneumoniae and its 
susceptibility to quinolone contains the Klebsiella-specific primers (SEQ ID NOs. 
1331 and 1332) described in Example 29 and either the multiplex containing the K 
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pneumoniae gyrA- and parC-specific primers (SEQ ID NOs. 1935, 1936, 1937) or 
the multiplex containing the K. pneumoniae gyrA- and /wC-specific primers (SEQ 
ID NOs. 1934, 1937, 1939, 1942). Amplification is coupled with post-PCR 
hybridization with the internal probe specific to K. pneumoniae (SEQ ID NO. 
2167) described in Example 29 and the internal probes specific to wild-fyp* K. 
pneumoniae gyrA and parC (SEQ ID NOs. 1943, 1944) and to the K. pneumoniae 
gyrA and parC variants (SEQ ID NOs. 1945-1949 and 1950-1953). 

An assay was also developed for the detection of quinolone-resistant K. 
pneumoniae using the SmartCycler (Cepheid). Real-time detection is based on the 
use of resistant K. pneumoniae gyM -specific primers (SEQ ID NOs. 1936 and 
1937) and the K. pneumoniae-spzcific primers (SEQ ID NOs. 1331 and 1332) 
described in Example 29. Internal probes were designed for molecular beacon 
detection of the wild-type K. pneumoniae gyrA (SEQ ID NO. 2251), for detection 
of the Ser-83 to Tyr or Phe and/or Asp-87 to Gly or Asn in the GyrA subunit of 
DNA gyrase encoded by gyrA (SEQ ID NOs. 2250) and for detection of K. 
pneumoniae (SEQ ID NO. 2281). 

EXAMPLE 35: 

Development of a PCR assay for detection and identification of 5. pneumoniae and 
its quinolone resistance genes gyrA and parC . The analysis of gyrA and parC 
sequences from a variety of bacterial species revealed conserved regions allowing 
the design of PCR primers able to amplify the quinolone-resistance-determining 
region (QRDR) of gyrA and parC from all S. pneumoniae strains. PCR primer pair 
SEQ ID NOs. 2040 and 2041 was designed to amplify the QRDR of S. 
pneumoniae gyrA, whereas PCR primer pair SEQ ID NOs. 2044 and 2045 was 
designed to amplify the QRDR of S. pneumoniae parC. The comparison of gyrA 
and parC sequences from S. pneumoniae strains with various levels of quinolone 
resistance allowed the identification of amino acid substitutions Ser-81 to Phe or 
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Tyr in the GyrA subunit of DNA gyrase encoded by gyrA and amino acid changes 
Ser-79 to Phe in the ParC subunit of topoisomerase IV encoded by parC. These 
amino acid substitutions in the GyrA and ParC subunits occur in isolates with 
intermediate- or high-level quinolone resistance. Internal probes for the specific 
detection of each of the gyrA (SEQ ID NOs. 2042 and 2043) and parC (SEQ ID 

NOr -2046)-mutations - identified- in- quinolone-resistant-i Sr-pneumoniae- were 

designed. 

For all bacterial species, amplification was performed from purified genomic 
DNA. 1 /til of genomic DNA at 0.1 ng//iL was transferred directly to a 19 fi\ PCR 
mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 9.0), 
0.1% Triton X-100, 2.5 mM MgCl 2 , 0.4 /iM (each) of the above primers SEQ ID 
NOs. 2040, 2041, 2044 and 2045, 0.05 mM bovine serum albumin (BSA) and 0.5 
U Taq polymerase coupled with TaqStart™ antibody. The optimal cycling 
conditions for maximum sensitivity and specificity were 3 minutes at 95 °C for 
initial denaturation, then forty cycles of two steps consisting of 1 second at 95 °C 
and 30 seconds at 58 °C, followed by terminal extension at 72 °C for 2 minutes. In 
order to generate Digoxigenin (DIG)-labeled amplicons for capture probe 
hybridization, 0.1X PCR DIG labeling four deoxynucleoside triphosphates mix 
(Boehringer Mannheim GmbH) was used for amplification. 

The DIG-labeled amplicons were hybridized to the capture probes bound to 96- 
well plates. The plates were incubated with anti-DIG-alkaline phosphatase and the 
chemiluminescence was measured by using a luminometer (MLX, Dynex 
Technologies Inc.) after incubation with CSPD and recorded as Relative Light Unit 
(RLU). The RLU ratio of tested sample with and without captures probes was then 
calculated. A ratio > 2.0 was defined as a positive hybridization signal. All 
reactions were performed in duplicate. 
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The specificity of the multiplex assay with 40-cycle PCR was verified by using 0.1 
ng of purified genomic DNA from a panel of bacteria listed in Table 13. Strong 
amplification of both gyrA and parC was detected only for the S. pneumoniae 
strains tested. Weak amplification of both gyrA and parC genes was detected for 
Staphylococcus simulans. The detection limit tested with purified genomic DNA 
from 5 strains of S. pneumoniae was 1 to 10 genome copies. In addition, 5 
quinolone-resistant and 2 quinolone-sensitive clinical isolates of S. pneumoniae 
were tested to further validate the developed multiplex PCR coupled with capture 
probe hybridization assays. There was a perfect correlation between detection of S. 
pneumoniae gyrA and parC mutations and the susceptibility to quinolone. 
The complete assay for the specific detection of S. pneumoniae and its 
susceptibility to quinolone contains the S. pneumoniae-specific primers (SEQ ID 
NOs. 1179 and 1181) described in Exemple 20 and the multiplex containing the S. 
pneumoniae gyM-specific and /rarC-specific primer pairs (SEQ ID NOS. 2040 and 
2041 and SEQ ID NOs. 2044 and 2045). Amplification is coupled with post-PCR 
hybridization with the internal probe specific to S. pneumoniae (SEQ ID NO. 
1180) described in Example and the internal probes specific to each of the S. 
pneumoniae gyrA and parC variants (SEQ ID NOs. 2042, 2043 and 2046). 

EXAMPLE 36: 

Detection of extended-spectrum TEM-tvpe B-lactamases in Escherichia coll The 
analysis of TEM sequences which confer resistance to third-generation 
cephalosporins and to p-lactamase inhibitors allowed the identification of amino 
acid substitutions Met-69 to He or Leu or Val, Ser- 130 to Gly, Arg-164 to Ser or 
His, Gly-238 to Ser , Glu-240 to Lys and Arg-244 to Ser or Cys or Thr or His or 
Leu. PCR primers SEQ ID NOs. 1907 and 1908 were designed to amplify TEM 
sequences. Internal probes for the specific detection of wild-type TEM (SEQ ID 
NO. 2141) and for each of the amino acid substitutions (SEQ ID NOs. 1909-1926) 
identified in TEM variants were designed to detect resistance to third-generation 
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cephalosporins and to ^-lactamase inhibitors. Design and synthesis of primers and 
probes, and detection of the hybridization were performed as described in Example 
7. 

For all bacterial species, amplification was performed from purified genomic 
- DNA, One -/xl-of genomic DNAatO. 1 ng/jLtl-was~transferred directly to a 1 9 -fi\ PGR- 
mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 9.0); 
0.1% Triton X-100, 2.5 mM MgCl 2 , 0.4 /xM of the TEM-specific primers SEQ ID 
NOs. 1907 and 1908, 200 /xM (each) of the four deoxynucleoside triphosphates, 
0.05 mM bovine serum albumin (BSA) and 0.5 U Taq polymerase (Promega) 
coupled with TaqStart™ antibody. PCR amplification and agarose gel analysis of 
the amplified products were performed as described in Example 28. The optimal 
cycling conditions for maximum sensitivity and specificity were 3 minutes at 95 
°C for initial denaturation, then forty cycles of three steps consisting of 5 seconds 
at 95 °C, 30 seconds at 55 °C and 30 seconds at 72 °C, followed by terminal 
extension at 72 °C for 2 minutes. 

The specificity of the TEM-specific primers with 40-cycle PCR was verified by 
using 0.1 ng of purified genomic from the following bacteria: three third- 
generation cephalosporin-resistant Escherichia coli strains (one with TEM-10, one 
with TEM-28 and the other with TEM-49), two third-generation cephalosporin- 
sensitive Escherichia coli strain (one with TEM-1 and the other without TEM), one 
third-generation cephalosporin-resistant Klebsiella pneumoniae strain (with TEM- 
47), and one f3-lactamase-inhibitor-resistant Proteus mirabilis strain (with TEM- 
39). Amplification with the TEM-specific primers was detected only for strains 
containing TEM. 



The sensitivity of the assay with 40-cycle PCR was verified with three E. coli 
strains containing TEM-1 or TEM-10 or TEM-49, one K. pneumoniae strain 
containing TEM-47 and one P. mirabilis strain containing TEM-39. The detection 
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limit was 5 to 100 copies of genomic DNA, depending on the TEM-containing 
strains tested. 



The TEM-specific primers SEQ ID NOs. 1907 and 1908 were used in multiplex 
with the Escherichia coli/Shigella .sp.-specific primers SEQ ID NOs. 1661 and 
1665 described in Example 28 to allow the complete identification of Escherichia 
coli/Shigella sp. and the susceptibility to P-lactams. PCR amplification with 0.4 
jiM of each of the primers and agarose gel analysis of the amplified products was 
performed as described above. 

The specificity of the multiplex with 40-cycle PCR was verified by using 0.1 ng of 
purified genomic DNA from the following bacteria: three third-generation 
cephalosporin-resistant Escherichia coli strains (one with TEM- 10, one with TEM- 
28 and the other with TEM-49), two third-generation cephalosporin-sensitive 
Escherichia coli strain (one with TEM- 1 and the other without TEM), one third- 
generation cephalosporin-resistant Klebsiella pneumoniae strain (with TEM-47), 
and one p-lactamase-inhibitor-resistant Proteus mirabilis strain (with TEM-39). 
The multiplex was highly specific to Escherichia coli strains containing TEM. 

The complete assay for detection of TEM-type (3-lactamases in £. coli includes 
PCR amplification using the multiplex containing the TEM-specific primers (SEQ 
ID NOs. 1907 and 1908) and the Escherichia coli/Shigella sp.-specific primers 
(SEQ ID NOs. 1661 and 1665) coupled with post PCR-hybridization with the 
internal probes specific to wild-type TEM (SEQ ID NO. 2141) and to the TEM 
variants (SEQ ID NOs. 1909-1926). 

EXAMPLE 37: 



Detection of extended-spectrum SHV-tvpe B-lactamases in Klebsiella pneumoniae. 
The comparison of SHV sequences, which confer resistance to third-generation 

123 



WO 01/23604 PCT/CA00/01150 

cephalosporins and to ^-lactamase inhibitors, allowed the identification of amino 
acid substitutions Ser-130 to Gly, Asp-179 to Ala or Asn, Gly-238 to Ser , and 
Glu-240 to Lys. PCR primer pair SEQ ID NOs. 1884 and 1885 was designed to 
amplify SHV sequences. Internal probes for the specific identification of wild-type 
SHV (SEQ ID NO. 1896) and for each of the amino acid substitutions (SEQ ID 
NOsTl 886-1 895~and~ 1897-1 898) "identified "in SHV variants >ere"desigh~e"d~t6 
detect resistance to third-generation cephalosporins and to ^-lactamase inhibitors. 
Design and synthesis of primers and probes, and detection of the hybridization 
were performed as described in Example 7. 

For all bacterial species, amplification was performed from purified genomic 
DNA. One fi\ of of genomic DNA at 0.1ng//il was transferred directly to a 19 fil 
PCR mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 
9.0), 0.1% Triton X-100, 2.5 mM MgCl 2 , 0.4 /iM of the SHV-specific primers 
SEQ ID NO. 1884 and 1885, 200 (each) of the four deoxynucleoside 
triphosphates, 0.05 mM bovine serum albumin (BSA) and 0.5 U Taq polymerase 
(Promega) coupled with TaqStart™ antibody. PCR amplification and agarose gel 
analysis of the amplified products were performed as described in Example 28. 
The optimal cycling conditions for maximum sensitivity and specificity were 3 
minutes at 95 °C for initial denaturation, then forty cycles of three steps consisting 
of 5 seconds at 95 °C, 30 seconds at 55 °C and 30 seconds at 72 °C, followed by 
terminal extension at 72 °C for 2 minutes. 

The specificity of the SHV-specific primers with 40-cycle PCR was verified by 
using 0.1 ng of purified genomic from the following bacteria: two third-generation 
cephalosporin-resistant Klebsiella pneumoniae strains (one with SHV-2a and the 
other with SHV-12), one third-generation cephalosporin-sensitive Klebsiella 
pneumoniae strain (with SHV-1), two third-generation cephalosporin-resistant 
Escherichia coli strains (one with SHV-8 and the other with SHV-7), and two 
third-generation cephalosporin-sensitive Escherichia coli strains (one with SHV-1 
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and the other without any SHV). Amplification with the SHV-specific primers was 
detected only for strains containing SHV. 



The sensitivity of the assay with 40-cycle PCR was verified with four strains 
containing SHV. The detection limit was 10 to 100 copies of genomic DNA, 
— dependingon theSHV^containing strains tested: ~ 

The amplification was coupled with post-PCR hybridization with the internal 
probes specific for identification of wild-type SHV (SEQ ID NO. 1896) and for 
each of the amino acid substitutions (SEQ ID NOs. 1886-1895 and 1897-1898) 
identified in SHV variants. The specificity of the probes was verified with six 
strains containing various SHV enzymes, one Klebsiella pneumoniae strain 
containing SHV-1, one Klebsiella pneumoniae strain containing SHV-2a, one 
Klebsiella pneumoniae strain containing SHV- 12, one Escherichia coli strain 
containing SHV-1, one Escherichia coli strain containing SHV-7 and one 
Escherichia coli strain containing SHV-8. The probes correctly detected each of 
the SHV genes and their specific mutations. There was a perfect correlation 
between the SHV genotype of the strains and the susceptibility to (3-lactam 
antibiotics. 

The SHV-specific primers SEQ ID NOs. 1884 and 1885 were used in multiplex 
with the K. pneumoniae-sptcific primers SEQ ID NOs. 1331 and 1332 described 
in Example 29 to allow the complete identification of K. pneumoniae and the 
susceptibility to (3-lactams. PCR amplification with 0.4 /iM of each of the primers 
and agarose gel analysis of the amplified products were performed as described 
above. 

The specificity of the multiplex with 40-cycle PCR was verified by using 0.1 ng of 
purified genomic DNA from the following bacteria: three K. pneumoniae strains 
containing SHV-1, one Klebsiella pneumoniae strain containing SHV-2a, one 
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Klebsiella pneumoniae strain containing SHV-12, one K. rhinoscleromatis strain 
containing SHV-1, one Escherichia coli strain without SHV. The multiplex was 
highly specific to Klebsiella pneumoniae strain containing SHV. 

EXAMPLE 38: 



Development of a PCR assay for the detection and identification of Neisseria 
gonorrhoeae and its associated tetracycline resistance gene tetM . The analysis of 
publicly available tetM sequences revealed conserved regions allowing the design 
of PCR primers specific to tetM sequences. The PCR primer pair SEQ ID NOs. 
1588 and 1589 was used in multiplex with the Neisseria gonorrhoeae-spzcific 
primers SEQ ID NOs. 551 and 552 described in Example 31. Sequence alignment 
analysis of tetM sequences revealed regions suitable for the design of an internal 
probe specific to tetM (SEQ ID NO. 2254). PCR amplification was carried out on 
PTC-200 thermocyclers (MJ Research) using 0.4 /iM of each primer pair as 
described in Example 28. The optimal cycling conditions for maximum sensitivity 
and specificity were as follow: three minutes at 95 °C for initial denaturation, then 
forty cycles of two steps consisting of 1 second at 95 °C and 30 seconds at 60°C, 
followed by terminal extension at 72 °C for 2 minutes. 

The specificity of the multiplex PCR assay with 40-cycle PCR was verified by 
using 0.1 ng of purified genomic DNA from the following bacteria: two 
tetracycline-resistant Escherichia coli strains (one containing the tetracycline- 
resistant gene tetB and the other containing the tetracycline-resistant gene tetQ, 
one tetracycline-resistant Pseudomonas aeruginosa strain (containing the 
tetracycline-resistant gene tetA), nine tetracycline-resistant Neisseria gonorrhoeae 
strains, two tetracycline-sensitive Neisseria meningitidis strains, one tetracycline- 
sensitive Neisseria polysaccharea strain, one tetracycline-sensitive Neisseria sicca 
strain and one tetracycline-sensitive Neisseria subflava strain. Amplification with 
both the tefAf-specific and Neisseria gonorrhoeae-sptcific primers was detected 
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only for N. gonorrhoeae strains containing tetM. There was a weak amplification 
signal using Neisseria gonorrhoeae-specific primers for the following species: 
Neisseria sicca, Neisseria polysaccharea and Neisseria meningitidis. There was a 
perfect correlation between the tetM genotype and the tetracycline susceptibility 
pattern of the Neisseria gonorrhoeae strains tested. The internal probe specific to 
N. gonorrhoeae SEQ ID NO. 2166 described in Example 31 can discriminate 
Neisseria gonorrhoeae from the other Neisseria sp. 

The sensitivity of the assay with 40-cycle PCR was verified with two tetracycline 
resistant strains of N gonorrhoeae. The detection limit was 5 copies of genomic 
DNA for both strains. 

EXAMPLE 39: 

Development of a PCR assay for the detection and identification of Shigella sp. 
and their associated trimethoprim resistance gene dhfrla. The analysis of publicly 
available dhfrla and other dhfr sequences revealed regions allowing the design of 
PCR primers specific to dhfrla sequences. The PCR primer pair (SEQ ID NOs. 
1459 and 1460) was used in multiplex with the Escherichia coli/Shigella sp.- 
specific primers SEQ ID NOs. 1661 and 1665 described in Example 28. Sequence 
alignment analysis of dhfrla sequences revealed regions suitable for the design of 
an internal probe specific to dhfrla (SEQ ID NO. 2253). PCR amplification and 
agarose gel analysis of the amplified products were performed as described in 
Example 28 with an annealing temperature of 60 °C. The specificity of the 
multiplex assay with 40-cycle PCR was verified by using 0.1 ng of purified 
genomic DNA from a panel of bacteria. The list included the following 
trimethoprim-sensitive strains, Salmonella typhimyurium, Salmonella typhi, 
Salmonella enteritidis, Tatumella ptyseos, Klebsiella pneumoniae, Enterobacter 
aerogenes, Citrobacter farmeri, Campylobacter jejuni, Serratia marcescens, 
Shigella dysenteriae, Shigella flexneri, Shigella sonnei, six trimethoprim-resistant 
Escherichia coli strains (containing dhfrla or dhfrV or dhfrVII or dhfrXII or 



127 



WO 01/23604 PCT/CAOO/01150 

dhfrXIIl or dhfrXV), four trimethoprim-resistant strains containing dhfrla (Shigella 
sonnei, Shigella flexneri, Shigella dysenteriae and Escherichia coli). There was a 
perfect correlation between the dhfrla genotype and the trimethoprim susceptibility 
pattern of the Escherichia coli and Shigella sp. strains tested. The dhfrla primers 
were specific to the dhfrla gene and did not amplify any of the other trimethoprim- 
resistant dhff "gents "tested. The sensitivity of the "multiplex" assay witlT40 r cycle 
PCR was verified with three strains of trimethoprim-resistant strains of Shigella sp. 
The detection limit was 5 to 10 genome copies of DNA, depending on the Shigella 
sp. strains tested. 

EXAMPLE 40: 



Development of a PCR assay for the detection and identification of Acinetobacter 
baumannii and its associated aminoglycoside resistance gene aph(3 $ )-VIa. The 
comparison of publicly available aph( 3')- Via sequence revealed regions allowing 
the design of PCR primers specific to aph(3')-VIa. The PCR primer pair (SEQ ID 
NOs. 1404 and 1405) was used in multiplex with the Acinetobacter baumannii- 
specific primers SEQ ID NOs. 1692 and 1693 described in Example 30. Analysis 
of the aph(3 f )-VIa sequence revealed region suitable for the design of an internal 
probe specific to aph{3 y )-Vla (SEQ ID NO. 2252). PCR amplification and agarose 
gel analysis of the amplified products were performed as described in Example 28. 
The specificity of the multiplex assay with 40-cycle PCR was verified by using 0.1 
ng of purified genomic DNA from a panel of bacteria including: two 
aminoglycoside-resistant A. baumanni strains (containing aph(3')-VIa), one 
aminoglycoside-sensitive A. baumani strain, one of each of the following 
aminoglycoside-resistant bacteria, one Serratia marcescens strain containing the 
aminoglycoside-resistant gene aacCl, one Serratia marcescens strain containing 
the aminoglycoside-resistant gene aacC4, one Enterobacter cloacae strain 
containing the aminoglycoside-resistant gene aacC2, one Enterococcus faecalis 
containing the aminoglycoside-resistant gene aacA-aphD, one Pseudomonas 
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aeruginosa strain containing the aminoglycoside-resistant gene aac6Ila and one of 
each of the following aminoglycoside-sensitive bacterial species, Acinetobacter 
anitratus, Acinetobacter Iwoffi, Psychobbacter phenylpyruvian, Neisseria 
gonorrhoeae, Haemophilus haemolyticus, Haemophilus influenzae, Yersinia 
enterolitica, Proteus vulgaris, Eikenella corrodens, Escherichia colL There was a 

perfect correlation between the -aph(3 K )=VIa genotype" ^d the 

susuceptibility pattern of the A baumannii strains tested. The aph(3')-VIa-specific 
primers were specific to the aph(3')-VIa gene and did not amplify any of the other 
aminoglycoside-resistant genes tested. The sensitivity of the multiplex assay with 
40-cycle PCR was verified with two strains of aminoglycoside-resistant strains of 
A. baumannii. The detection limit was 5 genome copies of DNA for both A. 
baumannii strains tested. 

EXAMPLE 41: 

Specific identification of Bacteroides fragilis using atpD (V-tvpe) sequences. The 
comparison of atpD (V-type) sequences from a variety of bacterial species allowed 
the selection of PCR primers for Bacteroides fragilis. The strategy used to design 
the PCR primers was based on the analysis of a multiple sequence alignement of 
various atpD sequences from B. fragilis, as well as arpZ) sequences from the 
related species B. dispar, bacterial genera and archaea, especially representatives 
with phylogenetically related atpD sequences. A careful analysis of this alignment 
allowed the selection of oligonucleotide sequences which are conserved within the 
target species but which discriminate sequences from other species, especially 
from closely related species B. dispar, thereby permitting the species-specific and 
ubiquitous detection and identification of the target bacterial species. 

The chosen primer pair, SEQ ID NOs. 2134-2135, produces an amplification 
product of 231 bp. Standard PCR was carried out on PTC-200 thermocyclers (MJ 
Research Inc.) using 0.4jxM of each primers pair as described in Example 28. The 
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optimal cycling conditions for maximum sensitivity and specificity were as 
follows: three minutes at 95°C for initial denaturation, then forty cycles of two 
steps consisting of 1 second at 95°C and 30 seconds at 60°C, followed by terminal 
extension at 72°C for 2 minutes. 



The format of this assay is not limited to the one described above. A person skilled 
in the art could adapt the assay for different formats such as PCR with real-time 
detection using molecular beacon probes. Molecular beacon probes designed to be 
used in this assay include, but are not limited to, SEQ ID NO. 2136 for the 
detection of the B. fragilis amplicon. 

EXAMPLE 42: 

Evidence for horizontal gene transfer in the evolution of the elongation factor Tu in 
Enterococci. 

ABSTRACT 

The elongation factor Tu, encoded by tuf genes, is a GTP binding protein that 
plays a central role in protein synthesis. One to three tuf genes per genome are 
present depending on the bacterial species. Most low G+C gram-positive bacteria 
carry only one tuf gene. We have designed degenerate PCR primers derived from 
consensus sequences of the tuf gene to amplify partial tuf sequences from 17 
enterococcal species and other phylogenetically related species. The amplified 
DNA fragments were sequenced either by direct sequencing or by sequencing 
cloned inserts containing putative amplicons. Two different tuf genes (tufA and 
tufB) were found in 11 enterococcal species, including Enterococcus avium, E. 
casseliflavus, E. dispar, E. durans, E. faecium, E. gallinarum, E. hirae, E. 
malodoratus, E. mundtii, E. pseudoavium, and E. raffinosus. For the other six 
enterococcal species (E. cecorum, E. columbae, E. faecalis, E. sulfureus, E. 
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saccharolyticus, and E. solitarius), only the tuf A gene was present. Based on 16S 
rRNA gene sequence analysis, the 1 1 species having two tuf genes all share a 
common ancestor, while the six species having only one copy diverged from the 
enterococcal lineage before that common ancestor. The presence of one or two 
copies of the tuf gene in enterococci was confirmed by Southern hybridization. 
"Pfiylbgenetic" analysis of ~fu/"sequences~dem^^ 

gene branches with the Bacillus, Listeria and Staphylococcus genera, while the 
enterococcal tuJB gene clusters with the genera Streptococcus and Lactococcus. 
Primary structure analysis showed that four amino acid residues within the 
sequenced regions are conserved and unique to the enterococcal tufB genes and the 
tuf genes of streptococci and L. lactis. The data suggest that an ancestral 
streptococcus or a streptococcus-related species may have horizontally transferred 
a tuf gene to the common ancestor of the 1 1 enterococcal species which now carry 
two rw/genes. 

INTRODUCTION 

The elongation factor Tu (EF-Tu) is a GTP binding protein playing a central role 
in protein synthesis. It mediates the recognition and transport of aminoacyl-tRNAs 
and their positioning to the A-site of the ribosome. The highly conserved function 
and ubiquitous distribution render the elongation factor a valuable phylogenetic 
marker among eubacteria and even throughout the archaebacterial and eukaryotic 
kingdoms. The tuf genes encoding elongation factor Tu are present in various copy 
numbers per bacterial genome. Most gram-negative bacteria contain two tuf genes. 
As found in Escherichia coli f the two genes, while being almost identical in 
sequence, are located in different parts of the bacterial chromosome. However, 
recently completed microbial genomes revealed that only one tuf gene is found in 
Helicobacter pylori as well as in some obligate parasitic bacteria, such as Borrelia 
burgdorferi, Rickettsia prowazekii, and Treponema pallidum, and in some 
cyanobacteria. In most gram-positive bacteria studied so far, only one tuf gene was 
found. However, Southern hybridization showed that there are two tuf genes in 
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some Clostridia as well as in Streptomyces coelicolor and S. lividans. Up to three 
tuf like genes have been identified in S. ramocissimus. 

Although massive prokaryotic gene transfer is suggested to be one of the factors 
responsible for the evolution of bacterial genomes, the genes encoding components 
of the translation machinery are thought to be highly conserved and difficult to be 

transferred-horizontally-due to-the -complexity ^ of-their-interactions^However r a few 

recent studies demonstrated evidence that horizontal gene transfer has also 
occurred in the evolution of some genes coding for the translation apparatus, 
namely, 16S rRNA and some aminoacyl-tRNA synthetases. No further data 
suggest that such a mechanism is involved in the evolution of the elongation 
factors. Previous studies concluded that the two copies of tuf genes in the genomes 
of some bacteria resulted from an ancient event of gene duplication. Moreover, a 
study of the tuf gene in R. prowazekii suggested that intrachromosomal 
recombination has taken place in the evolution of the genome of this organism. 

To date, little is known about the tuf genes of enterococcal species. In this study, 
we analyzed partial sequences of tuf genes in 17 enterococcal species, namely, E. 
avium, E. casseliflavus, E. cecorum, E. columbae, E. dispar, E. durans, £. faecalis, 
E. faecium, E. gallinarum, E. hirae, E. malodoratus, E. mundtii, E. pseudoavium, 
E. rajfinosus, E. saccharolyticus, E. solitarius, and E. sulfureus. We report here the 
presence of two divergent copies of tuf genes in 1 1 of these enterococcal species. 
The 6 other species carried a single tuf gene. The evolutionary implications are 
discussed. 

MATERIALS AND METHODS 

Bacterial strains. Seventeen enterococcal strains and other gram-positive 
bacterial strains obtained from the American Type Culture Collection (ATCC, 
Manassas, Va.) were used in this study (Table 16), All strains were grown on sheep 
blood agar or in brain-heart infusion broth prior to DNA isolation. 
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DNA isolation. Bacterial DNAs were prepared using the G NOME DNA 
extraction kit (BiolOl, Vista, Calif.) as previously described. 

Sequencing of putative tuf genes. In order to obtain the fa/gene sequences of 
enterococci and other gram-positive bacteria, two sequencing approaches were 
used; 1) sequencing of cloned PCR products and 2) direct sequencing of PCR 
-products- A pair, of degenerate primers (SEQ ID NOs. -664- and -697) -were used-to - 
amplify an 886-bp portion of the tuf genes from enterococcal species and other 
gram-positive bacteria as previously described. For E. avium, E. casseliflavus, E. 
dispar, E. durans, E. faecium, E. gallinarum, E. hirae, E. mundtii, E. pseudoavium, 
and E. raffinosus, the amplicons were cloned using the Original TA cloning kit 
(Invitrogen, Carlsbad, Calif.) as previously described. Five clones for each species 
were selected for sequencing. For E. cecorum, E. faecalis, E. saccharolyticus, and 
E. solitarius as well as the other gram-positive bacteria, the sequences of the 886- 
bp amplicons were obtained by direct sequencing. Based on the results obtained 
from the earlier rounds of sequencing, two pairs of primers were designed for 
obtaining the partial tuf sequences from the other enterococcal species by direct 
sequencing. One pair of primers (SEQ ID NOs. 543 and 660) were used to amplify 
the enterococcal tuf gene fragments from £. columbae, E. malodoratus, and £. 
sulfur eus. Another pair of primers (SEQ ID NOs. 664 and 661) were used to 
amplify the second tuf gene fragments from E. avium, E. malodoratus, and E. 
pseudoavium. 

Prior to direct sequencing, PCR products were electrophoresed on 1% agarose 
gel at 120V for 2 hours. The gel was then stained with 0.02% methylene blue for 
30 minutes and washed twice with autoclaved distilled water for 15 minutes. The 
gel slices containing PCR products of the expected sizes were cut out and purified 
with the QIAquick gel extraction kit (QIAgen Inc., Mississauga, Ontario, Canada) 
according to the manufacturer's instructions. PCR mixtures for sequencing were 
prepared as described previously. DNA sequencing was carried out with the Big 
Dye™ Terminator Ready Reaction cycle sequencing kit using a 377 DNA 
sequencer (PE Applied Biosystems, Foster City, Calif.). Both strands of the 
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amplified DNA were sequenced. The sequence data were verified using the 
Sequencer 3.0 software (Gene Codes Corp., Ann Arbor, Mich.). 

Sequence analysis and phylogenetic study. Nucleotide sequences of the tuf 
genes and their respective flanking regions for E. faecalis, Staphylococcus aureus, 
and Streptococcus pneumoniae, were retrieved from the TIGR microbial genome 
"database ~and~S." pyogenes "from - the University of Oldahoma "database. DNA" 
sequences and deduced protein sequences obtained in this study were compared 
with those in all publicly available databases using the BLAST and FASTA 
programs. Unless specified, sequence analysis was conducted with the programs 
from GCG package (Version 10; Genetics Computer Group, Madison, Wise). 
Sequence alignment of the tuf genes from 74 species representing all three 
kingdoms of life (Tables 16 and 17) were carried out by use of Pileup and 
corrected upon visual analysis. The N- and C-termini extremities of the sequences 
were trimmed to yield a common block of 201 amino acids sequences and 
equivocal residues were removed. Phylogenetic analysis was performed with the 
aid of PAUP 4.0b4 written by Dr. David L. Swofford (Sinauer Associates, Inc., 
Publishers, Sunderland, Mass.). The distance matrix and maximum parsimony 
were used to generate phylogenetic trees and bootstrap resampling procedures 
were performed using 500 and 100 replications in each analysis, respectively. 

Protein structure analysis. The crystal structures of (i)Thermus aquaticus EF- 
Tu in complex with Phe-tRNA Phe and a GTP analog and (ii) E. coli EF-Tu in 
complex with GDP served as templates for constructing the equivalent models for 
enterococcal EF-Tu. Homology modeling of protein structure was performed using 
the SWISS-MODEL server and inspected using the SWISS-PDB viewer version 
3.1. 

Southern hybridization. In a previous study, we amplified and cloned an 803- 
bp PCR product of the tuf gene fragment from E. faecium. Two divergent 
sequences of the inserts, which we assumed to be tufA and tufB genes, were 
obtained. The recombinant plasmid carrying either tufA or tufB sequence was used 
to generate two probes labeled with Digoxigenin (DIG)-ll-dUTP by PCR 
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incorporation following the instructions of the manufacturer (Boehringer 
Mannheim, Laval, Quebec, Canada). Enterococcal genomic DNA samples (1-2 \ig) 
were digested to completion with restriction endonucleases BgUl and Xbal as 
recommended by the supplier (Amersham Pharmacia Biotech, Mississauga, 
Ontario, Canada), These restriction enzymes were chosen because no restriction 
sites were observed within the amplified tuf gene fragments of most enterococci. 
Southern blotting and filter hybridization were performed using positively charged 
nylon membranes (Boehringer Mannheim) and QuikHyb hybridization solution 
(Stratagene Cloning Systems, La Jolla, Calif.) according to the manufacturers' 
instructions with modifications. Twenty \i\ of each digestion were electrophoresed 
for 2 h at 120V on a 0.8% agarose gel. The DNA fragments were denatured with 
0.5 M NaOH and transferred by Southern blotting onto a positively charged nylon 
membrane (Boehringer Mannheim). The filters were pre-hybridized for 15 min and 
then hybridized for 2 h in the QuikHyb solution at 68°C with either DIG-labeled 
probe. Posthybridization washings were performed twice with 0.5x SSC, 1% SDS 
at room temperature for 15 min and twice in the same solution at 60°C for 15 min. 
Detection of bound probes was achieved using disodium 3- (4-methoxyspiro (1,2- 
dioxetane-3,2- (5'-chloro) tricyclo(3,3. 1 . 1 3 ' 7 ) decan)-4-yl) phenyl phosphate 
(CSPD) (Boehringer Mannheim) as specified by the manufacturer. 

GenBank submission. The GenBank accession numbers for partial tuf gene 
sequences generated in this study are given in Table 16. 

RESULTS 

Sequencing and nucleotide sequence analysis. In this study, all gram-positive 
bacteria other than enterococci yielded a single tuf sequence of 886 bp using 
primers SEQ ID NOs. 664 and 697 (Table 16). Each of four enterococcal species 
including E. cecorum, E.faecalis, E. saccharolyticus, and £. solitarius also yielded 
one 886-bp tuf sequence. On the other hand, for E. avium, E. casseliflavus, E. 
dispar, E. durans, £. faecium, E. gallinarum, £. hirae, E. mundtii, E. pseudoavium, 
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and E. raffinosus, direct sequencing of the 886-bp fragments revealed overlapping 
peaks according to their sequence chromatograms, suggesting the presence of 
additional copies of the tuf gene. Therefore, the tuf gene fragments of these 10 
species were cloned first and then sequenced. Sequencing data revealed that two 
different types of tuf sequences {tuf A and tufB) are found in eight of these species 
--including- E. casseliflavusrE- dispart Er durans. E; faeciumj Ergallinarum, ~E~ 
hirae, E. mundtii, and E. raffinosus. Five clones from E. avium and E. 
pseudoavium yielded only a single tuf sequence. These new sequence data allowed 
the design of new primers specific for the enterococcal tuf A or tufB sequences. 
Primers SEQ ID NOs. 543 and 660 were designed to amplify only enterococcal 
tuf A sequences and a 694-bp fragment was amplified from all 17 enterococcal 
species. The 694-bp sequences of tufA genes from E. columbae, E. malodorants, 
and E. sulfureus were obtained by direct sequencing using these primers. Primers 
SEQ ID NOs. 664 and 661 were designed for the amplification of 730-bp portion 
of tufB genes and yielded the expected fragments from 1 1 enterococcal species, 
including E. malodoratus and the 10 enterococcal species in which heterogeneous 
tuf sequences were initially found. The sequences of the tufB fragments for E. 
avium, E. malodoratus and E. pseudoavium were determined by direct sequencing 
using the primers SEQ ID NOs. 664 and 661. Overall, tufA gene fragments were 
obtained from all 17 enterococcal species but tufB gene fragments were obtained 
with only 1 1 enterococcal species (Table 16). 

The identities between tufA and tufB for each enterococcal species were 68-79% 
at the nucleotide level and 81 to 89% at the amino acid level. The tufA gene is 
highly conserved among all enterococcal species with identities varying from 87% 
to 99% for DNA and 93% to 99% for amino acid sequences, while the identities 
among tufB genes of enterococci varies from 77% to 92% for DNA and 91% to 
99% for amino acid sequences, indicating their different origins and evolution 
(Table 18). Since E. solitarius has been transferred to the genus Tetragenococcus, 
which is also a low G+C gram-positive bacterium, our sequence comparison did 
not include this species as an enterococcus. G+C content of enterococcal tufA 
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sequences ranged from 40.8% to 43.1%, while that of enterococcal tufB sequences 
varied from 37*8% to 46.3%. Based on amino acid sequence comparison, the 
enterococcal tuf A gene products share higher identities with those of Abiotrophia 
adiacens y Bacillus subtilis, Listeria monocytogenes, 5. aureus, and 5. epidermidis. 
On the other hand, the enterococcal tufB gene products share higher percentages of 
"amino" acid '"identity" with" the 7wjf "genes "of ~S. pneumoniae, S. pyogenes and 
Lactococcus lactis (Table 18). 

In order to elucidate whether the two enterococcal tuf sequences encode genuine 
EF-Tu, the deduced amino acid sequences of both genes were aligned with other 
EF-Tu sequences available in SWISSPROT (Release 38). Sequence alignment 
demonstrated that both gene products are highly conserved and carry all conserved 
residues present in this portion of prokaryotic EF-Tu (Figure 4). Therefore, it 
appears that both gene products could fulfill the function of EF-Tu. The partial tuf 
gene sequences encode the portion of EF-Tu from residues 117 to 317, numbered 
as in E. coll This portion makes up of the last four a-helices and two (3-strands of 
domain I, the entire domain II and the N-terminal part of domain III on the basis of 
the determined structures of E. coli EF-Tu. 

Based on the deduced amino acid sequences, the enterococcal tufB genes have 
unique conserved residues Lysl29, Leul40, Ser230, and Asp234 (E. coli 
numbering) that are also conserved in streptococci and L. lactis, but not in the 
other bacteria (Figure 4). All these residues are located in loops except for Ser230. 
In other bacteria the residue Ser230 is substituted for highly conserved Thr, which 
is the 5 th residue of the third |3-strand of domain II. This region is partially 
responsible for the interaction between the EF-Tu and aminoacyl-tRNA by the 
formation of a deep pocket for any of the 20 naturally occurring amino acids. 
According to our three-dimensional model (data not illustrated), the substitution 
Thr23(MSer in domain II of EF-Tu may have little impact on the capability of the 
pocket to accommodate any amino acid. However, the high conservation of Thr230 
comparing to the unique Ser substitution found only in streptococci and 11 
enterococci could suggest a subtle functional role for this residue. 

137 



WO 01/23604 PCT/CA00/01150 

The tuf gene sequences obtained for E. faecalis, £ aureus, S. pneumoniae and S. 
pyogenes were compared with their respective incomplete genome sequence. 
Contigs with more than 99% identity were identified. Analysis of the E. faecalis 
genome data revealed that the single E. faecalis tuf gene is located within an str 
operon where fw/is preceded by fits that encodes the elongation factor G. This str 
operon is present in S. aureus and B. subtilis but not in the two streptococcal 
genomes examined. The 700-bp or so sequence upstream the 5. pneumoniae tuf 
gene has no homology with any known gene sequences. In S. pyogenes, the gene 
upstream of tuf is similar to a cell division gent, fisW, suggesting that the tuf genes 
in streptococci are not arranged in a str operon. 

Phylogenetic analysis. Phylogenetic analysis of the tuf amino acid sequences 
with representatives of eubacteria, archeabacteria, and eukaryotes using neighbor- 
joining and maximum parsimony methods showed three major clusters 
representing the three kingdoms of life. Both methods gave similar topologies 
consistent with the rRNA gene data (data not shown). Within the bacterial clade, 
the tree is polyphyletic but tufA genes from all enterococcal species always 
clustered with those from other low G+C gram-positive bacteria (except for 
streptococci and lactococci), while the tufB genes of the 1 1 enterococcal species 
form a distinct cluster with streptococci and L lactis (Figure 5). Duplicated genes 
from the same organism do not cluster together, thereby not suggesting evolution 
by recent gene duplication. 

Southern hybridization. Southern hybridization of BgllUXbal digested 
genomic DNA from 12 enterococcal species tested with the tufA probe (DIG- 
labeled tufA fragment from E. faecium) yielded two bands of different sizes in 9 
species, which also carried two divergent tuf sequences according to their 
sequencing data. For E. faecalis and E. solitarius, a single band was observed 
indicating that one tuf gene is present (Figure 6). A single band was also found 
when digested genomic DNA from S. aureus, S. pneumoniae, and S. pyogenes 
were hybridized with the tufA probe (data not shown). For E. faecium, the presence 
of three bands can be explained by the existence of a Xbal restriction site in the 
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middle of the tufA sequence, which was confirmed by sequencing data. 
Hybridization with the tufB probe (DIG-labeled tufB fragment of E. faecium) 
showed a banding profile similar to the one obtained with the tufA probe (data not 
shown). 

DISCUSSION 



In this study, we have shown that two divergent copies of genes encoding the 
elongation factor Tu are present in some enterococcal species. Sequence data 
revealed that both genes are highly conserved at the amino acid level. One copy 
(tufA) is present in all enterococcal species, while the other (tufB) is present only in 
11 of the 17 enterococcal species studied. Based on 16S rRNA sequence analysis, 
these 1 1 species are members of three different enterococcal subgroups (E. avium, 
E. faecium, and E. gallinarum species groups) and a distinct species (£. dispar). 
Moreover, 16S rDNA phylogeny suggests that these 11 species possessing 2 tuf 
genes all share a common ancestor before they further evolved to become the 
modem species. Since the six other species having only one copy diverged from 
the enterococcal lineage before that common ancestor, it appears that the presence 
of one tuf gent in these six species is not attributable to gene loss. 

Two clusters of low G+C gram-positive bacteria were observed in the 
phylogenetic tree of the tuf genes: one contains a majority of low G+C gram- 
positive bacteria and the other contains lactococci and streptococci. This is similar 
to the finding on the basis of phylogenetic analysis of the 16S rRNA gene and the 
hrcA gene coding for a unique heat-shock regulatory protein. The enterococcal 
tufA genes branched with most of the low G+C gram-positive bacteria, suggesting 
that they originated from a common ancestor. On the other hand, the enterococcal 
tufB genes branched with the genera Streptococcus and Lactococcus that form a 
distinct lineage separated from other low G+C gram-positive bacteria (Figure 5). 
The finding that these EF-Tu proteins share some conserved amino acid residues 
unique to this branch also supports the idea that they may share a common 
ancestor. Although these conserved residues might result from convergent 
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evolution upon a specialized function, such convergence at the sequence level, 
even for a few residues, seems to be rare, making it an unlikely event. Moreover, 
no currently known selective pressure, if any, would account for keeping one 
versus two tuf genes in bacteria. The G+C contents of enterococcal tufA and tufB 
sequences are similar, indicating that they both originated from low G+C gram- 
positive-bacteriann accordance with the phylogenetic analysis: ~ 

The tuf genes are present in various copy numbers in different bacteria. 
Furthermore, the two tuf genes are normally associated with characteristic flanking 
genes. The two tuf gene copies commonly encountered within gram-negative 
bacteria are part of the bacterial str operon and tRNA-rw/B operon, respectively. 
The arrangement of tufA in the str operon was also found in a variety of bacteria, 
including Thermotoga maritima, the most ancient bacteria sequenced so far, 
Aquifex aeolicus, cyanobacteria, Bacillus sp., Micrococcus luteus, Mycobacterium 
tuberculosis, and Streptomyces sp. Furthermore, the tRNA-rw/B operon has also 
been identified in Aquifex aeolicus, Thermus thermophilus, and Chlamydia 
trachomatis. The two widespread tuf gene arrangements argue in favor of their 
ancient origins. It is noteworthy that most obligate intracellular parasites, such as 
Mycoplasma sp., R. prowazekii, B. burgdorferi, and T. pallidum, contain only one 
tuf gene. Their flanking sequences are distinct from the two conserved patterns as a 
result of selection for effective propagation by an extensive reduction in genome 
size by intragenomic recombination and rearrangement. 

Most gram-positive bacteria with low G+C content sequenced to date contain 
only a single copy of the fw/gene as a part of the str operon. This is the case for B. 
subtilis, S. aureus and E. faecalis. PCR amplification using a primer targeting a 
conserved region of the Jus gene and the /w/ft-specific primer SEQ ID NO. 660, but 
not the tufB- specific primer SEQ ID NO. 661, yielded the expected amplicons for 
all 17 enterococcal species tested, indicating the presence of the fus-tuf 
organization in all enterococci (data not shown). However, in the genomes of S. 
pneumoniae and S. pyogenes, the sequences flanking the tuf genes varies although 
the tuf gene itself remains highly conserved. The enterococcal tufB genes are 
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clustered with streptococci, but at present we do not have enough data to identify 
the genes flanking the enterococcal tufB genes. Furthermore, the functional role of 
the enterococcal tufB genes remains unknown. One can only postulate that the two 
divergent gene copies are expressed under different conditions. 

The amino acid sequence identities between the enterococcal tufA and tufB 

genes-are lower than either i) those between the enterococcal tufA and the tuf genes 

from other low G+C gram-positive bacteria (streptococci and lactococci excluded) 
or ii) those between the enterococcal tufB and streptococcal and lactococcal tuf 
genes. These findings suggest that the enterococcal tufA genes share a common 
ancestor with other low G+C gram-positive bacteria via the simple scheme of 
vertical evolution, while the enterococcal tufB genes are more closely related to 
those of streptococci and lactococci. The facts that some enterococci possess an 
additional tuf gene and that the single streptococcal ta/gene is not clustered with 
other low G+C gram-positive bacteria cannot be explained by the mechanism of 
gene duplication or intrachromosomal recombination. According to sequence and 
phylogenetic analysis, we propose that the presence of the additional copy of the 
tuf genes in 11 enterococcal species is due to horizontal gene transfer. The 
common ancestor of the 1 1 enterococcal species now carrying tufB genes acquired 
a tuf gene from an ancestral streptococcus or a streptococcus-related species during 
enterococcal evolution through gene transfer before the diversification of modern 
enterococci. Further study of the flanking regions of the gene may provide more 
clues for the origin and function of this gene in enterococci. 

Recent studies of genes and genomes have demonstrated that considerable 
horizontal transfer occurred in the evolution of aminoacyl-tRNA synthetases in all 
three kingdoms of life. The heterogeneity of 16S rRNA is also attributable to 
horizontal gene transfer in some bacteria, such as Streptomyces, Thermomonospora 
chromogena and Mycobacterium celatum. In this study, we provide the first 
example in support of a likely horizontal transfer of the tuf gene encoding the 
elongation factor Tu. This may be an exception since stringent functional 
constraints do not allow for frequent horizontal transfer of the tuf gene as with 
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other genes, However, enterococcal tuf genes should not be the only such 
exception as we have noticed that the phylogeny of Streptomyces tuf genes is 
equally or more complex than that of enterococci. For example, the three rw/-like 
genes in a high G+C gram-positive bacterium, S. ramocissimus, branched with the 
tuf genes of phylogenetically divergent groups of bacteria (Figure 5). Another 
example may_be the /u/genes in Clostridia,, whichrepresent a phylogenetically very- - 
broad range of organisms and form a plethora of lines and groups of various 
complexities and depths. Four species belonging to three different clusters within 
the genus Clostridium have been shown by Southern hybridization to carry two 
copies of the tuf gene. Further sequence data and phylogenetic analysis may help 
interpreting the evolution of the elongation factor Tu in these gram-positive 
bacteria. Since the tuf genes and 16S rRNA genes are often used for phylogenetic 
study, the existence of duplicate genes originating from horizontal gene transfer 
may alter the phylogeny of microorganisms when the laterally acquired copy of the 
gene is used for such analysis. Hence, caution should be taken in interpreting 
phylogenetic data. In addition, the two tuf genes in enterococci have evolved 
separately and are distantly related to each other phylogenetically. The 
enterococcal tufB genes are less conserved and unique to the 11 enterococcal 
species only. We previously demonstrated that the enterococcal tuf A genes could 
serve as a target to develop a DNA-based assay for identification of enterococci. 
The enterococcal tufB genes would also be useful in identification of these 11 
enterococcal species. 

EXAMPLE 43: 

Elongation Factor Tu (tuf) and the F-ATPase beta-subunit (atpD) as phylogenetic 
tools for species of the family Enterobacteriaceae. 

SUMMARY 
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The phylogeny of enterobacterial species commonly found in clinical samples was 
analyzed by comparing partial sequences of their elongation factor Tu (tuf) genes 
and their F-ATPase beta-subunit (atpD) genes. A 884-bp fragment for tuf and a 
884- or 871-bp fragment for atpD were sequenced for 88 strains of 72 species from 
25 enterobacterial genera. The atpD sequence analysis revealed a specific indel to 

p antoea Tatumella species* showing for the first time a~tight~ phylogenetic 

affiliation between these two genera. Comprehensive tuf and atpD phylogenetic 
trees were constructed and are in agreement with each other. Monophyletic genera 
are Yersinia, Pantoea, Edwardsiella, Cedecea, Salmonella, Serratia, Proteus, and 
Providencia. Analogous trees were obtained based on available 16S rDNA 
sequences from databases, tuf and atpD phylogenies are in agreement with the 16S 
rDNA analysis despite the smaller resolution power for the latter. In fact, distance 
comparisons revealed that tuf and atpD genes provide a better resolution for pairs 
of species belonging to the family Enterobacteriaceae. However, 16S rDNA 
distances are better resolved for pairs of species belonging to different families. In 
conclusion, tuf and atpD conserved genes are sufficiently divergent to discriminate 
different species inside the family Enterobacteriaceae and offer potential for the 
development of diagnostic tests based on DNA to identify enterobacterial species. 

INTRODUCTION 

Members of the family Enterobacteriaceae are facultatively anaerobic gram- 
negative rods, catalase-positive and oxydase-positive (Brenner, 1984). They are 
found in soil, water, plants, and in animals from insects to man. Many 
enterobacteria are opportunistic pathogens. In fact, members of this family are 
responsible for about 50 % of nosocomial infections in the United States (Brenner, 
1984). Therefore, this family is of considerable clinical importance. 

Major classification studies on the family Enterobacteriaceae are based on 
phenotypic traits (Brenner et al, 1999; Brenner et al, 1980; Dickey & Zumoff, 
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1988; Farmer III et al, 1980; Farmer III et al, 1985b; Farmer III et al, 1985a) 
such as biochemical reactions and physiological characteristics. However, 
phenotypically distinct strains may be closely related by genotypic criteria and may 
belong to the same genospecies (Bercovier et al, 1980; Hard & Dykhuizen, 1984). 
Also, phenotypically close strains (biogroups) may belong to different genospecies, 
— like - Klebsiella pneumoniae and Enterobacter ~aerogenes~(BrzrmeY,~ 1984)" for 
example. Consequently, identification and classification of certain species may be 
ambiguous with techniques based on phenotypic tests (Janda et al, 1999; Kitch et 
al, 1994; Sharma et al, 1990). 

More advances in the classification of members of the family Enterobacteriaceae 
have come from DNA-DNA hybridization studies (Brenner et al, 1993; Brenner et 
al, 1986; Brenner, et al, 1980; Farmer III, et al, 1980; Farmer III, et al, 1985b; 
Izard et al, 1981; Steigerwalt et al., 1976). Furthermore, the phylogenetic 
significance of bacterial classification based on 16S rDNA sequences has been 
recognized by many workers (Stackebrandt & Goebel, 1994; Wayne et al, 1987). 
However, members of the family Enterobacteriaceae have not been subjected to 
extensive phylogenetic analysis of 16S rDNA (Sproer et al, 1999). In fact, this 
molecule was not thought to solve taxonomic problems concerning closely related 
species because of its very high degree of conservation (Brenner, 1992; Sproer, et 
al, 1999). Another drawback of the 16S rDNA gene is that it is found in several 
copies within the genome (seven in Escherichia coli and Salmonella typhimuriwn) 
(Hill & Harnish, 1981). Due to sequence divergence between the gene copies, 
direct sequencing of PCR products is often not suitable to achieve a representative 
sequence (Cilia et al, 1996; Hill & Harnish, 1981). Other genes such as gap and 
ompA (Lawrence et al, 1991), rpoB (Mollet et al, 1997), and infB (Hedegaard et 
al, 1999) were used to resolve the phylogeny of enterobacteria. However, none of 
these studies covered an extensive number of species. 
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fw/and atpD are the genes encoding the elongation factor Tu (EF-Tu) and the F- 
ATPase beta-subunit, respectively. EF-Tu is involved in peptide chain formation 
(Ludwig et al, 1990). The two copies of the tuf gene {tuf A and tufB) found in 
enterobacteria (Sela et al, 1989) share high identity level (99 %) in Salmonella 
typhimurium and in E. coli. The recombination phenomenon could explain 

sequence homogenizationbetweenthetwo copies (Ab^ulkariST& Hughes, 1996r 

Grunberg-Manago, 1996). F-ATPase is present on the plasma membranes of 
eubacteria (Nelson & Taiz, 1989). It functions mainly in ATP synthesis (Nelson & 
Taiz, 1989) and the beta-subunit contains the catalytic site of the enzyme. EF-Tu 
and F-ATPase are highly conserved throughout evolution and shows functional 
constancy (Amann et al, 1988; Ludwig, et al, 1990). Recently, phylogenies based 
on protein sequences from EF-Tu and F-ATPase beta-subunit showed good 
agreement with each other and with the rDNA data (Ludwig et ai, 1993). 

We elected to sequence 884-bp fragments of tuf and atpD from 88 clinically 
relevant enterobacterial strains representing 72 species from 25 genera. These 
sequences were used to create phylogenetic trees that were compared with 16S 
rDNA trees. These trees revealed good agreement with each others and 
demonstrated the high resolution of tuf and atpD phylogenies at the species level. 

MATERIALS AND METHODS 

Bacterial strains and genomic material. All bacterial strains used in this study 
were obtained from the American Type Culture Collection (ATCC) or the 
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ). 
These enterobacteria can all be recovered from clinical specimens, but not all are 
pathogens. Whenever possible, we choose type strains. Identification of all strains 
was confirmed by classical biochemical tests using the automated system 
MicroScan WalkAway-96 system equipped with a Negative BP Combo Panel 
Type 15 (Dade Behring Canada). Genomic DNA was purified using the G NOME 
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DNA kit (Bio 101). Genomic DNA from Yersinia pestis was kindly provided by 
Dr. Robert R. Brubaker. Strains used in this study and their descriptions are shown 
in Table 19. 

PCR primers. The eubacterial tuf and atpD gene sequences available from public 
"databases" were"analyzed "using the GCG package (version 8. 0)"(Genetics Computer 
Group). Based on multiple sequence alignments, two highly conserved regions 
were chosen for each genes, and PCR primers were derived from these regions 
with the help of Oligo primer analysis software (version 5.0) (National 
Biosciences). A second 5' primer was design to amplify the gene atpD for few 
enterobacteria difficult to amplifiy with the first primer set. When required, the 
primers contained inosines or degeneracies to account for variable positions. 
Oligonucleotide primers were synthesized with a model 394 DNA/RNA 
synthesizer (PE Applied Biosystems). PCR primers used in this study are listed in 
Table 20. 

DNA sequencing. An 884-bp portion of the tuf gene and an 884-bp portion (or 
alternatively an 871 -bp portion for a few enterobacterial strains) of the atpD gene 
were sequenced for all enterobacteria listed in the first strain column of Table 19. 
Amplification was performed with 4 ng of genomic DNA. The 40-ptl PCR 
mixtures used to generate PCR products for sequencing contained 10 /iM each 
primer, 200 /iM each deoxyribonucleoside triphosphate (Pharmacia Biotech), 10 
mM Tris-HCl (pH 9 0 at 25 °C), 50 mM KC1, 0-1 % (w/v) Triton X-100, 2-5 mM 
MgCl2, 0 05 mM BSA, 0-3 U of Taq DNA polymerase (Promega) coupled with 
TaqStart™ antibody (Clontech Laboratories). The TaqStart™ neutralizing 
monoclonal antibody for Taq DNA polymerase was added to all PCR mixtures to 
enhance efficiency of amplification (Kellogg et al, 1994). The PCR mixtures were 
subjected to thermal cycling (3 min at 95 °C and then 35 cycles of 1 min at 95 °C, 
1 min at 55 °C for tuf or 50 °C for atpD, and 1 min at 72 °C, with a 7-min final 
extension at 72 °C) using a PTC-200 DNA Engine thermocycler (MJ Research). 
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PCR products having the predicted sizes were recovered from an agarose gel 
stained for 15 min with 0 02 % of methylene blue followed by washing in sterile 
distilled water for 15 min twice (Flores et al, 1992). Subsequently, PCR products 
having the predicted sizes were recovered from gels using the QIAquick gel 
extraction kit (QIAGEN). 

Both strands of the purified amplicons were sequenced using the ABI Prism 
BigDye Terminator Cycle Sequencing Ready Reaction Kit (PE Applied 
Biosystems) on an automated DNA sequencer (Model 377). Amplicons from two 
independant PCR amplifications were sequenced for each strain to ensure the 
absence of sequencing errors attributable to nucleotide miscorporations by the Taq 
DNA polymerase. Sequence assembly was performed with the aid of Sequencher 
3.0 software (Gene Codes). 

Phylogenetic analysis. Multiple sequence alignments were performed using 
Pilellp from the GCG package (Version 10.0) (Genetics Computer Group) and 
checked by eye with the editor SeqLab to edit sequences if necessary and to note 
which regions were to be excluded for phylogenetic analysis. Vibrio cholerae and 
Shewanella putrefaciens were used as outgroups. Bootstrap subsets (750 sets) and 
phylogenetic trees were generated with the Neighbor Joining algorithm from Dr. 
David Swofford's PAUP (Phylogenetic Analysis Using Parsimony) Software 
version 4.0b4 (Sinauer Associates) and with tree-bisection branch-swapping. The 
distance model used was Kimura (1980) two-parameter. Relative rate test was 
performed with the aid of Phyltest program version 2.0 (c). 

RESULTS AND DISCUSSION 

DNA amplification, sequencing and sequence alignments 

A PCR product of the expected size of 884 bp was obtained for tuf and of 884 or 
871 bp for atpD from all bacterial strains tested. After subtracting for biased 
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primer regions and ambiguous single strand data, sequences of at least 721 bp for 
tuf and 713 bp for atpD were submitted to phylogenetic analyses. These sequences 
were aligned with tuf and atpD sequences available in databases to verify that the 
nucleotide sequences indeed encoded a part of tested genes. Gaps were excluded to 
perform phylogenetic analysis. 



Signature sequences 

From the sequence alignments obtained from both tested genes, only one insertion 
was observed. This five amino acids insertion is located between the positions 325 
and 326 of atpD gene of £. coli strain K-12 (Saraste et al, 1981) and can be 
considered a signature sequence of Tatumella ptyseos and Pantoea species (Fig. 7). 
The presence of a conserved indel of defined length and sequence and flanked by 
conserved regions could suggest a common ancestor, particularly when members 
of a given taxa share this indel (Gupta, 1998). To our knowledge, high relatedness 
between the genera Tatumella and Pantoea is demonstrated for the first time. 

Enterobacter agglomerans ATCC 27989 sequence does not possess the five amino 
acid indel (Fig. 7). This indel could represent a useful marker to help resolve the 
Enterobacter agglomerans and Pantoea classification. Indeed, the transfer of 
Enterobacter agglomerans to Pantoea agglomerans was proposed in 1989 by 
Gavini et al (Gavini et al, 1989). However, some strains are provisionally 
classified as Pantoea sp. until their interrelatedness is elucidated (Gavini, et al, 
1989). Since the transfer was proposed, the change of nomenclature has not yet 
been made for all Enterobacter agglomerans in the ATCC database. The absence 
of the five amino acids indel suggests that some strains of Enterobacter 
agglomerans most likely do not belong to the genus Pantoea. 
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Phylogenetic trees based oh partial tuf sequences, atpD sequences, and 
published 16S rDNA data of members of the Enterobacteriaceae. 
Representative trees constructed from tuf and atpD sequences with the neighbor- 
joining method are shown in Fig. 8. The phylogenetic trees generated from partial 
tuf sequences and atpD sequences are very similar. Nevertheless, atpD tree shows 
J?ore monophyletic groups. corresponding to_ species Jhat_belong_tO-the_same-genus. - 
These groups are more consistent with the actual taxonomy. For both genes, some 
genera are not monophyletic. These results support previous phylogenies based on 
the genes gap and ompA (Lawrence, et al y 1991), rpoB (Mollet, et al, 1997), and 
infB (Hedegaard, et al y 1999) which all showed that the genera Escherichia and 
Klebsiella are polyphyletic. There were few differences in branching between tuf 
and atpD genes. 

Even though Pantoea agglomerans and Pantoea dispersa indels were excluded for 
phylogenetic analysis, these two species grouped together and were distant from 
Enterobacter agglomerans ATCC 27989, adding another evidence that the latter 
species is heterogenous and that not all members of this species belong to the 
genus Pantoea. In fact, the E. agglomerans strain ATCC 27989 exhibits branch 
lengths similar to others Enterobacter species with both genes. Therefore, we 
suggest that this strain belong to the genus Enterobacter until further 
reclassification of that genus. 

tuf and atpD trees exhibit very short genetic distances between taxa belonging to 
the same genetic species including species segregated for clinical considerations. 
This first concern E. coli and Shigella species that were confirmed to be the same 
genetic species by hybridization studies (Brenner et al, 1972; Brenner et al, 1972; 
Brenner et al., 1982) and phylogenies based on 16S rDNA (Wang etal, 1997) and 
rpoB genes (Mollet, et al, 1997). Hybridization studies (Bercovier, et al, 1980) 
and phylogeny based on 16S rDNA genes (Ibrahim et al, 1994) demonstrated also 
that Yersinia pestis and Y. pseudotuberculosis are the same genetic species. Among 
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Yersinia pestis and Y. pseudotuberculosis, the three Klebsiella pneumoniae 
subspecies, E. coli-Shigella species, and Salmonella choleraesuis subspecies, 
Salmonella is a less tightly knit species than the other genetic species. The same is 
true for E. coli and Shigella species. 

-Escherichia fergusonii- is-very- close to E. coli-Shigella genetic-speciesr This ~ 

observation is corroborated by 16S rDNA phylogeny (McLaughlin et aL, 2000) but 
not by DNA hybridization values. In fact, E. fergusonii is only 49% to 63% related 
to E. coli-Shigella (Farmer IE, et al, 1985b). It was previously observed that very 
recently diverged species may not be recognizable based on 16S rDNA sequences 
although DNA hybridization established them as different species (Fox et al, 
1992). Therefore, E. fergusonii could be a new "quasi-species". 

atpD phylogeny revealed Salmonella subspecies divisions consistent with the 
actual taxonomy. This result was already observed by Christensen et al. 
(Christensen & Olsen, 1998). Nevertheless, tuf partial sequences discriminate less 
than atpD between Salmonella subspecies. 

Overall, tuf and atpD phylogenies exhibit enough divergence between species to 
ensure efficient discrimination. Therefore, it could be easy to distinguish 
phenotypically close enterobacteria belonging to different genetic species such as 
Klebsiella pneumoniae and Enterobacter aerogenes. 

Phylogenetic relationships between Salmonella, E. coli and C.freundii are not well 
defined. 16S rDNA and 23S rDNA sequence data reveals a closer relationship 
between Salmonella and E. coli than between Salmonella and C. freundii 
(Christensen et aL, 1998), while DNA homology studies (Selander et al., 1996) 
and infB phylogeny (Hedegaard, et al, 1999) showed that Salmonella is more 
closely related to C. freundii than to E. coli. In that regard, tuf and atpD 
phylogenies are coherent with 16S rDNA and 23S rDNA sequence analysis. 
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Phylogenetic analyses were also performed using amino acids sequences, tuf tree 
based on amino acids is characterized by a better resolution between taxa outgroup 
and taxa ingroup (enterobacteria) than tree based on nucleic acids whereas atpD 
trees based on amino acids and nucleic acids give almost the same resolution 
between-taxa outgroup and ingroup (data not shown). « 

Relative rate test (or two cluster test (Takezaki et al., 1995)) evaluates if evolution 
is constant between two taxa. Before to apply the test, the topology of a tree is 
determined by tree-building method without the assumption of rate constancy. 
Therefore, two taxa (or two groups of taxa) are compared with a third taxon that is 
an outgroup of the first two taxa (Takezaki, et ai y 1995). Few pairs of taxa that 
exhibited a great difference between their branch lengths at particular nodes were 
chosen to perform the test. This test reveals that tuf and atpD are not constant in 
their evolution within the family Enterobacteriaceae. For tuf, for example, the 
hypothesis of rate constancy is rejected (Z value higher than 1 96) between 
Yersinia species. The same is true for Proteus species. For atpD, for example, 
evolution is not constant between Proteus species, between Proteus species and 
Providencia species, and between Yersinia species and Escherichia coli. For 16S 
rDNA, for example, evolution is not constant between two E. coli, between E. coli 
and Enterobacter aerogenes, and between E. coli and Proteus vulgaris. These 
results suggest that tuf atpD and 16S rDNA could not serve as a molecular clock 
for the entire family Enterobacteriaceae. 

Since the number and the nature of taxa can influence topology of trees, 
phylogenetic trees from tuf and atpD were reconstructed using sequences 
corresponding to strains for which 16S rDNA genes were published in GenEMBL. 
These trees were similar to those generated using 1 6S rDNA (Fig. 9). Nevertheless, 
16S rDNA tree gave poorer resolution power than tuf and atpD gene trees. Indeed, 
these latter exhibited less multifurcation (polytomy) than the 16S rDNA tree. 
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Comparison of distances based on tuf, atpD, and 16S rDNA data. 

tuf atpD, and 16S rDNA distances (i.e. the number of differences per nucleotide 
site) were compared with each other for each pair of strains. We found that the tuf 
and atpD distances were respectively 2-268 ± 0-965 and 2-927 ± 0-896 times larger 
-than 16S-rDNA-distances (Figr 10a-and b). atpD distances-were i-445"±~ 0-570 " 
times larger than tuf distances (Fig. 10c). Figure 10 also shows that the tuf, atpD, 
and 16S rDNA distances between members of different species of the same genus 
(0 053 ± 0 034, 0 060 ± 0-020, and 0 024 ± 0 010, respectively) were in mean 
smaller than the distances between members of different genera belonging to the 
same family (0103 ± 0 053, 0129 ± 0 051, and 0 044 ± 0 013, respectively). 
However, the overlap exhibits with standard deviations add to a focus of evidences 
that some enterobacterial genera are not well defined (Brenner, 1984). In fact, 
many distances for pairs of species especially belonging to the genera Escherichia, 
Shigella, Enter obacter, Citrobacter, Klebsiella, and Kluyvera overlap distances for 
pairs of species belonging to the same genus (Fig. 10). For example, distances for 
pairs composed by species of Citrobacter and species of Klebsiella overlap 
distances for pairs composed by two Citrobacter or by two Klebsiella. 

Observing the distance distributions, 16S rDNA distances reveal a clear separation 
between the families Enterobacteriaceae and Vibrionaceae despite the fact that the 
family Vibrionaceae is genetically very close to the Enterobacteriaceae (Fig. 10a 
and b). Nevertheless, tuf and atpD show higher discriminating power below the 
family level (Fig. 10a and b). 

There were some discrepancies in the relative distances for the same pairs of taxa 
between the two genes studied. First, distances between Yersinia species are at 
least two times lower for atpD than for tuf (Fig. 10c). Also, distances at the family 
level (between Enterobacteriaceae and Vibrionaceae) show that 
Enterobacteriaceae is a tightlier knit family with atpD gene (Proteus genus 
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excepted) than with m/ gene. Both genes well delineate taxa belonging to the same 
species. There is one exception with atpD: Klebsiella planticola and K 
ornithinolithica belong to the same genus but fit with taxa belonging to the same 
species (Fig. 10a and c). These two species are also very close genotypically with 
tuf gene. This suggest that Klebsiella planticola and K. ornithinolithica could be 
two - newborn-species, -tuf- and- atpD -genes- exhibity-little- distances-bet ween - 
Escherichia fergusonii and E. coli-Shigella species. Unfortunately, comparison 
with 16S rDNA could not be achieved because the E. fergusonii 16S rDNA 
sequence is not yet accessible in GenEMBL database. Therefore, the majority of 
phenotypically close enterobacteria could be easily discriminated genotypically 
using tuf and atpD gene sequences. 

In conclusion, tuf and atpD genes exhibit phylogenies consistent with 16S rDNA 
genes phylogeny. For example, they reveal that the family Enterobacteriaceae is 
monophyletic. Moreover, tuf and atpD distances provide a higher discriminating 
power than 16S rDNA distances. In fact, tuf and atpD genes discriminate well 
between different genospecies and are conserved between strains of the same 
genetic species in such a way that primers and molecular probes for diagnostic 
purposes could be designed. Preliminary studies support these observations and 
diagnostic tests based on fw/and atpD sequence data to identify enterobacteria are 
currently under development. 

EXAMPLE 44: 

Testing new pairs of PCR primers selected from two species-specific genomic 
DNA fragments which are objects of our assigned US patent 6.001,564 

Objective : The goal of these experiments is to demonstrate that it is relatively easy 
for a person skilled in the art to find other PCR primer pairs from the species-specific 
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fragments used as targets for detection and identification of a variety of 
microorganisms. In fact, we wish to prove that the PCR primers previously tested by 
our group and which are objects of the present patent application are not the only 
possible good choices for diagnostic purposes. For this example, we used diagnostic 
targets described in our assigned US patent 6,001,564. 



Experimental strategy : We have selected randomly two species-specific genomic 
DNA fragments for this experiment. The first one is the 705-bp fragment specific to 
Staphylococcus epidermidis (SEQ ID NO: 36 from US patent 6,001,564) while the 
second one is the 466-bp fragment specific to Moraxella catarrhalis (SEQ ID NO: 
29 from US patent 6,001,564). Subsequently, we have selected from these two 
fragments a number of PCR primer pairs other than those previously tested. We have 
chosen 5 new primer pairs from each of these two sequences which are well 
dispersed along the DNA fragment (Figures 11 and 12). We have tested these 
primers for their specificity and compared them with the original primers previously 
tested. For the specificity tests, we have tested all bacterial species closely related to 
the target species based on phylogenetic analysis with three conserved genes (rRNA 
genes, tuf and atpD ). The rational for selecting a restricted number of bacterial 
species to evaluate the specificity of the new primer pairs is based on the fact that the 
lack of specificity of a DNA-based assay is attributable to the detection of closely 
related species which are more similar at the nucleotide level. Based on the 
phylogenetic analysis, we have selected (i) species from the closely related genus 
Staphylococcus, Enter ococcus, Streptococcus and Listeria to test the specificity of 
the S. epidermidis-sptcific PCR assays and (ii) species from the closely related genus 
Moraxella, Kingella and Neisseria to test the specificity of the Af. catarrhalis- 
specific PCR assays. 

Materials and methods 
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Bacterial strains. All bacterial strains used for these experiments were obtained 
from the American Type Culture Collection (ATCC, Rockville, MP). 



Genomic DNA isolation. Genomic DNA was purified from the ATCC reference 
strains by using the G-nome DNA kit (Bio 101 Inc., Vista, CA). 

Oligonucleotide design and synthesis. PCR primers were designed with the help of 
the Oligo™ primer analysis software Version 4.0 (National Biosciences Inc., 
Plymouth, Minn.) and synthesized using a model 391 DNA synthesizer (Applied 
Biosystems, Foster City, CA). 

PCR assays. All PCR assays were performed by using genomic DNA purified from 
reference strains obtained from the ATCC. One /xl of purified DNA preparation 
(containing 0.01 to 1 ng of DNA per fi\) was added directly into the PCR reaction 
mixture. The 20 /zL PCR reactions contained final concentrations of 50 mM KC1, 10 
mM Tris-HCl (pH 9.0), 0.1% Triton X-100, 2.5 mM MgCl 2 , 0.4 fiM of each primer, 
200 f*M of each of the four dNTPs and 0.5 unit of Taq DNA polymerase (Promega, 
Madison, WI) combined with the TaqStart™ antibody (Clontech Laboratories Inc., 
Palo Alto, CA). An internal control was integrated into all amplification reactions to 
verify the efficiency of the amplification reaction as well as to ensure that significant 
PCR inhibition was absent. Primers amplifying a region of 252 bp from a control 
plasmid added to each amplification reaction were used to provide the internal 
control. PCR reactions were then subjected to thermal cycling (3 min at 95°C 
followed by 30 cycles of 1 second at 95°C for the denaturation step and 30 seconds at 
50 to 65°C for the annealing-extension step) using a PTC-200 thermal cycler (MJ 
Research Inc., Watertown, MA). PCR amplification products were then analyzed by 
standard agarose gel (2%) electrophoresis. Amplification products were visualized in 
agarose gels containing 0.25 /ig/mL of ethidium bromide under UV at 254 nm. 



Results 
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Tables 21 and 22 show the results of specificity tests with the 5 new primer pairs 
selected from SEQ ID NO: 29 (specific to Af. catarrhalis from US patent 6,001,564) 
and SEQ ID NO: 36 (specific to 5. epidermidis from US patent 6,001,564), 
respectively. In order to evaluate the performance of these new primers pairs, we 
„ compared them .in parallel with-the original-primer pairs previously tested; 

For M. catarrhalis, all of the 5 selected PCR primer pairs were specific for the target 
species because none of the closely related species could be amplified (Table 21). In 
fact, the comparison with the original primer pair SEQ ID NO: 118 + SEQ ID NO: 
119 (from US patent 6,001 ,564) revaled that all new pairs showed identical results in 
terms of specificity and sensitivity thereby suggesting their suitability for diagnostic 
purposes. 

For 5. epidermidis, 4 of the 5 selected PCR primer pairs were specific for the target 
species (Table 22). It should be noted that for 3 of these four primer pairs the 
annealing temperature had to be increased from 55 °C to 60 or 65 °C to attain 
specificity for 5. epidermidis. Again the comparison with the original primer pair 
SEQ ID NO: 145 + SEQ ID NO: 146 (from US patent 6,001,564) revealed that these 
four primer pairs were as good as the original pair. Increasing the annealing 
temperature for the PCR amplification is well known by persons skilled in the art to 
be a very effective way to improve the specificity of a PCR assay (Persing et aU 
1993, Diagnostic Molecular Microbiology: Principles and Applications, American 
Society for Microbiology, Washington, D.C.; Ehrlich and Greenberg, 1994, PCR- 
based Diagnostics in Infectious Disease, Blackwell Scientific Publications, Boston, 
MA). In fact, those skilled in the art are well aware of the fact that the annealing 
temperature is critical for the optimization of PCR assays. Only the primer pair 
VBsep3 + VBsep4 amplified bacterial species other than 5. epidermidis including the 
staphylococcal species S. capitis, S. cohnii, S. aureus, S. haemolyticus and S. hominis 
(Table 22). For this non-specific primer pair, increasing the annealing temperature 
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from 55 to 65 °C was not sufficient to attain the desired specificity. One possible 
explanation for the fact that it appears sligthly easier to select species-specific 
primers for M. catarrhalis than for S. epidermidis is that M. catarrhalis is more 
isolated in phylogenetic trees than S. epidermidis. The large number of coagulase 
negative staphylococcal species such as S. epidermidis is largely responsible for this 
~ phylogenetic clustering: ~ 

Conclusion 

These experiment clearly show that it is relatively easy for a person skilled in the art 
to select, from the species-specific DNA fragments selected as target for 
identification, PCR primer pairs suitable for diagnostic purposes other than those 
previously tested. The amplification conditions can be optimize by modifying critical 
variables such as the annealing temperature to attain the desired specificity and 
sensitivity. Consequently, we consider that it is legitimate to claim any possible 
primer sequences selected from the species-specific fragment and that it would be 
unfair to grant only the claims dealing with the primer pairs previously tested. By 
extrapolation, these results strongly suggest that it is also relatively easy for a person 
skilled in the art to select, from the species-specific DNA fragments, DNA probes 
suitable for diagnostic purposes other than those previously tested. 

EXAMPLE 45: 

Testing modified versions of PCR primers derived from the sequence of several 
primers which are objects of US patent 6,00L564. 

Objective : The purpose of this project is to verify the efficiency of amplification by 
modified PCR primers derived from primers previously tested. The types of primer 
modifications to be tested include (i) variation of the sequence at one or more 
nucleotide positions and (ii) increasing or reducing the length of the primers. For this 
example, we used diagnostic targets described in US patent 6,001,564. 
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Experimental strategy : 

a) Testing primers with nucleotide changes 

We Jiave designed 13 new printers Mvhich_are ^nv_ed_ from_the S. jepidermidis-. 

specific SEQ ID NO: 146 from US patent 6,001,564 (Table 23). These primers have 
been modified at one or more nucleotide positions. As shown in Table 23, the 
nucleotide changes were introduced all along the primer sequence. Furthermore, 
instead of modifying the primer at any nucleotide position, the nucleotide changes 
were introduced at the third position of each codon to better reflect potential genetic 
variations in vivo. It should be noted that no nucleotide changes were introduced at 
the 3' end of the oligonucleotide primers because those skilled in the art are well 
aware of the fact that mimatches at the 3' end should be avoided (Persing et a/., 
1993, Diagnostic Molecular Microbiology: Principles and Applications, American 
Society for Microbiology, Washington, D.C.). All of these modified primers were 
tested in PCR assays in combination with SEQ ID NO: 145 from US patent 
6,001,564 and the efficiency of the amplification was compared with the original 
primer pair SEQ ID NO: 145 + SEQ ID NO: 146 previously tested in US patent 
6,001,564. 

b) Testing shorter or longer versions of primers 

We have designed shorter and longer versions of the original S. epidermidis-specific 
PCR primer pair SEQ ID NO: 145 + 146 from US patent 6,001,564 (Table 24) as 
well as shorter versions of the original P. aeruginosa-specific primer pair SEQ ID 
NO: 83 + 84 from US patent 6,001,564 (Table 25). As shown in Tables 24 and 25, 
both primers of each pair were shortened or lengthen to the same length. Again, those 
skilled in the art know that the melting temperature of both primers from a pair 
should be similar to avoid preferential binding at one primer binding site which is 
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detrimental in PCR (Persing et al, 1993, Diagnostic Molecular Microbiology: 
Principles and Applications, American Society for Microbiology, Washington, D.C.; 
Ehrlich and Greenberg, 1994, PCR-based Diagnostics in Infectious Disease, 
Blackwell Scientific Publications, Boston, MA). All of these shorter or longer primer 
versions were tested in PCR assays and the efficiency of the amplification was 
_ .compared .with the original-primer pair-SEQID-NOsl 45 and~146, 

Materials and methods 

See the Materials and methods section of Example 44. 

Results 

a) Testing primers with nucleotide changes 

The results of the PCR assays with the 13 modified versions of SEQ ID NO: 146 
from US patent 6,001,564 are shown in Table 23. The 8 modified primers having a 
single nucleotide variation showed an efficiency of amplification identical to the 
original primer pair based on testing with 3 different dilutions of genomic DNA. The 
four primers having two nucleotide variations and primer VBmutl2 having 3 
nucleotide changes also showed PCR results identical to those obtained with the 
original pair. Finally, primer VBmutl3 with four nucleotide changes showed a 
reduction in sensitivity by approximately one log as compared with the original 
primer pair. However, reducing the annealing temperature from 55 to 50 °C gave an 
efficiency of amplification very similar to that observed with the original primer pair 
(Table 23). In fact, reducing the annealing temperature of PCR cycles represents an 
effective way to reduce the stringency of hybridization for the primers and 
consequently allows the binding of probes with mismatches (Persing et al, 1993, 
Diagnostic Molecular Microbiology: Principles and Applications, American Society 
for Microbiology, Washington, D.C.). Subsequently, we have confirmed the 
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specificity of the PCR assays with each of these 13 modified versions of SEQ ID 
NO: 146 from US patent 6,001,564 by performing amplifications from all bacterial 
species closely related to 5. epidermidis which are listed in Table 22. 

b) Testing shorter or longer versions of primers 

For these experiments, two primer pairs were selected: i) SEQ ID NO: 145 + 146 
from US patent 6,001,564 (specific to S. epidermidis) which are AT rich and ii) SEQ 
ID NO: 83 + 84 (specific to P. aeruginosa) which are GC rich. For the AT rich 
sequence, primers of 15 to 30 nucleotide in length were designed (Table 24) while 
for the GC rich sequences, primers of 13 to 19 nucleotide in length were designed 
(Table 25). 

Table 24 shows that, for an annealing temperature of 55 °C, the 30- 25-, 20- and 17- 
nucleotide versions of SEQ ID NO: 145 and 146 from US patent 6,001,564 all 
showed identical results as compared with the original primer pair except that the 17- 
nucleotide version amplified slightly less efficiently the S. epidermidis DNA. 
Reducing the annealing temperature from 55 to 45 °C for the 17-nucleotide version 
allowed to increase the amplification efficiency to a level very similar to that with the 
original primer pair (SEQ ID NO: 145 + 146 from US patent 6,001,564). Regarding 
the 15-nucleotide version, there was amplification ofS. epidermidis DNA only when 
the annealing temperature was reduced to 45 °C. Under those PCR conditions the 
assay remained S. epidermidis-specific but the amplification signal with S. 
epidermidis DNA was sligthly lower as compared with the original primer pair. 
Subsequently, we have further confirmed the specificity of the shorter or longer 
versions by amplifying DNA from all bacterial species closely related to 5. 
epidermidis which are listed in Table 22. 

Table 25 shows that, for an annealing temperature of 55 °C, all shorter versions of 
SEQ ID NO: 83 and 84 from US patent 6,001,564 showed identical PCR results as 

160 



WO 01/23604 PCT/CA00/01150 

compared with the original primer pair. As expected, these results show that it is 
simpler to reduce the length of GC rich as compared with AT rich. This is 
attributable to the fact that GC binding is more stable than AT binding. 

Conclusion 



a) Testing primers with nucleotide changes 

The above experiments clearly show that PCR primers may be modified at one or 
more nucleotide positions without affecting the specificity and the sensitivity of the 
PCR assay. These results strongly suggest that a given oligonucleotide can detect 
variant genomic sequences from the target species. In fact, the nucleotide changes in 
the selected primers were purposely introduced at the third position of each codon to 
mimic nucleotide variation in genomic DNA. Thus we conclude that it is justified to 
claim "a variant thereof for i) the SEQ IDs of the fragments and oligonucleotides 
which are object of the present patent application and ii) genomic variants of the 
target species. 

b) Testing shorter or longer versions of primers 

The above experiments clearly show that PCR primers may be shorter or longer 
without affecting the specificity and the sensitivity of the PCR assay. We have 
showed that oligonucleotides ranging in sizes from 13 to 30 nucleotides may be as 
specific and sensitive as the original primer pair from which they were derived. 
Consequently, these results suggest that it is not exaggerated to claim sequences 
having at least 12 nucleotide in length. 
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This invention has been described herein above, and it is readily 
apparent that modifications can be made thereto without departing from the 
spirit of this invention. These modifications are under the scope of this 
invention, as defined in the appended claims. 
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Table 1. Distribution (%) of nosocomial pathogens for various human infections in USA 
(1990-1 992) 1 . 
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5 


5 


0 


Other Candida 


2 


1 


3 


1 


0 


Enterobacter sp. 


5 


7 


4 


12 


2 


Acinetobacter sp. 


1 


1 


2 


4 


2 


C/frobactersp. 


2 


1 


1 


1 


0 


Serratia marcescens 


1 


1 


1 


3 


1 


Other Klebsiella 


1 


1 


1 


2 


1 


Others 


0 


6 


4 


5 


0 



Data recorded by the National Nosocomial Infections Surveillance (NNIS) from 80 hospitals 
(Emori and Gaynes, 1993, Clin. Microbiol. Rev., 6:428-442). 



Urinary tract infection. 

3 

Surgical site infection. 
35 Bloodstream infection. 



5 



Cerebrospinal fluid. 
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Table 2. Distribution (%) of bloodstream infection pathogens in Quebec (1995), 
Canada (1992), UK (1969-1988) and USA (1990-1992). 



5 




Organism 


Quebec 1 


Canada 2 


3 

UK 

Community- 
acquired 


Hospital- 
acquired 


USA* 

Hospital- 
acquired 




£ coli 


1 15.6 


53.8 


24.8 


20.3 


5.0 


10 


S. epidermidis and 














other CoNS 


25.8 




0.5 


7.2 


31.0 




S. aureus 


9.6 




9.7 


19.4 


16.0 




S. pneumoniae 


6.3 




22.5 


2.2 






E. faecalis 


3.0 




1.0 


4.2 




15 


E. faecium 


2.6 




0.2 


0.5 






Enterococcus sp. 








9.0 






H. influenzae 


1.5 




3.4 


0.4 






P. aeruginosa 


1.5 


8.2 


1.0 


8.2 


3.0 




K. pneumoniae 


3.0 


11.2 


3.0 


9.2 


4.0 


20 


P. mirabilis 




3.9 


2.8 


5.3 


1.0 




S. pyogenes 






1.9 


0.9 






Enterobacter sp. 


4.1 


5.5 


0.5 


2.3 


4.0 




Candida sp. 


8.5 






1.0 


8.0 




Others 


18.5 


17.4 


28.7 


18.9 


19.0 



25 



30 



Data obtained for 270 isolates collected at the Centre Hospitalier de I'Universite Laval (CHUL) 
during a 5 month period (May to October 1995). 

Data from 10 hospitals throughout Canada representing 941 gram-negative isolates. 
(Chamberland etai, 1992, Clin. Infect D/s., 15:615-628). 

Data from a 20-year study (1969-1988) for nearly 4000 isolates. (Eykyn ef a/., 1990, 1 
Antimicrob. Chemother., Suppl. C, 25:41-58). 

Data recorded by the National Nosocomial Infections Surveillance (NNIS) from 80 hospitals 
(Emori and Gaynes, 1993, Clin. Microbiol. Rev., 6:428-442). 



Coagulase-negative staphylococci. 
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Table 3. Distribution of positive and negative clinical specim ns tested at the 
microbiology laboratory of the CHUL (February 1994 - January 1995). 



Clinical specimens 
and/or sites 


No. of samples 
tested (%) 


% of positive 
specimens 


% of negative 
specimens 


Urine 


17.981 (54.5) 


19.4 


80.6 


Blood culture/marrow 


10^dToT30.4j 


679 


93.1 


Sputum 


1,266 (3.8) 


68.4 


31.6 


Superficial pus 


1,136(3.5) 


72.3 


27.7 


Cerebrospinal fluid 


553(1.7) 


1.0 


99.0 


Synovial fluid 


523(1.6) 


2.7 


97.3 


Respiratory tract 


502 (1.5) 


56.6 


43.4 


Deep pus 


473 (1.4) 


56.8 


43.2 


Ears 


289 (0.9) 


47.1 


52.9 


Pleural and pericardial fluid 


132 (0.4) 


1.0 


99.0 


Peritoneal fluid 


101(0.3) 


28.6 


71.4 


Total: 


32,966(100.0) 


20.0 


80.0 
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Table 4. Example f microbial species for which tuf and/or atpD and/or recA nucleic acids and/ r 

sequences are used in the present invention. 



Bacterial species 



Abiotrophia adiacens 
Abiotrophia defectiva 

Achromobacter xylosoxidans subsp. denitrificans 65 

10 Acetobacterium woodi 

Acetobacter aceti 
Acetobacter altoacetigenes 
Acetobacter pofyoxogenes 

Aclioleplasma laidlawii 70 
1 5 Acidothermus cellulolyticus 
Acidiphilum facilis 
Acinetobacter baumannii 
Acinetobacter calcoaceticus 

Acinetobacter Iwqffii 75 
20 Actinomyces meyeri 
Aerococcus viridans 
Aeromonas hydrophila 
Aeromonas salmonicida 

Agrobacterium radiobacter 80 
25 Agrobacterium tumefaciens 

Alcaligenes faecal is subsp. faecalis 
Allochromatium vinosum 
Anabaena variabilis 

Anacystis nidulans 85 
30 Anaerorhabdus furcosus 

Aquifex aeolicus 

Aquifex pyrophilus 

Arcanobacterium haemotyticum 

Archaeoglobus julgidus 90 
35 Azotobacter vinelandii 

Bacillus anthracis 

Bacillus cereus 

Bacillus firmus 

Bacillus halodurans 95 
40 Bacillus megaterium 
Bacillus mycoides 
Bacillus pseudomycoides 
Bacillus stearothermophilus 

Bacillus subtilis 100 
45 Bacillus thuringiensis 

Bacillus weihenstephanensis 
Bacteroides distasonis 
Bacteroides fragilis 

Bacteroides forsythus 1 05 

50 Bacteroides ovatus 
Bacteroides vulgatus 
Bartonella henselae 
Bifidobacterium adolescentis 

Bifidobacterium breve 110 
55 Bifidobacterium dentium 
Bifidobacterium longum 
Blastochloris viridis 
Borrelia burgdorferi 

Bordetella pertussis 115 
60 Bordetella bronchiseptica 
Brucella abortus 
Brevibacterium linens 
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Brevibacterium flavum 
Brevundimonas diminuta 
Buchnera aphidicola 

.Budviciaaquatica 

Burkholderia cepacia 

Burkholderia mallei 

Burkholderia pseudomallei 

Buttiauxella agrestis 

Butyrivibrio fibrisolvens 

Campylobacter coli 

Campylobacter curvus 

Campylobacter fetus subsp. fetus 

Campylobacter fetus subsp. venerealis 

Campylobacter gracilis 

Campylobacter jejuni 

Campylobacter jejuni subsp. doylei 

Campylobacter jejuni subsp. jejuni 

Campylobacter lari 

Campylobacter rectus 

Campylobacter sputorum subsp. sputorum 

Campylobacter upsaliensis 

Cedecea davisae 

Cedecea lapagei 

Cedecea neteri 

Chlamydia pneumoniae 

Chlamydia psittaci 

Chlamydia trachomatis 

Chlorobium vibrioforme 

Chloroflexus aurantiacus 

Chryseobacterium meningosepticum 

Citrobacter amalonaticus 

Citrobacter braakii 

Citrobacter farmeri 

Citrobacter freundii 

Citrobacter koseri 

Citrobacter sedlakii 

Citrobacter werkmanii 

Citrobacter youngae 

Clostridium acetobutylicum 

Clostridium beijerincldi 

Clostridium bifermentans 

Clostridium botulinum 

Clostridium difficile 

Clostridium innocuum 

Clostridium histolyticum 

Clostridium novyi 

Clostridium septicum 

Clostridium perfringens 

Clostridium ramosum 

Clostridium sordelUi 

Clostridium tertium 

Clostridium tetani 

Comamonas acidovorans 

Corynebacterium accolens 

Corynebacterium bovis 

Corynebacterium cervicis 
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Table 4. Example of microbial species for which tuf and/or atpD and/or recA nucleic acids and/or 
sequences are used in the present invention (continued). 

Bacterial species (continued) 



Corynebacterium diphtheriae 

Corynebacterium flavescens 

Corynebacterium genitalium 

Corynebacterium glutamicum 
10 Corynebacterium jeikeium 
~ Cblr^ebliclel^umkutJckeli 

Corynebacterium minutissimum 

Corynebacterium mycetoides 

Corynebacterium pseudodiphtheriticum 
1 5 Corynebacterium pseudogenitalium 

Corynebacterium pseudotuberculosis 

Corynebacterium renale 

Corynebacterium striatum 

Corynebacterium ulcerans 
20 Corynebacterium ureatyticum 

Corynebacterium xerosis 

Coxiella burnetii 

Cytophaga lytica 

Deinococcus radiodurans 
25 Deinonema sp. 

Edwardsiella hoshinae 

Edwardsiella tarda 

Ehrlichia canis 

Ehrlichia risticii 
30 Eikenella corrodens 

Enterobacter aerogenes 

Enterobacter agglomerans 

Enterobacter amnigenus 

Enterobacter asburiae 
3 5 Enterobacter cancerogenus 

Enterobacter cloacae 

Enterobacter gergoviae 

Enterobacter hormaechei 

Enterobacter sakazakii 
40 Enterococcus avium 

Emerococcus casseliflavus 

Enterococcus cecorum 

Enterococcus columbae 

Enterococcus dispar 
45 Enterococcus durans 

Enterococcus faecalis 

Enterococcus faecium 

Enterococcus flavescens 

Enterococcus gallinarum 
50 Enterococcus hirae 

Enterococcus malodorants 

Enterococcus mundtii 

Enterococcus pseudoavium 

Enterococcus raffinosus 
5 5 Enterococcus saccharotyticus 

Enterococcus solitarius 

Enterococcus sulfureus 

Erwinia amylovora 

Erwinia carotovora 
60 Escherichia coli 

Escherichia fergusonii 

Escherichia hermannii 

Escherichia vulneris 



Eubacterium lentum 
65 Eubacterium nodatum 

Emngella americana 

Francisella tularensis 
Frankia alni 

Fervidobacterium islandicum 
70 Fibrobacter succinogenes 

Flavobacterium ferrigeneum 

Flexistipes sinusarabici 

Fusobacterium gonidiaformans 

Fusobacterium necrophorum subsp. necrophorum 
75 Fusobacterium nucleatum subsp. polymorphum 

GardnereUa vaginalis 

Gemella haemotysans 

Gemella morbillorum 

Globicatella sanguis 
80 Gloeobacter violaceus 

Gloeothece sp. 

Gluconobacter oxydans 

Haemophilus actinomycetemcomitans 

Haemophilus aphrophilus 
85 Haemophilus ducreyi 

Haemophilus haemolyticus 

Haemophilus influenzae 

Haemophilus parahaemolyticus 

Haemophilus parainfluenzae 
90 Haemophilus paraphrophilus 

Haemophilus segnis 

Hafhia alvei 

Halobacterium marismortui 

Halobacterium salinarum 
95 Haloferax volcanii 

Helicobacter pylori 

Herpetoshiphon aurantiacus 

Kingella kxngae 

Klebsiella ornithinolytica 
1 00 Klebsiella oxytoca 

Klebsiella planticola 

Klebsiella pneumoniae subsp. ozaenae 

Klebsiella pneumoniae subsp. pneumoniae 

Klebsiella pneumoniae subsp. 
105 rhinoscleromatis 

Klebsiella terrigena 

Kluyvera ascorbata 

Kluyvera cryocrescens 

Kluyvera georgiana 
1 1 0 Kocuria kristinae 

Lactobacillus acidophilus 

Lactobacillus garvieae 

Lactobacillus paracasei 

Lactobacillus casei subsp. casei 
1 1 5 Lactococcus garvieae 

Lactococcus lactis 

Lactococcus lactis subsp. lactis 

Leclercia adecarboxylata 

Legionella micdadei 
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Table 4. Example of microbial species for which tuf and/or atpD and/or recA nucleic acids and/ r 

sequences are used in the present invention (continued). 



Bacterial species (continued) 



Legionella pneumophila subsp. pneumophila 

Leminorella grimontii 

LtminoreUa richardii 
1 0 Leptospira biflexa 

Leptospira interrogans 

Leuconostoc mesenteroides subsp. 

dext rani cum 

Listeria innocua 
1 5 Listeria ivanovii 

Listeria monocytogenes 

Listeria seeligeri 

Macrococcus caseolyticus 

Magnetospirillum magnetotacticum 
20 Megamonas hypermegale 

Methanobacterium ihermoautotrophicum 

Methanococcus jannaschii 

Methanococcus vannielii 

Methanosarcina barkeri 
25 Methanosarcina jannaschii 

Methylobacillus flagellatum 

Methylomonas clara 

Micrococcus luteus 

Micrococcus tylae 
30 Mitsuokella multacidus 

Mobiluncus curtisii subsp. holmesii 

Moellerella thermoacetica 

Moellerella wisconsensis 

MooreUa thermoacetica 
35 Moraxella catarrhalis 

Moraxella osloensis 

Morganella morganii subsp. morganii 

Mycobacterium avium 

Mycobacterium bovis 
40 Mycobacterium gordonae 

Mycobacterium kansasii 

Mycobacterium leprae 

Mycobacterium terrae 

Mycobacterium tuberculosis 
45 Mycoplasma capricolum 

Mycoplasma gallisepticum 

Mycoplasma genitalium 

Mycoplasma hominis 

Mycoplasma pirum 
50 Mycoplasma mycoides 

Mycoplasma pneumoniae 

Mycoplasma pulmonis 

Mycoplasma salivarium 

Myxococcus xanthus 
55 Neisseria animalis 

Neisseria cards 

Neisseria cinerea 

Neisseria cuniculi 

Neisseria elongata subsp. elongata 
60 Neisseria elongata subsp. intermedia 

Neisseria flava 

Neisseria flavescens 



Neisseria gonorrhoeae 

Neisseria lactamica 
65 Neisseria meningitidis 

_ Neisseria mucosa 

Neisseria perflava 

Neisseria pharyngis var. flava 

Neisseria polysaccharea 
70 Neisseria sicca 

Neisseria subflava 

Neisseria weaveri 

Obesumbacterium proteus 

Ochrobactrum anthropi 
75 Pantoea agglomerans 

Pantoea dispersa 

Paracoccus denitrificans 

Pasteurella multocida 

Pectinatus frisingensis 
80 Peptococcus niger 

Peptostreptococcus anaerobius 

Peptostreptococcus asaccharotyticus 

Peptostreptococcus prevotii 

Phormidium ectocarpi 
85 PireUula manna 

Planobispora rosea 

Plesiomonas shigelloides 

Plectonema boryanum 

Porphyromonas asaccharolytica 
90 Porphyromonas gingivalis 

Pragiafontium 

Prevotella buccalis 

Prevotella melaninogenica 

Prevotella oralis 
95 Prevotella ruminocola 

Prochlorothrix hollandica 

Propionibacterium acnes 

Propionigenium modestum 

Proteus mirabilis 
1 00 Proteus penneri 

Proteus vulgaris 

Providencia alcalifaciens 

Providencia rettgeri 

Providencia rustigianii 
105 Providencia stuartii 

Pseudomonas aeruginosa 

Pseudomonas Jluorescens 

Pseudomonas putida 

Pseudomonas stutzeri 
110 Psychrobacter phenylpyruvicum 

Pyrococcus abyssi 

Rahnella aquatilis 

Rickettsia prowazekii 

Rhizobium leguminosarum 
115 Rhizobium phaseoli 

Rhodobacter capsulatus 

Rhodobacter sphaeroides 
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Table 4. Example of microbial species for which tuf and/or atpD and/or recA nucleic acids and/or 
sequences are used in the present invention (continued). 



Bacterial species (continued) 



Rhodopseudomonas palustris 

Rhodospirillum rubrum 65 

. 10 Ruminococcus albus 

Ruminococcus bromii 
Salmonella bongori 

Salmonella choleraesuis subsp. arizonae 
Salmonella choleraesuis subsp 70 
1 5 choleraesuis 

Salmonella choleraesuis subsp. 
diarizonae 

Salmonella choleraesuis subsp. 
houtenae 75 
20 Salmonella choleraesuis subsp. indica 
Salmonella choleraesuis subsp. salamae 
Serpulina hyodysenteriae 
Serratiaficaria 

Serratia fonticola 80 
25 Serratia grimesii 

Serratia liquefaciens 
Serratia marcescens 
Serratia odorifera 

Serratia plymuihica 85 
30 Serratia rubidaea 

Shewanella putrefaciens 
Shigella boydii 
Shigella dysenteriae 

Shigella flexneri 90 
35 Shigella sonnei 

Sinorhizobium meliloti 
Spirochaeta auraniia 
Staphylococcus aureus 

Staphylococcus aureus subsp. aureus 95 
40 Staphylococcus auricularis 

Staphylococcus capitis subsp. capitis 

Staphylococcus cohnii subsp. cohnii 

Staphylococcus epidermidis 

Staphylococcus haemotyticus 1 00 

45 Staphylococcus hominis 

Staphylococcus hominis subsp. hominis 

Staphylococcus lugdunensis 

Staphylococcus saprophytics 

Staphylococcus sciuri subsp. sciuri 1 05 

50 Staphylococcus simulans 

Staphylococcus warneri 

Stigmatella aurantiaca 

Stenotrophomonas maltophilia 

Streptococcus acidominimus 110 
55 Streptococcus agalactiae 

Streptococcus anginosus 

Streptococcus bovis 

Streptococcus cricetus 

Streptococcus cristatus 1 1 5 

60 Streptococcus downei 

Streptococcus dysgalactiae 
Streptococcus equi subsp. equi 
Streptococcus ferus 



Streptococcus gordonii 
Streptococcus macacae 

-Streptococcus mitis ~ 

Streptococcus mutans 

Streptococcus oralis 

Streptococcus parasanguinis 

Streptococcus pneumoniae 

Streptococcus pyogenes 

Streptococcus ratti 

Streptococcus salivarius 

Streptococcus salivarius subsp. thermophilus 

Streptococcus sanguinis 

Streptococcus sobrinus 

Streptococcus suis 

Streptococcus uberis 

Streptococcus vestibularis 

Streptomyces anbofaciens 

Streptomyces aureofaciens 

Streptomyces cinnamoneus 

Streptomyces coelicolor 

Streptomyces collinus 

Streptomyces lividans 

Streptomyces netropsis 

Streptomyces ramocissimus 

Streptomyces rimosus 

Streptomyces venezjuelae 

Succinivibrio dextrinosolvens 

Synechococcus sp. 

Synechocystis sp. 

Tatumella ptyseos 

Taxeobacter occealus 

Tetragenococcus halophilus 

Thermoplasma acidophilum 

Thermotoga maritima 

Thermus aquaticus 

Thermus thermophilus 

Thiobacillus ferrooxidans 

Thiomonas cuprina 

Trabulsiella guamcnsis 

Treponema pallidum 

Ureaplasma ureatyticum 

Veillonella parvula 

Vibrio alginotyticus 

Vibrio anguiUarum 

Vibrio cholerae 

Vibrio mimicus 

Wolinella succinogenes 

Xanthomonas citri 

Xanthomonas oryzae 

Xenorhabdus bovieni 

Xenorhabdus nematophilus 

Yersinia bercovieri 

Yersinia enterocolitica 

Yersinia frederiksensii 

Yersinia intermedia 

Yersinia pestis 
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Table 4. Example of microbial species for which tuf and/or atpD and/or recA nucleic acids and/ r 
sequences are used in the present invention (continued). 



S Bacterial species (continued) 

Yersinia pseudotuberculosis 
Yersinia rohdei 
Yokenella regensburgei 
10 Zoogloea ramigera — 
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Table 4. Example of microbial species for which tuj and/or atpD and/or recA nucleic acids and/or 
sequences are used in the present invention (continued). 



Fungal species 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Absidia corymbifera 




Fusarium moniliforme 


Absidia glauca 


65 


Fusarium oxysporum 


Alternaria alternata 


Fusarium solani 


Arxula adeninivorans 




jGeotrichum sp. 


Aspergillus flavus 




Histoplasma capsulatum 


Aspergillus fumigatus 




Hortaea werneckii 


Aspergillus nidulans 


70 


Issatchenkia orientalis Kudrjanzev 


Aspergillus niger 


Kluyveromyces lactis 


Aspergillus oryzae 




Malassezia furfur 


Aspergillus terreus 




Malassezia pachydermatis 


Aspergillus versicolor 




Malbranchea filamentosa 


Aureobasidium pullulans 


75 


Metschnikoma pulcherrima 


Basidiobolus ranarum 


Microsporum audouinii 


Bipolaris hawaiiensis 




Microsporum canis 


Bilophila wadsworthia 




Mucor circinelloides 


Blastoschizomyces capitatus 




Neurospora crassa 


Blastomyces dermatitidis 


80 


Paecilomyces lilacinus 


Candida albicans 


Paracoccidioides brasiliensis 


Candida catenulata 




Penicillium marneffei 


Candida dubliniensis 




Phialaphora verrucosa 


Candida famaia 




Pichia anomala 


Candida glabrata 




Piedraia hortai 


Candida guilliermondii 


85 


Podospora anserina 


Candida haemulonii 




Podospora curvicolla 


Candida inconspicua 




Puccinia graminis 


Candida kefyr 




Pseudallescheria boydii 


Candida krusei 




Reclinomonas americana 


Candida lambica 


90 


Rhizomucor racemosus 


Candida lusitaniae 




Rhizopus oryzae 


Candida norvegica 




Rhodotorula minuta 


Candida norvegensis 




Rhodotorula mucilaginosa 


Candida parapsilosis 


95 


Saccharomyces cerevisiae 


Candida rugosa 


Saksenaea vasiformis 


Candida sphaerica 




Schizosaccharomyces pombe 


Candida tropicalis 




Scopulariopsis koningii 


Candida utilis 




Sordaria macrospora 


Candida viswanathii 




Sporobolomyces salmonicolor 


Candida zeylanoides 


100 


Sporothrix schenckii 


Cladophialophora carrionii 




Stephanoascus ciferrii 


Coccidioides immitis 




Syncephalastrum racemosum 


Coprinus cinereus 




Trichoderma reesei 


Cryptococcus albidus 


105 


Trichophyton mentagrophytes 


Cryptococcus humicolus 


Trichophyton rubrum 


Cryptococcus laurenlii 




Trichophyton tonsurans 


Cryptococcus neoformans 




Trichosporon cutaneum 


Cunninghamella bertholletiae 




Ustilago maydis 


Curvularia lunata 




Wangiella dermatitidis 


Emericella nidulans 


110 


Yarrowia lipolytica 


Emmonsia parva 






Eremothecium gossypii 






Exophiala dermatitidis 






Exophiala jeanselmei 






Exophiala moniliae 






Exserohilum rostratum 






Eremothecium gossypii 






Fonsecaea pedrosoi 
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Table 4. Example of microbial species for which fu/and/or atpD and/or recA nucleic acids and/or 
sequences are used in the present invention (continued). 



Parasitical species 

5 

Babesia bigemina 
Babesia bovis 
Babesia microti 
Blastocystis hominis 

10 Crithidia fasciculaia 

Cryptosporidium parvum 

Entamoeba histolytica 

Giardia lamblia 

Kenirophoros sp. 
15 Leishmania aethiopica 

Leishmania amazonensis 

Leishmania braziliensis 

Leishmania donovani 

Leishmania infantum 
20 Leishmania enriettii 

Leishmania gerbilli 

Leishmania guyanensis 

Leishmania hertigi 

Leishmania major 
25 Leishmania mexicana 

Leishmania panamensis 

Leishmania tarentolae 

Leishmania tropica 

Neospora caninum 
30 Onchocerca volvulus 

Plasmodium berghei 

Plasmodium falciparum 

Plasmodium knowlesi 

Porphyra purpurea 
35 Toxoplasma gondii 

Treponema pallidum 

Trichomonas tenax 

Trichomonas vaginalis 

Trypanosoma brucei 
40 Trypanosoma brucei subsp. brucei 

Trypanosoma congolense 

Trypanosoma cruzi 
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Table 5. Antimicrobial agents resistance genes selected for diagnostic 
Antimicrobial agent Bacteria 1 



purposes. 



Gene 



ACCESSION NO. SEQ ID NO. 



10 



aac(3)-Ib 1 

aac(3)'IIb 2 

aac(3)-IVa 2 
aac(3)-VIa 2 



Aminoglycosides 

Aminoglycosides 

Aminoglycosides 
Aminoglycosides 



15 



20 



35 



40 



45 



50 



55 



60 



65 



(wc(V)-la 2 

aac(6')-aph(2") 2 

aac(V)-la t 2 

aac(6')-lc 2 

aac(6')-lla 2 
aadB[ant(2")-Ia 2 } 
aacCJ [aac(3)-Ia 2 \ 
aacC2 [aac(3)-IIa 2 1 



aacC3 [aac(3)-III 2 J 
25 aacA4 [aac(6')-Ib 2 ] 
ant(3")-la 2 



ant(4')-Ia 2 
30 aph(3')-la 2 



aph(3')-lla 2 
aph(3')-IIIa 2 
aph(3'hVla 2 
rpsL 2 



hla OXA 5,6 



Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 
Aminoglycosides 
Aminoglycosides 
Aminoglycosides 
Aminoglycosides 
Aminoglycosides 
Aminoglycosides 

Aminoglycosides 
Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Streptomycin 



B- lac tarns 



Enterobacteriaceae 
Pseudomonads 
Enterobacteriaceae, 
Pseudomonads 
Enterobacteriaceae 
Enterobacteriaceae, 

-Pseudomonads 

Enterobacteriaceae, 

Pseudomonads 

Enterococcus sp.. 

Staphylococcus sp. 

Enterobacteriaceae, 

Pseudomonads 

Enterobacteriaceae, 

Pseudomonads 

Pseudomonads 

Enterobacteriaceae 

Pseudomonads 

Pseudomonads 

Pseudomonads 

Pseudomonads 

Enterobacteriaceae, 

Enterococcus sp.. 

Staphylococcus sp. 

Staphylococcus sp. 

Enterobacteriaceae, 

Pseudomonads 

Enterobacteriaceae, 

Pseudomonads 

Enterococcus sp., 

Staphylococcus sp. 

Enterobacteriaceae, 

Pseudomonads 

M. tuberculosis, 

M. avium complex 



Enterobacteriaceae, 
Pseudomonads 



L06157 

M97172 

X01385 
M88012 

X04555 



M18967 
M94066 



bla R OB ' 



fl-lactams 



Haemophilus sj 
Pasteurella sp. 



X02340 
M10241 

V01282 
J01839 

V00618 

V01547 

X07753 

X80120 
U 14749 
X70995 
L08011 

Y10693 
AJ238349 
AJ009819 
XG6046 
X03037 
X07260 
U 13880 
X75562 
AF034958 
J03427 
Z22590 
U59183 
L38523 
U63835 
AF043100 
AF060206 
U85514 
AF043381 
AF024602 
AF064820 



83-86 3 



112 4 

53-54 3 
55-56 3 
57-58 3 
59-60 3 
65-66 3 



WO' 



45-48 
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Table 5. Antimicrobial agents resistance genes selected for diagnostic purposes (continued) 
Gene Antimicrobial agent " Bacteria* ACCESSION NO. 



ACCESSION NO. SEQ ID NO. 



B-lactams 



Enterobacteriacea, 
Pseudomonas aeruginosa 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



bla TEM 5,6 



B-lactams 



Enterobacteriaceae, 
Neisseria sp. f 
Haemophilus sp. 



AF124984 
AF148850 
M59181 
X98099 
M33655 
AF148851 
_X53433 
L47119 
AF074954 
X53817 
AF096930 
X55640 
Y11069 
U20270 
U92041 
S82452 
X98101 
X98105 
AF164577 
AJ011428 
AF1 16855 
AB023477 
AF293345 
AF227204 
AF208796 
AF132290 
AF012911 
U48775 
AF093512 
AF052748 
X64523 
Y13612 
X57972 
AF157413 
U31280 
U36911 
U48775 
VO0613 
X97254 
AJ012256 
X04515 
AF126482 
U09188 
M88143 
Y14574 
AF188200 
AJ251946 
Y17581 
Y17582 
Y 17583 
M88143 
U37195 
Y17584 
X64523 
U95363 
Y10279 
Y10280 
Y10281 
AF027199 
AF104441 
AF104442 
AF062386 
X57972 
AF047171 
AF18B199 
AF157553 
AF190694 
AF190695 
AF190693 
AF190692 



41-44- 



37-40 3 
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Table 5. Antimicr bial agents resistance genes selected for diagnostic purposes (continued). 



Gene 



Antimicrobial agent 



Bacteria* 



ACCESSION NO. SEQ ID NO. 



blaCARB : 



TO 



15 



20 



25 



30 



bla CTX-M-2 5 
bla CMY-2 7 



bla m p' 



35 



bla PER-l 

40 blapm-2 1 
blaZ 12 



B-lactams 



bla CTX-M-l 5 B-lactams 



B-lactams 
B-lactams 



B-lactams 



mecA 



12 



6-Iactams 

B-lactams 
B-lactams 

B-lactams 



Pseudomonas sp., 
Enterobacteriaceae 



Enterobacteriaceae 



Enterobacteriaceae 
Enterobacteriaceae 



Enterobacteriaceae, 
Pseudomonas aeruginosa 



Enterobacteriaceae, 
Pseudomodanaceae 
Enterobacteriaceae 

Enterococcus sp.. 
Staphylococcus sp. 

Staphylococcus sp. 



J05162 
S46063 
M69058 
__U14749_ 
D86225 
DL3210 
Z18955 
AF071555 
AF153200 
AF030945 

X92506 

X92507 

X91840 

AJ007826 
AJ011293 
AJ011291 
Y17716 
Y16783 
Y16781 
Y15130 
U77414 
S83226 
Y15412 
X78117 

AJ223604 

S71932 
D50438 
D29636 
X98393 
AB010417 
D78375 

Z21957 
X93314 



111 4 
97-98 3 
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Table 5. Antimicrobial agents resistance genes selected for diagnostic purposes (continued). 



Gene 



Antimicrobial agent 



Bacteria 1 



ACCESSION NO. SEQ ID NO. 



pbpla 



13 



B-lactams 



Streptococcus pneumoniae 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



pbplb 



13 



fl-lactams 



Streptococcus pneumoniae 



pbp2b 



13 



fl-lactams 



Streptococcus pneumoniae 



M90527 
X67872 
AB006868 
AB006874 
X67873 
AB006878. 
AB006875 
AB006877 
AB006879 
AF046237 
AF046235 
AF026431 
AF046232 
AF046233 
AF046236 
X67871 
Z49095 
AF046234 
AB006873 
X67866 
X67868 
AB006870 
AB006869 
AB006872 
X67870 
AB006871 
X67867 
X67869 
AB006876 
AF046230 
AF046238 
Z49094 

X16022 
M25516 
M25518 
M25515 
U20O71 
U20084 
U20O82 
U20067 
U20079 
Z22185 
U20072 
U20083 
U20081 
M25522 
U20075 
U20070 
U20077 
U20O68 
Z22184 
U20069 
U20078 
M25521 
M25525 
M25519 
Z21981 
M25523 
M25526 
U20076 
U20074 
M25520 
M25517 
M25524 
Z22230 
U20073 
U20080 



1004-1018, 

1648,2056-2064, 

2273-2276 



1019-1033 
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Table 5. Antimicrobial agents resistance genes selected for diagnostic purposes (continued). 



Gene 



Antimicrobial agent 



Bacteria 



ACCESSION NO. SEQ ID NO. 



pbp2x 



13 



B-lactams 



Streptococcus pneumoniae 



10 



15 



20 



25 

int 

30 sul 



35 



45 



50 



55 



ermA * 4 
ermB 14 



40 ermC 14 



ereA 



12 



ereB 12 
msrA 12 
mefA, mefE 8 
mphA 8 

UnA/linA' 9 



60 UnB™ 

vga 15 
65 vg* 15 



-lactams, 
trimethoprim 
aminoglycosides, 
antiseptic, 
chloramphenicol 
Macrolides, 
lincosamides, 
streptogramin B 
Macrolides, 

lincosamides, 
streptogramin B 
Macrolides, 
lincosamides, 
streptogramin B 

Macrolides 



Macrolides 
Macrolides 
Macrolides 
Macrolides 

Lincosamides 

Lincosamides 

Streptrogramin 
Streptrogramin 



Enterobacteriaceae, 
Pseudomonads 

Staphylococcus sp. 



Enterobacteriaceae, 
Staphylococcus sp. 
Enterococcus sp. 
Streptococcus sp. 
Enterobacteriaceae, 
Staphylococcus sp. 



Enterobacteriaceae, 
Staphylococcus sp. 

Enterobacteriaceae 
Staphylococcus sp. 
Staphylococcus sp. 

Streptococcus sp. 

Enterobacteriaceae, 
Staphylococcus sp. 

Staphylococcus sp. 



Enterococcus faecium 

Staphylococcus sp. 
Staphylococcus sp. 



X16367 
X65135 
AB011204 
AB011209 
AB011199_ 
AB011200 
AB011201 
AB011202 
AB011198 
AB011208 
AB011205 
AB015852 
AB011210 
AB015849 
AB015850 
AB015851 
AB015847 
AB015846 
AB011207 
AB015848 
Z49096 



1034-1048 



Ml 1277 
E01199 
AF099140 
A15097 
X03988 



U70055 
U83667 
D16251 
U34344 
U36578 
J03947 
M14039 
A15070 
E01245 
AF110130 
AJ238249 

M90056 
U82085 
M36022 
M20219 
AF015628 



99-102 3 
103-106 3 

113 4 

114 4 

115 4 



77-80 • 



89-90 
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Table 5. Antimicr bfal agents resistance genes selected for diagnostic purposes (continued). 

n 



Gene 



Antimicrobial agent 



Bacteria 1 



ACCESSION NO. SEQ ID NO. 



vat 1$ 
,15 



satA 15 
10 mupA 12 



15 



20 



25 



30 



45 



50 



55 



gyrA 



16 



parC/grlA 16 



parE/grlB 16 



norA 



16 



mexR fpnm 16 
35 njxBK 
cat 11 



40 



ppflo-\\ke 
embB 11 



pncA 17 
60 rpoB 17 

65 inhA 17 



Streptrogramin 
Streptrogramin 

Streptrogramin 
Mupirocin 



Quinolones 



Staphylococcus sp. 
Staphylococcus sp. 

Enterococcus faecium 
Staphylococcus aureus 



Quinolones 



Quinolones 
Quinolones 



Quinolones 
Quinolones 
Chloramphenicol 



Chloramphenicol 

Ethambutol 

Pyrazinamide 

Rifampin 



Isoniazid 



Gram-positive and 
gram-negative bacteria 



Gram-positive and 
gram-negative bacteria 



Gram-positive bacteria 



Staphylococcus sp. 



Pseudomonas aeruginosa 
Pseudomonas aeruginosa 

Gram-positive and 
gram-negative bacteria 



Mycobacterium tuberculosis 
Mycobacterium tuberculosis 

Mycobacterium tuberculosis 



Mycobacterium tuberculosis 



L07778 87-88 3 

U19459 

L38809 

L12033 81-82 3 
X75439 

X59478 

X59477 

X95718 1255, 1607-1608, 
X06744 1764-1776, 
X57174 2013-2014, 
X16817 2277-2280 
X71437 

AF065152 

AF060881 
D32252 

AB005036 1777-1785 
AF056287 
X95717 
AF129764 
AB017811 
AF065152 

X95717 
AF065153 
AF058920 

D90119 

M80252 

M97169 

U23763 

X65646 

M55620 

X15100 

A24651 

M28717 

A00568 

A00569 

X74948 

Y00723 

A24362 

A00569 

M93113 

M62822 

M58516 

V01277 

X02166 

M77169 

X53796 

J01841 

X07848 

AF071555 
U68480 
U59967 

AF055891 
AF055892 

S71246 

L27989 
AF055893 
AF106077 

U02492 
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Table 5. Antimicrobial agents resistance genes selected for diagnostic purposes (continued). 
Antimicrobial agent _ Bacteria 1 



Gene 



ACCESSION NO. SEQ ID NO. 



vanA * 2 

vanB 12 
vanCJ 12 

10 

VQn C2-YL- 



15 



20 



25 



30 



35 



40 



45 



50 



55 



vanC3 



12 



vanD 18 
vanE* 2 
tetB » 



letM 



19 



sulll 20 
dhfrla 20 



dhfrlb 



20 



dhfrV 20 
dhfrVlW 
dhfiVIl 20 



dhfrVIII 20 

dhfrIX 20 
dhfrXII 20 



dhfrXlll™ 
dhfrXV 2 ® 
60 dhfrXVII 20 



Vancomycin 

Vancomycin 
Vancomycin 

-Vancomycin" 



Enterococcus sp. 

Enierococcus sp. 
Enterococcus gallinarum 

Entefb^cuTcasseliflavus 







U94521 






U94522 






U94523 






U94524 






U94525 






L29638 


Vancomycin 


Enterococcus flavescens 








L29639 


< 




U72706 


Vancomycin 


Enterococcus faecium 


AF130997 


Vancomycin 


Enterococcus faecium 


AF136925 


Tetracycline 


Gram-negative bacteria 


J01830 






AF162223 






AP000342 






S83213 






U81141 






V00611 


Tetracycline 


Gram-negative and 


X52632 




Gram-positive bacteria 


AF1 16348 




U50983 






X92947 






M211136 






U08812 






X04388 


Sulfonamides 


Gram-negative bacteria 


M36657 






AF017389 






AF017391 


Trimethoprim 


Gram-negative bacteria 


AJ238350 






X17477 






K00052 






U09476 






X00926 


Trimethoprim 


Gram-negative bacteria 


Z50805 






Z50804 


Trimethoprim 


Gram-negative bacteria 


X12868 


Trimethoprim 


Gram-negative bacteria 


Z86002 


Trimethoprim 


Gram-negative bacteria 


U31119 






AF139109 






X58425 


Trimethoprim 


Gram-negative bacteria 


U10186 






U09273 


Trimethoprim 


Gram-negative bacteria 


X57730 


Trimethoprim 


Gram-negative bacteria 


Z21672 






AF175203 






AF1 80731 






M84522 


Trimethoprim 


Gram-negative bacteria 


Z50802 


Trimethoprim 


Gram-negative bacteria 


Z83331 


Trimethoprim 


Gram-negative bacteria 


AF170088 






AF180469 






AF169041 



67-70 3 
1049-1057 
116 4 
117 4 

J058-_1059_ 
1060-1063 



1064-1066 
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Table 5. Antimicrobial agents resistance genes selected for diagnostic purposes (continued). 



Gene Antimicrobial agent Bacteria 1 ACCESSION NO. SEQ ID NO. 



Trimethoprim Staphylococcus sp. AF045472 

U40259 
AF051916 
X13290 

Y07536 

Z16422 
Z48233 



15 

1 Bacteria having high incidence for the specified antibiotic resistance gene. The presence of the 
antibiotic resistance genes in other bacteria is not excluded. 

2 Shaw, K. J., P. N. Rather, R. S. Hare, and G. H. Miller. 1993. Molecular genetics of aminoglycoside 
resistance genes and familial relationships of the aminoglycoside-modifying enzymes. Microbiol. 

20 Rev. 57:138-163. 

3 Antibiotic resistance genes from our assigned US patent no. 6,001 ,564 for which we have selected 
PCR primer pairs. 

4 These SEQ ID NOs. refer to a previous patent (publication WO98/20157). 

5 Bush, K M G.A. Jacoby and A. Medeiros. 1995. A functional classification scheme for fc-lactamase 
25 and its correlation with molecular structure. Antimicrob. Agents. Chemother. 39:1211-1233. 

6 Nucleotide mutations in b!asHV> bla TEM. and blaoxA> are associated with extended-spectrum G- 
lactamase or inhibitor-resistant (^-lactamase. 

7 Bauerfeind, A., Y. Chong, and K. Lee. 1998. Plasmid-encoded AmpC beta-lactarnases: how far 
have we gone 10 ears after discovery? Yonsei Med. J. 39:520-525. 

30 8 Sutcliffe, J., T. Grebe, A. Tait-Kamradt, and L Wondrack. 1996. Detection of erythromycin-resistant 
determinants by PCR. Antimicrob. Agent Chemother. 40:2562-2566. 
9 Leclerc, R., A., Brisson-NoSI, J. Duval, and P. Courvalin. 1991. Phenotypic expression and genetic 
heterogeneity of lincosamide inactivation in Staphylococcus sp. Antimicrob. Agents. Chemother. 
31:1887-1891. 

35 10 Bozdogan, B. p L Berrezouga, M.-S. Kuo, D. A. Yurek, K. A. Farley, B. J. Stockman, and R. 

Leclercq. 1999. A new gene, linB, conferring resistance to lincosamides by nucleotidylation in 
Enterococcus faecium HM1025. Antimicrob. Agents. Chemother. 43:925-929. 
11 Cockerill III, F.R. 1999. Genetic methods for assessing antimicrobial resistance. Antimicrob. Agents. 
Chemother. 43:199-212- 

40 12 Tenover, F. C, T. Popovic, and O Olsvik. 1996. Genetic methods for detecting antibacterial 
resistance genes, pp. 1368-1378. In Murray, P. R., E. J. Baron, M. A. Pfaller, F. C. Tenover, R. H. 
Yolken (eds). Manual of clinical microbiology. 6th ed., ASM Press, Washington, D.C. USA 

13 Dowson, C. G., T. J. Tracey, and B. G. Spratt. 1994, Origin and molecular epidemiology of 
penicillin-binding-protein-mediated resistance to (1-lactam antibiotics. Trends Molec. MicrobioL2: 

45 361-366. 

14 Jensen, L B., N. Frimodt-Moller, F. M. Aarestrup. 1999. Presence of erm gene classes in Gram- 
positive bacteria of animal and human origin in Denmark. FEMS Microbiol. 170:151-158. 

1 5 Thai, L. A., and M. J. Zervos. 1999. Occurrence and epidemiology of resistance to virginimycin and 
streptrogramins. J. Antimicrob. Chemother. 43:171-176- 

50 16 Martinez J. L, A. Alonso, J. M. Gomez-Gomez, and F. Baquero. 1998. Quinolone resistance by 
mutations in chromosomal gyrase genes. Just the tip of the iceberg? J. Antimicrob. Chemother. 
42:683-688 

17 Cockerill III, F.R. 1999. Genetic methods for assessing antimicrobial resistance. Antimicrob. Agents. 
Chemother. 43:199-212. 

55 18 Casadewall, B. and P. Courvalin. 1999 Characterization of the vanD glycopeptide resistance gene 
cluster from Enterococcus faecium BM 4339. J. Bacterid. 181:3644-3648. 

19 Roberts, M.C. 1999. Genetic mobility and distribution of tetracycline resistance determinants.Ciba 
Found. Symp. 207:206-222. 

20 Huovinen, P., L. Sundstrdm, G. Swedberg, and O. Skdld. 1995. Trimethoprim and sulfonamide 
60 resistance.Antimicrob. Agent Chemother. 39:279-289. 
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20 
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Table 6. List of bacterial t xins selected f r diagnostic purposes. 



Organism 



Toxin 



Accession number 



5 Actinobacillus actinomycetemcomitans Cytolethal distending toxin (ctf/A, cdtB, cdtC) AF006830 



10 



15 



20 



25 



30 



Actinomyces pyogenes 
Aeromonas hydrophila 



Bacillus anthracis 
Bacillus cereus 



Bacillus mycoides 
Bacillus pseudomycoides 
Bacteroides fragilis 



Bordetella bronchiseptica 



35 Bordetella pertussis 



40 



45 



Campylobacter jejuni 
5 0 Citrobacter freundii 
Clostridium botulinum 



55 



60 



65 



Leukotoxin (ItxA) 
Hemolysin (pyolysin) 
Aerolysin (aerA) 

Haemolysin (hlyA) 

Cytotonic enterotoxin (alt) 
Anthrax toxin (cya) 
Enterotoxin (bceT) 



Enterotoxic hemolysin BL 



M27399 
U84782 
M 16495 

U81555 



Non-haemolytic enterotoxins A,B and C (nhe) 
Hemolytic enterotoxin HBL 
Hemolytic enterotoxin HBL 
Enterotoxin (bfi?) 



L77573 
M23179 
D 173 12 

AF192766, AF192767 

AJ237785 

Y19005 

AJ243150 to AJ243153 
AJ243154 to AJ243156 
U67735 



Matrix metalloprotease/enterotoxin (fragilysin) S75941, AF038459 
Metalloprotease toxin-2 



Metalloprotease toxin-3 

Adenylate cyclase hemolysin (cyak) 

Dermonecrotic toxin (dm) 

Pertussis toxin (SI subunit, tox) 



Adenyl cyclase (cya) 

Dermonecrotic toxin (dnt) 

Cytolethal distending toxin (cdtk, cdtB, cdtC) 

Shiga-like toxin (jfr-IIcA) 

Botulism toxin (BoNT) (A.B.E and F serotypes 

are neurotoxic for humans; the other serotypes 

have not been considered) 



U90931 
AF081785 

AF056297 
Z37112, U22953 

U59687 

AB020025 

AJ006151 

AJ006153 

AJ006155 

AJ006157 

AJ006159 

AJ007363 

M14378, M16494 

AJ007364 

M13223 

X16347 

18323 

U 10527 
U51121 

X67514, S53206 

X52066, X52088 

X73423 

M30196 

X70814 

X70819 

X71343 

Z11934 

X70817 

M81186 

X70818 

X70815 

X62089 

X62683 

S76749 

X81714 

X70816 
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Table 6. List of bacterial toxins selected for diagnostic purposes (continued). 



Organism 



Toxin 



Accession number 



5 Clostridium botulinum (continued) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Clostridium difficile 



Clostridium perfringens 



A toxin (enterotoxin) (tcdA) (cdtA) 



Clostridium sordellii 
Clostridium tetani 

Corynebacterium diphtheriae 

Corynebacterium pseudotuberculosis 

55 Eikenella corrodens 
Enterobacter cloacae 
Enterococcus faecalis 
Escherichia coli (EHEQ 

60 



B toxin (cytotoxin) (toxB) (cdtB) 
Alpha (phospholipase Q (cpa) 



Beta (dermonecrotic protein) (cpb) 



Enterotoxin (cpe) 



Enterotoxin pseudogene (not expressed) 

Epsilon toxin (etxD) 

Iota (la and lb) 

Lambda (metalloprotease) 

Theta (perfringolysin O) 
Cytotoxin L 
Tetanos toxin 

Diphtheriae toxin 

Phospholipase C 

lysine decarboxylase (cadA) 
Shiga-like toxin II 
Cytolysin B (cylB) 
Hemolysin toxin (hlyA and ehxA) 



X70820 

X70281 

L35496 

M92906 

AB012304 

AF053400 

Y12616_ _ 

X51797 

X17194 

M30307 

Z23277 

X53138 

L43545 

L43546 

L43547 

L43548 

X13608 

X17300 

D 10248 

L13198 
X83275 
L77965 

AJ000766 

M98037 

X81849 

X71844 

Y16009 

AF037328 
AF037329 
AF037330 

M80837 
M95206 
X60694 

X73562 

D45904 

M36704 
X82638 
X06214 
X04436 
X00703 

A21336 

U89166 

Z50754, U33502 

M38052 

AF043471 

X94129 

X79839 

X86087 

AB011549 

AF074613 
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Table 6. List of bacterial toxins selected for diagnostic purposes (continued). 



_10_ 



15 



20 



30 



40 



45 



50 



55 



60 



65 



Organism 


Toxin 


Accession number 


Escherichia coli (EHEC) 


Shiga-like (Vero cytotoxin) (stx) 


X81418, M36727 






M14107, E03962 






M10133, E03959 






M12863, X07865 






X81417, Y10775 






X81416, Z50754 




_ _ 


X81415,-X67515 






Z36900, AF043627 






LI 1078, M19473 






L04539, M17358 






LI 1079, M19437 






X65949, M24352 






M21534, X07903 






M29153, Z36899 






Z37725 


• 




Z36901 






X61283 






ABO 17524 






U72191 






X61283 


Escherichia coli (ETEC) 


Enterotoxin (heat-labile) (eltB) 


M 17874 






M 17873 






J01605 






AB011677 




Enterotoxin (heat-stable) (astA) (estAl) 


L11241 






M58746 






M29255 






V00612 






J0183! 


Escherichia coli (other) 


Cytolethal-distending toxin 


U03293 




(cdt) (3 senes) 


U04208 




U89305 




Cytotoxic necrotizing factor 1 (cnf\) 


U42629 




Microcin 24 (mtfS) 


U47048 




Autotransporter enterotoxin (Pet) (cytotoxin) 


AF056581 


Haemophilus ducreyi 


Cytolethal distending toxin (cdt A, cdtB, cdtC) 


U53215 


Helicobacter pylori 


Vacuolating toxin (vac A) 


U07145 






U80067 






U80068 






AF077938 






AF077939 






AF077940 






AF077941 


Legionella pneumophila 


Structural toxin protein (rCcA) 


AF057703 


Listeria monocxtoeenes 


Listeriolvsin O (lisA. hl\A) 


X15127 
M24199 






X60035 






U25452 






U25443 






U25446 






U25449 


Pasteurella multocida 


Mitogenic toxin (dermonecrotic toxin) 


X57775, Z28388 




X51512 






X52478 


Proteus mirabilis 


Hemolysin (hpmA) 


M30186 


Pseudomonas aeruginosa 


Cytotoxin (Enterotoxin A) 


X14956 


Salmonella typhimurium 


Calmodulin-sensitive adenylate cyaJase toxin (cya) 


AF060869 




Cytolysin (salmolysin) (slyA) 


U03842 




Enterotoxin (stn) 


L16014 
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Organism Toxin Accessi n number 



5 


Serratia marcescens 
Shigella dysenteriae type 1 


Hemolysin (shlA) 

Shiga toxin (stxA and stxB) 


M22618 

X07903, M32511 
M19437 

M24352, M21947 


JP_ 


Shigella flexneri 


ShET2 enterotoxin (senA) 
Enterotoxin 1 (set\ A and setlB) 


Z54211 

-Z4738L 

U35656 


15 


Shigella sonnei 
Sphingomonas paucimobilis 
Staphylococcus aureus 


Hemolysin E (/ifyE, cfyA, sheA) 
Shiga toxin (stxA and stxB) 
Beta-hemolysin (hlyA) 
Gamma-hemolysin (hlgl) 


AF200955 
AJ 132761 
L01270 
D42143 
L01055 


20 




Enterotoxin 
Enterotoxin A (sea) 


U93688 

U2565, L22566 
Ml 8970 


25 




Enterotoxin B 
Enterotoxin CI (enlCl) 


Ml 1118 
X05815 


30 




Enterotoxin C2 (entCZ) 
Enterotoxin C3 (entCl) 
Enterotoxin D (sed) 


P34071 
X51661 
M94872 


35 




Enterotoxin E 
Enterotoxin G (seg) 

Enterotoxin H (seh) 


M21319 
AF064773 

U11702 


40 




Enterotoxin I (sei) 
Enterotoxin J 


AF064774 
AFO53140 


45 




Exfoliative toxin A (ETA, Epidermolytic toxin A) 
Exfoliative toxin B (ETB) 


M17347 
M17357 

L25372, M20371 
M17348, M13775 


50 




Leukocidin R (F and S component, lukF and lukS\ 
Hemolysin B and Q 


X64389, S53213 

X72700 

L01O55 


55 
60 


Staphylococcus epidermidis 
Staphylococcus intermedius 


Toxic shock syndrome toxin 1 (TSST-1, 
alnha toxin alnha hemolvsin) 

Delta toxin (hid) 
Enterotoxin 1 


X01645 

M90536 

J02615 

U93688 

AF068634 

U91526 




Leukocidin R (F and S component, lukF and MS; 
synergohymenotropic toxin) 


X79188 


65 


Streptococcus pneumoniae 


Pneumolysin 


X52474 
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Organism 



Toxin 



Accession number 



Streptococcus pyogenes 



10 Vibrio cholerae 



15 



20 



25 Vibrio parahaemofyticus 
Vibrio vulnificus 
Yersinia enterocolitica 



30 



Yersinia kristensenii 
Yersinia pestis 



Streptococcus pyrogenic exotoxin A (speA) 



Pyrogenic exotoxin B (speh) 

M86905,M35110 

Cholerae toxin (ctxA and ctxB subunits) 



Accessory cholera enterotoxin (ace) 
Heat-stable enterotoxin (sto) 

Zonula occludens toxin (zot) 
Thermostable direct hemolysin (tdh) 
Cytolysin (w/zA) 
Heat-stable enterotoxin (yst) 

Heat-stable enterotoxin type B (ystB) 

Heat-stable enterotoxin type C (ystC) 

Enterotoxin X69218 

Toxin 



X61553 to X61573 
X03929 

U4G453, M19350 
-U63134 

X00171 
X76390 
X58786 

X58785, S55782 

D30052 

D30053 

K02679 

AF175708 

Z22569, AF175708 

X74108, M85198 
M97591, L03220 

M83563, AF175708 

S67841 

M34670 

U09235, X65999 

D88145 

D63578 



X92727 



35 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing. 



SEQ ID NO. Archaeal, bacterial, fiingal or parasitical species SourceGene* 



-10- - 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



1 


Acinetobacter bawnannii 


This patent 


tuf 


2 


Actinomyces meyeri 


This patent 


tuf 


3 


Aerococcus viridans 


This patent 


tuf 


4 


Achromobacter xylosoxidans subsp. demtnficans 


This patent 


tuf 


5 


Anaerorhahdus furcosus 


_ This patent 


M 


6 


- Bacillusanthracis 


This patent 


tuf 


7 


Bacillus cereus 


This patent 


tuf 


8 


Bacteroides distasonis 


This patent 


tuf 


9 


Enterococcus casseliflavus 


This patent 


tuf 


10 


Staphylococcus saprophytics 


This patent 


tuf 


11 


Bacteroides ovatus 


This patent 


tuf 


12 


Bartonella henselae 


This patent 


tuf 


13 


Bifidobacterium adolescentis 


This patent 


tuf 


14 


Bifidobacterium dentium 


This patent 


tuf 


15 


Brucella abortus 


This patent 


tuf 


16 


Burkholderia cepacia 


This patent 


tuf 


17 


Cedecea davisae 


This patent 


tuf 


18 


Cedecea neteri 


This patent 


tuf 


19 


Cedecea lapagei 


This patent 


tuf 


20 


Chlamydia pneumoniae 


This patent 


tuf 


21 


Chlamydia psittaci 


This patent 


tuf 


22 


Chlamydia trachomatis 


This patent 


tuf 


23 


Chryseobacterium meningosepticum 


This patent 


tuf 


24 


Citrobacter amalonaticus 


This patent 


tuf 


25 


Citrobacter braakii 


This patent 


tuf 


26 


Citrobacter koseri 


This patent 


tuf 


27 


Citrobacter farmeri 


This patent 


tuf 


28 


Citrobacter freundii 


This patent 


tuf 


29 


Citrobacter sedlakii 


This patent 


tuf 


30 


Citrobacter werkmanii 


This patent 


tuf 


31 


Citrobacter youngae 


This patent 


tuf 


32 


Clostridium perfringens 


This patent 


tuf 


33 


Comamonas acidovorans 


This patent 


tuf 


34 


Corynebacterium bovis 


This patent 


tuf 


35 


Corynebacterium cervicis 


This patent 


tuf 


36 


Corynebacterium flavescens 


This patent 


tuf 


37 


Corynebacterium kutscheri 


This patent 


tuf 


38 


Corynebacterium minutissimum 


This patent 


tuf 


39 


Corynebacterium mycetoides 


Tliis patent 


tuf 


40 


Corynebacterium pseudogenitalium 


This patent 


tuf 


41 


Corynebacterium renale 


This patent 


tuf 


42 


Corynebacterium ulcerans 


This patent 


tuf 


43 


Corynebacterium urealyticum 


This patent 


tuf 


44 


Corynebacterium xerosis 


This patent 


tuf 


45 


Coxiella burnetii 


This patent 


tuf 


46 


Edwardsiella hoshinae 


This patent 


tuf 


47 


Edwardsiella tarda 


This patent 


tuf 


48 


Eikenella corrodens 


This patent 


tuf 


49 


Enterobacter aerogenes 


l ius patent 


tuf 


50 


Enterobacter agglomerans 


This patent 


tuf 


51 


Enterobacter amnigenus 


This patent 


tuf 


52 


Enterobacter asburiae 


This patent 


tuf 


53 


Enterobacter cancerogenus 


This patent 


tuf 


54 


Enterobacter cloacae 


This patent 


tuf 


55 


Enterobacter gergoviae 


This patent 


tuf 


56 


Enterobacter hormaechei 


This patent 


tuf 


57 


Enterobacter sakazakii 


This patent 


tuf 


58 


Enterococcus casseliflavus 


This patent 


tuf 


59 


Enterococcus cecorum 


This patent 


tuf 


60 


Enterococcus dispar 


This patent 


tuf 


61 


Enterococcus durans 


This patent 


tuf 
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Table 7. 



Origin f the nucleic acids and/or sequences in the sequence listing (continued). 



__10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



SEQ ID NO, 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


62 


Enterococcus faecalis 


This patent 


tuf 


63 


Enterococcus faecalis 


This patent 


tuf 


64 


Enterococcus faecium 


This patent 


tuf 


65 


Enterococcus flavescens 


This patent 


tuf 


66 


Enterococcus gallinarum 


This patent 


tuf 


67 


- Enterococcus hirae 


Thispatent 


~ ' tuf 


68 


Enterococcus mundtii 


This patent 


tuf 


69 


Enterococcus pseudoavium 


This patent 


tuf 


70 


Enterococcus raffinosus 


This patent 


tuf 


71 


Enterococcus saccharofyticus 


This patent 


tuf 


72 


Enterococcus solitarius 


This patent 


tuf 


73 


Enterococcus casseliflavus 


This patent 


tuf(C) 


74 


Staphylococcus saprophyticus 


This patent 


unknown 


75 


Enterococcus flavescens 


This patent 


tuf(C) 


76 


Enterococcus gallinarum 


This patent 


tuf(C) 


77 


Ehrlichia cants 


This patent 


tuf 


78 


Escherichia coli 


This patent 


tuf 


79 


Escherichia fergusonii 


This patent 


tuf 


80 


Escherichia hermannii 


This patent 


tuf 


81 


Escherichia vulneris 


This patent 


tuf 


82 


Eubacterium lentum 


This patent 


tuf 


83 


Eubacterium nodatum 


This patent 


tuf 


84 


Ewingella americana 


This patent 


tuf 


85 


Francisella tularensis 


This patent 


tuf 


86 


Fusobacterium nucleatum subsp. potymorphum 


This patent 


tuf 


87 


Gemella haemotysans 


This patent 


tuf 


88 


Gemella morbillorum 


This patent 


tuf 


89 


Haemophilus actinomycetemcomitans 


This patent 


tuf 


90 


Haemophilus aphrophilus 


This patent 


tuf 


91 


Haemophilus ducreyi 


This patent 


tuf 


92 


Haemophilus haemolyticus 


This patent 


tuf 


93 


Haemophilus parahaemolyticus 


This patent 


tuf 


94 


Haemophilus parainfluenzae 


This patent 


tuf 


95 


Haemophilus paraphrophilus 


This patent 


tuf 


96 


Haemophilus segnis 


This patent 


tuf 


97 


Hafnia alvei 


This patent 


tuf 


98 


Kingella kingae 


This patent 


tuf 


99 


Klebsiella ornithinofytica 


This patent 


tuf 


100 


Klebsiella oxytoca 


This patent 


tuf 


101 


Klebsiella planticola 


This patent 


tuf 


102 


Klebsiella pneumoniae subsp. ozaenae 


This patent 


tuf 


103 


Klebsiella pneumoniae pneumoniae 


This patent 


tuf 


104 


Klebsiella pneumoniae subsp. rhinoscleromatis 


This patent 


tuf 


105 


Kluyvera ascorbata 


This patent 


tuf 


106 


Kluyvera cryocrescens 


This patent 


tuf 


107 


Kluyvera georgiana 


This patent 


tuf 


108 


Lactobacillus casei subsp. casei 


This patent 


tuf 


109 


Lactococcus lactis subsp. lactis 


This patent 


tuf 


110 


Leclercia adecarboxylata 


This patent 


tuf 


111 


Legionella micdadei 


This patent 


tuf 


112 


Legionella pneumophila subsp. pneumophila 


This patent 


tuf 


113 


LeminorelLa grimontii 


This patent 


tuf 


114 


Leminorella richardii 


This patent 


tuf 


115 


Leptospira interrogans 


This patent 


tuf 


116 


Megamonas hypermegale 


This patent 


tuf 


117 


Mitsuokella mukacidus 


This patent 


tuf 


118 


Mobiluncus curtisii subsp. holmesii 


This patent 


tuf 


119 


Moellerella wisconsensis 


This patent 


tuf 


120 


Moraxella catarrhalis 


This patent 


tuf 


121 


Morganella morganii subsp. morganii 


This patent 


tuf 


122 


Mycobacterium tuberculosis 


This patent 


tuf 


-> 
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Table 7. 



Origin of the nucleic acids and/or sequences in the sequence listing (continued). 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



SEQ ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source Gene* 




123 


Neisseria cinerea 


This patent 


tuf 


124 


Neisseria elongata subsp. elongata 


This patent 


tuf 


125 


Neisseria flavescens 


This patent 


tuf 


126 


Neisseria gonorrhoeae 


This patent 


tuf 


127 


Neisseria lactamica 


This patent 


tuf 


128 


Neisseria meningitidis 


This patent 


tuf 


129 


Neisseria mucosa 


—This patent 


~tuf- 


130 


Neisseria sicca 


This patent 


tuf 


131 


Neisseria subflava 


This patent 


tuf 


132 


Neisseria weaveri 


This patent 


tuf 


133 


Ochrobactrum anthropi 


This patent 


tuf 


134 


Pantoea agglomerans 


This patent 


tuf 
tuf 


135 


Pantoea dispersa 


This patent 


136 


Pasteurella multocida 


This patent 


tuf 


137 


Peptostreptococcus anaerobius 


This patent 


tuf 


138 


Peptostreptococcus asaccharotyticus 


This patent 


tuf 


139 


Peptostreptococcus prevotii 


This patent 


tuf 


140 


Porphyromonas asaccharofytica 


This patent 


tuf 


141 


Porphyromonas gingivaUs 


This patent 


tuf 

J 


142 


Pragia fontium 


This patent 


tuf 


143 


Prevotella melaninogenica 


This patent 


tuf 

J 


144 


Prevotella oralis 


This patent 


tuf 


145 


Propionibacterium acnes 


This patent 


tuf 


146 


Proteus mirabilis 


This patent 


tuf 

J 


147 


Proteus penneri 


This patent 


tuf 


148 


Proteus vulgaris 


This patent 


tuf 


149 


Providencia alcalifaciens 


This patent 


tuf 


150 


Providencia rettgeri 


This patent 


tuf 


151 


Providencia rustigianii 


This patent 


Kf 


152 


Providencia stuartii 


This patent 


tuf 


153 


Pseudomonas aeruginosa 


This patent 


tuf 


154 


Pseudomonas fluorescens 


This patent 


tuf 


155 


Pseudomonas stutzeri 


This patent 


tuf 


156 


Psychrobacter phenylpyruvicum 


This patent 


tuf 


157 


Rahnella aquatilis 


This patent 


tuf 


158 


Salmonella choleraesuis subsp. arizonae 


This patent 


tuf 


159 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


tuf 




serotype Choleraesuis 


160 


Salmonella choleraesuis subsp. diarizonae 


This patent 


tuf 


161 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


tuf 




serotype Heidelberg 






162 


Salmonella choleraesuis subsp. houtenae 


This patent 


tuf 


163 


Salmonella choleraesuis subsp. indica 


This patent 


tuf 


164 


Salmonella choleraesuis subsp. salamae 


This patent 


tuf 


165 


Salmonella choleraesuis subsp. choleraesuis serotype Typhi This patent tuf 


166 


Serratia fonticola 


This patent 


tuf 


167 


Serratia liquefaciens 


This patent 


tuf 


168 


Serratia marcescens 


This patent 


tuf 


169 


Serratia odorifera 


This patent 


tuf 


170 


Serratia plymuthica 


This patent 


tuf 


171 


Serratia rubidaea 


This patent 


tuf 


172 


Shigella boydii 


This patent 


tuf 


173 


Shigella dysenteriae 


This patent 


tuf 


174 


Shigella flexneri 


This patent 


tuf 


175 


Shigella sonnei 


This patent 


tuf 


176 


Staphylococcus aureus 


This patent 


tuf 


177 


Staphylococcus aureus 


This patent 


tuf 


178 


Staphylococcus aureus 


This patent 


tuf 


179 


Staphylococcus aureus 


This patent 


tuf 


180 


Staphylococcus aureus subsp. aureus 


This patent 


tuf 


181 


Staphylococcus auricularis 


This patent 


tuf 


182 


Staphylococcus capitis subsp. capitis 


This patent 


tuf 
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Table 7. 



Origin of the nucleic acids and/or sequences in the sequence listing (continued). 



SEQ ID NO. 



Archaeal, bacterial, fungal or parasitical species 



Source 



Gene* 



5 


183 




184 




185 




186 


1.0 _ 


187 


188 




189 




190 




191 




192 


15 


193 




194 




195 




196 




197 


20 


198 




199 




200 




201 




202 


25 


203 




204 




205 




206 




207 


30 


208 




209 




210 




211 




212 


35 


213 




214 




215 




216 




217 


40 


218 




219 




220 




221 




222 


45 


223 




224 




225 




226 




227 


50 


228 




229 




230 




231 




232 


55 


233 




234 




235 




236 




237 


60 


238 




239 




240 




241 




242 


65 


243 




244 



Macrococcus caseolyticus 
Staphylococcus cohnii subsp. cohnii 
Staphylococcus epidermidis 
Staphylococcus haemolyticus 
Staphylococcus warned 

~ Staphylococcus haemolyticus 

Staphylococcus haemolyticus 
Staphylococcus haemolyticus 
Staphylococcus hominis subsp. hominis 
Staphylococcus warneri 
Staphylococcus hominis 
Staphylococcus hominis 
Staphylococcus hominis 
Staphylococcus hominis 
Staphylococcus lugdunensis 
Staphylococcus saprophyticus 
Staphylococcus saprophyticus 
Staphylococcus saprophyticus 
Staphylococcus sciuri subsp. sciuri 
Staphylococcus warneri 
Staphylococcus warneri 
Bifidobacterium longum 
Stenotrophomonas maltophilia 
Streptococcus acidominimus 
Streptococcus agalactiae 
Streptococcus agalactiae 
Streptococcus agalactiae 
Streptococcus agalactiae 
Streptococcus anginosus 
Streptococcus bovis 
Streptococcus anginosus 
Streptococcus cricetus 
Streptococcus cristatus 
Streptococcus downei 
Streptococcus dysgalactiae 
Streptococcus equi subsp. equi 
Streptococcus ferus 
Streptococcus gordonii 
Streptococcus anginosus 
Streptococcus macacae 
Streptococcus gordonii 
Streptococcus mutans 
Streptococcus parasanguinis 
Streptococcus ratti 
Streptococcus sanguinis 
Streptococcus sobrinus 
Streptococcus suis 
Streptococcus uberis 
Streptococcus vestibularis 
Tatumella ptyseos 
Trabulsiella guamensis 
Veillonella parvula 
Yersinia enterocolitica 
Yersinia frederiksenii 
Yersinia intermedia 
Yersinia pestis 
Yersinia pseudotuberculosis 
Yersinia rohdei 
Yokenella regensburgei 

Achromobacter xylosoxidans subsp. denitrificans 
Acinetobacter baumannii 
Acinetobacter Iwoffii 



Thic nufpnt 
i ma pdicm 


tiff 


Thic natpnt 
1 Ilia pdlCLil 


tiff 

TUJ 


Thic nofpnt 
J ilia pdlclll 


tiff 
TUJ 


i nib paiciu 


tuf 


Thic natpnt 
1 Ilia pdlclll 


tirf 


i ms pdiciii 


tuf 
tuj 


1 Ilia pdlclll 


tiff 
TUJ 


i nis paiciu 


tirf 

tuj 


1 QIS pdlclll 


tirf 

TUJ 


Thic n*at/»nt 

i ms paiciu 


tirf 

tuj 


This nntont 

i ms p aie hi 


tirf 


Thic nfif**nt 

j ms paicni 


tirf 

tuj 


i ms paieni 


tuf 

tuf 


Thic notpnt 

1 ms pdiciii 


tiff 

*uj 


Thic nit*»nt 

i ms pdiciu 


tirf 
TUJ 


Thic nofont 
i ms pdlclll 


tirf 
TUJ 


i ms paieni 


tirf 
TUJ 


Thic nnt/>flt 

i ms paieni 


tuf 


j ms paieni 


tirf 
TUJ 


Thic ni te±r\t 

J nis pdieni 


tirf 
TUJ 


l ms paieni 


tirf 

tuj 


Thic Tiot<»nt 

J ms pdieni 


tirf 
TUJ 


Thic nat«»rit 

i ms pdieni 


tuf 
TUJ 


Thic notAnf 

j ms pdieni 


tirf 
TUJ 


Thic ni rt»nt 

i ms paieni 


tirf 
TUJ 


Thic nitont 

i ms pdieni 


tirf 
TUJ 


This patent 


tirf 

TUJ 


j ms patent 


tirf 

tuj 


Thic fsotAiit 

i ms pdiciu 


tiff 

TUJ 


This patent 


tirf 

TUJ 


Thin Aitanf 

l ms paieni 


tirf 
TUJ 


i ms pdieni 


tirf 
TUJ 


i ms paieni - 


tirf 
TUJ 


j ms paicni 


tirf 
TUJ 


Thic nitAiit 

i ms paiciu 


tiff 
TUJ 


Thic no font 

i ms paieni 


tirf 


i ms paieni 


tirf 
TUJ 


i ms paieni 


tirf 

tuj 


This patent 


tirf 
lU J 


Thic r\otf>r*lt 

i nis patent 


tirf 
TUJ 


Thic notortf 

j ms paieni 


tirf 
TUJ 


Thic nofAfit 

i ms paieni 


tirf 
TUJ 


Thic n-Jt*»nt 

i Ills pdlclll 


tuf 
TUJ 


Thic notAfit 

i ms pdieni 


tiif 
TUJ 


Thic natpnt 

1 ins pdiciu 


tuf 
TUJ 


in ic t\ n to rt f 

J ms pdiciu 


tirf 
TUJ 


Thic natpnt 
l ilia pdiciu 


tiff 
TUJ 


i ms paieni 


tirf 
TUJ 


Thic natpnt 
j uia pdiciii 


tiff 
lUJ 


This patent 


tuf 


This patent 


tuf 


This patent 


«4 


This patent 


tuf 


This patent 


tuf 


This patent 


tuf 


This patent 


M 


This patent 


tuf 


This patent 


tuf 


This patent 


tuf 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued)* 



SEQ ID NO. 



Archaeal, bacterial, fungal or parasitical species 



Source 


Gene* 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


JThis-patent - 


-atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 
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245 
246 
247 
248 
249 
250 
_251. 
252 
253 
254 
255 
256 
257 
258 
259 
260 
261 
262 
263 
264 
265 
266 
267 
268 
269 
270 
272 
273 
274 
275 
276 
277 
278 
279 
280 
281 
282 
283 
284 
285 
286 
287 
288 
289 
290 
291 
292 
293 
294 
295 
296 
297 
298 
299 
300 
301 
302 
303 
304 
305 
306 



Staphylococcus saprophytics 
Alcaligenes faecalis subsp. faecalis 
Bacillus anthracis 
Bacillus cereus 
Bacteroides distasonis 
Bacteroides ovatus 

.Leclercia.adecarboxylata 

Stenotrophomonas maltophilia 
Bartonella henselae 
Bifidobacterium adolescentis 
Brucella abortus 
Cedecea davisae 
Cedecea lapagei 
Cedecea neteri 

Chryseobacterium meningosepticum 

Citrobacter amalonaticus 

Citrobacter braakii 

Citrobacter koseri 

Citrobacter farmeri 

Citrobacter freundii 

Citrobacter koseri 

Citrobacter sedlakii 

Citrobacter werkmanii 

Citrobacter youngae 

Clostridium innocuum 

Clostridium perfringens 

Corynebacterium diphtheriae 

Corynebacterium pseudodiphtheriticum 

Corynebacterium ulcerans 

Corynebacterium ureatyticum 

Coxiella burnetii 

Edwardsiella hoshinae 

Edwardsiella tarda 

Eikenella corrodens 

Enterobacter agglomerans 

Enterobacter amnigenus 

Enterobacter asburiae 

Enterobacter cancerogenus 

Enterobacter cloacae 

Enterobacter gergoviae 

Enterobacter hormaechei 

Enterobacter sakazakii 

Enterococcus avium 

Enterococcus casseliflavus 

Enterococcus durans 

Enterococcus faecalis 

Enterococcus faecium 

Enterococcus gallinarum 

Enterococcus saccharolyticus 

Escherichia fergusonii 

Escherichia hermannii 

Escherichia vulneris 

Eubacterium lentum 

Ewingella americana 

Francisella tularensis 

Fusobacterium gonidiaformans 

Fusobacterium necrophorum subsp. necrophorum 

Fusobacterium nucleatum subsp. potymorphum 

Gardnerella vaginalis 

Gemella haemolysans 

Gemella morbillorum 



190 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



Table 7. 



Origin of the nucleic acids and/ r sequences in the sequence listing (continued). 



SEQ ID NO. Archaeal, bacterial, fungal or parasitical species Source 



Gene* 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



307 


Haemophilus ducreyi 


This patent 


308 


Haemophilus haemolyticus 


This patent 


309 


Haemopfiilus parahaemolyticus 


This patent 


310 


Haemophilus parainfluenzae 


This patent 


311 


Hafnia alvei 


This patent 


312 


Kingellakihgae 


This patent 


313 


Klebsiella pneumoniae subsp. ozaenae 


This patent 


314 


Klebsiella ornithinotytica 


This patent 


315 


Klebsiella oxytoca 


This patent 


316 


Klebsiella pkmticola 


This patent 


317 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


318 


Kluyvera ascorbata 


This patent 


319 


Kluyvera cryocrescens 


This patent 


320 


Kluyvera georgiana 


This patent 


321 


Lactobacillus acidophilus 


This patent 


322 


Legionella pneumophila subsp. pneumophila 


This patent 


323 


Leminorella grimontii 


This patent 


324 


Listeria monocytogenes 


This patent 


325 


Micrococcus lylae 


This patent 


326 


Moellerella wisconsensis 


This patent 


327 


Moraxella catarrhalis 


This patent 


328 


Moraxella osloensis 


This patent 


329 


M organella morganii subsp. morganii 


This patent 


330 


Pantoea agglomerans 


This patent 


331 


Pantoea dispersa 


This patent 


332 


Pasieurella multocida 


This patent 


333 


Pragia fontium 


This patent 


334 


Proteus mirabilis 


This patent 


335 


Proteus vulgaris 


This patent 


336 


Providencia alcalifaciens 


This patent 


337 


Providencia rettgeri 


This patent 


338 


Providencia rustigianii 


This patent 


339 


Providencia stuartii 


This patent 


340 


Psychrobacter phenylpyruvicum 


This patent 


341 


Rahnella aquatilis 


This patent 


342 


Salmonella choleraesuis subsp. arizonae 


This patent 


343 


Salmonella choleraesuis subsp. choleraesuis 


This patent 




serotype Choleraesuis 


344 


Salmonella choleraesuis subsp. diarizonae 


This patent 


345 


Salmonella choleraesuis subsp. houtenae 


This patent 


346 


Salmonella choleraesuis subsp. indica 


This patent 


347 


Salmonella choleraesuis subsp. choleraesuis 


This patent 




serotype Paratyphi A 


348 


Salmonella choleraesuis subsp. choleraesuis 


This patent 




serotype Paratyphi B 


349 


Salmonella choleraesuis subsp. salamae 


This patent 


350 


Salmonella choleraesuis subsp. choleraesuis serotype Typhi This patent 


351 


Salmonella choleraesuis subsp. choleraesuis 


This patent 




serotype Typhimurium 


352 


Salmonella choleraesuis subsp. choleraesuis 


This patent 




serotype Virchow 


353 


Serratia ficaria 


This patent 


354 


Ser ratio fonticola 


This patent 


355 


Serratia grimesii 


This patent 


356 


Serratia liquefaciens 


This patent 


357 


Serratia marcescens 


This patent 


358 


Serratia odorifera 


This patent 


359 


Serratia plymuthica 


This patent 


360 


Serratia rubidaea 


This patent 


361 


Pseudomonas putida 


This patent 


362 


Shigella boydii 


This patent 


363 


Shigella dysenieriae 


This patent 



atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 

atpD 
atpD 
atpD 
atpD 

atpD 

atpD 
atpD 
atpD 

atpD 

atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
aipD 
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SEQ ID NO. 


Archaeal, bacterial, fungal or parasitical species 


SourceGene* 




364 


Shigella flexneri 


This patent 


atpD 


365 


Shigella sonnei 


This patent 


atpD 


366 


Staphylococcus aureus 


This patent 


atpD 


367 


Staphylococcus auricularis 


This patent 


atpD 


368 


Stapf^locqccj^capitis_su\^p^capitis__ 


_This patent 


atpD 


369 


Staphylococcus cohnii subsp. cohnii 


This patent 


atpD 


370 


Staphylococcus epidermidis 


This patent 


atpD 


371 


Staphylococcus haemolyticus 


This patent 


atpD 


372 


Staphylococcus hominis subsp. hominis 


This patent 


atpD 


373 


Staphylococcus hominis 


This patent 


atpD 


374 


Staphylococcus lugdunensis 


This patent 


atpD 


375 


Staphylococcus saprophyticus 


This patent 


atpD 


376 


Staphylococcus simulans 


This patent 


atpD 


377 


Staphylococcus warned 


This patent 


atpD 


378 


Streptococcus acidominimus 


This patent 


atpD 


379 


Streptococcus agalactiae 


This patent 


atpD 


380 


Streptococcus agalactiae 


This patent 


atpD 


381 


Streptococcus agalactiae 


This patent 


atpD 


382 


Streptococcus agalactiae 


This patent 


atpD 


383 


Streptococcus agalactiae 


This patent 


atpD 


384 


Streptococcus dysgalactiae 


This patent 


atpD 


385 


Streptococcus equi subsp. equi 


This patent 


atpD 


386 


Streptococcus anginosus 


This patent 


atpD 


387 


Streptococcus salivarius 


This patent 


atpD 


388 


Streptococcus suis 


This patent 


atpD 


389 


Streptococcus uheris 


This patent 


atpD 


390 


Tatumella ptyseos 


This patent 


atpD 


391 


Trabulsiella guamensis 


This patent 


atpD 


392 


Yersinia bercovieri 


This patent 


atpD 


393 


Yersinia enterocolinca 


This patent 


atpD 


394 


Yersinia frederiksenii 


This patent 


atpD 


395 


Yersinia intermedia 


This patent 


atpD 


396 


Yersinia pseudotuberculosis 


This patent 


atpD 


397 


Yersinia rohdei 


This patent 


atpD 


398 


Yokenella regensburgei 


This patent 


atpD 


399 


Yarrowia lipolytica 


This patent 


te/(EF-l) 


400 


Absidia corymbifera 


This patent 


ta/(EF-l) 


401 


Alternaria alternata 


This patent 


tuf(ET-l) 


402 


Aspergillus flaws 


This patent 


/m/(EF-1) 


403 


Aspergillus fumigatus 


This patent 


/«/(EF-l) 


404 


Aspergillus fumigatus 


This patent 


/w/(EF-l) 


405 


Aspergillus niger 


This patent 


r«/(EF-l) 


406 


Blastoschizomyces capitatus 


This patent 


to/(EF-l) 


407 


Candida albicans 


This patent 


/w/(EF-l) 


408 


Candida albicans 


This patent 


/w/(EF-l) 


409 


Candida albicans 


This patent 


/m/(EF-1) 


410 


Candida albicans 


This patent 


ta/(EF-l) 


411 


Candida albicans 


This patent 


fw/(EF-l) 


412 


Candida dubliniensis 


This patent 


fw/(EF-l) 


413 


Candida catenulata 


This patent 


tojf(EF-l) 


414 


Candida dubliniensis 


This patent 


/w/(EF-l) 


415 


Candida dubliniensis 


This patent 


/w/(EF-l) 


416 


Candida famata 


This patent 


mf(EF-l) 


417 


Candida glabrata 


WO98/20157 


tuf (EVA) 


418 


Candida guilliermondii 


This patent 


mf(EF-l) 


419 


Candida haemulonii 


This patent 


m/(EF-l) 


420 


Candida inconspicua 


This patent 


ta/(EF-l) 


421 


Candida kefyr 


This patent 


fw/(EF-l) 


422 


Candida krusei 


WO98/20157 


rw/(EF-l) 


423 


Candida lambica 


This patent 


f W /(EF-l) 


• 424 


Candida lusitaniae 


This patent 


rw/(EF-l) 


425 


Candida norvegensis 


This patent 


tuf(E¥-l) 
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ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


426 


Candida parapsilosis 


WO98/20157 


m/(EF-l) 


427 


Candida rugosa 


This patent 


tuf (EF-l) 


428 


Candida sphaerica 


This patent 


/w/(EF-l) 


429 


Candida tropicalis 


WO98/20157 


tuf (EF-l) 


430 


Candida utilis 


This patent 


tajf(EF-l) 


431 


Candida viswanathii 


This patent 


ft// (EF-l) 


-432 


Candida zeylanoides 


This patent 


/m/(EF-1) 


433 


Coccidioides immitis 


This patent 


rw/(EF-l) 


434 


Cryptococcus albidus 


This patent 


tuf (EF-l) 


435 


Exophiala jeanselmei 


This patent 


tuf (EF-l) 


436 


Fusarium oxysporum 


This patent 


tuf (EF-l) 


437 


Geotrichum sp. 


This patent 


rw/(EF-l) 


438 


Histoplasma capsulation 


This patent 


/w/(EF-l) 


439 


Issatchenkia orientalis Kudrjanzev 


This patent 


tuf (EF-l) 


440 


Malassezia furfur 


This patent 


tuf (EF-l) 


441 


Malassezia pachydermatis 


This patent 


tuf (EF-l) 


442 


Malbrdnchea filamentosa 


This patent 


m/(EF-l) 


443 


Metschnikowia pulcherrima 


This patent 


tuf (EF-l) 


444 


Paecilomyces lilacinus 


This patent 


tuf (EF-l) 


445 


Paracoccidioides brasiliensis 


This patent 


tuf (EF-l) 


446 


Penicillium marneffei 


This patent 


ft// (EF-l) 


447 


Pichia anomala 


This patent 


ft// (EF-l) 


448 


Pichia anomala 


This patent 


ft// (EF-l) 


449 


Pseudallescheria boydii 


This patent 


tuf (EF-l) 


450 


Rhizopus oryzae 


This patent 


tuf (EF-l) 


451 


Rhodotorula minuta 


This patent 


ft// (EF-l) 


452 


Sporobolomyces salmonicolor 


This patent 


tuf (EF-l) 


453 


Sporothrix schenckii 


This patent 


tuf (EF-l) 


454 


Stephanoascus ciferrii 


This patent 


ft// (EF-l) 


455 


Trichophyton mentagrophytes 


This patent 


tuf (EF-l) 


456 


Trichosporon cutanewn 


This patent 


tuf (EF-l) 


457 


Wangiella dermathidis 


This patent 


ft// (EF-l) 


458 


Aspergillus fumigatus 


This patent 


atpD 


459 


Blastoschizomyces capitatus 


This patent 


atpD 


460 


Candida albicans 


This patent 


atpD 


461 


Candida dubliniensis 


This patent 


atpD 


462 


Candida famata 


This patent 


atpD 


463 


Candida glabrata 


This patent 


atpD 


464 


Candida guilliennondii 


This patent 


atpD 


465 


Candida haemulonii 


This patent 


atpD 


466 


Candida inconspicua 


This patent 


atpD 


467 


Candida kefyr 


This patent 


atpD 


468 


Candida krusei 


This patent 


atpD 


469 


Candida lambica 


This patent 


atpD 


470 


Candida lusitaniae 


This patent 


atoD 


471 


Candida norvegensis 


This patent 


atpD 


472 


Candida parapsilosis 


This patent 


atpD 


473 


Candida rugosa 


This patent 


atpD 


474 


Candida sphaerica 


This patent 


atpD 


475 


Candida tropicalis 


This patent 


atpD 


476 


Candida utilis 


This patent 


atpD 


477 


Candida viswanathii 


This patent 


atpD 


478 


Candida zeylanoides 


This patent 


aipD 


479 


Coccidioides immitis 


This patent 


atpD 


480 


Cryptococcus albidus 


This patent 


atpD 


481 


Fusarium oxysporum 


This patent 


atpD 


482 


Geotrichum sp. 


This patent 


atpD 


483 


Histoplasma capsulatum 


This patent 


aipD 


484 


Malassezia furfur 


This patent 


aipD 


485 


Malassezia pachydermatis 


This patent 


atpD 


486 


Metschnikowia pulcherrima 


This patent 


atpD 


487 


Penicillium marneffei 


This patent 


atpD 
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Aicnaeai, Dactenai, rungs i or parasitical species 


Source 


Oene* 


A DO 

488 


Pichia anomala 


This patent 


atpD 


489 


Pichia anomala 


This patent 


atpD 


4 A A 

490 


Rhodotorula minuta 


This patent 


atpD 


491 


Rhodotorula mucilaginosa 


This patent 


atpD 


492 


Sporobolomyces salmonicolor 


This patent 


atpD 


493 


Sporothrix schenckii 


This patent 


atpD 


_494 


Sttphanoascus_ ciferrii 


-This patent — 


- atpD 


495 


Trichophyton mentagrophytes 


This patent 


atpD 


496 


Wangiella dermatitidis 


This patent 


atpD 


497 


Yarroma lipolytica 


This patent 


atpD 


498 


Aspergillus fumigatus 


This patent 


tuf(M) 


499 


Blastoschizomyces capitatus 


This patent 


tttf(M) 


500 


Candida rugosa 


This patent 


tuf(M) 


501 


Coccidioides immitis 


This patent 


tufQA) 


502 


Fusarium oxysporum 


This patent 


tirfQA) 


503 


Histoplasma capsulation 


This patent 


tufQA) 


504 


Paracoccidioides brasiliensis 


This patent 


tufQA) 


505 


Penicillium marneffei 


This patent 


tufQA) 


506 


Pichia anomala 


This patent 


nj/(M) 


507 


Trichophyton mentagrophytes 


This patent 


tuf(M) 


508 


Yarroma lipolytica 


This patent 


tuf(M) 


509 


Babesia bigemina 


This patent 


tuf(EFA) 


510 


Babesia bovis 


This patent 


fwjf (EF-1) 


511 


Crithidia fasciculata 


This patent 


n/f (EF-D 


512 


Entamoeba histolytica 


This patent 


tuf(EF-l) 


513 


Giardia lamblia 


This patent 


tuf(EF-l) 


514 


Leislunania tropica 


This patent 


tuf(EF-l) 


515 


Leishmania aethiopica 


This patent 


rt$f(EF-l) 


516 


Leishmania tropica 


This patent 


n^(EF-l) 


517 


Leishmania donovani 


This patent 


tuf(£¥-l) 


518 


Leishmania infantum 


This patent 


titf(EF-l) 


519 


Leishmania enriettii 


This patent 


/wjf(EF-l) 


520 


Leishmania gerbilli 


This patent 


/uf(EF-l) 


521 


Leishmania hertigi 


This patent 


tuf(EF-l) 


522 


Leishmania major 


This patent 


tuf(EF-l) 


523 


Leishmania amazonensis 


This patent 


fwf(EF-l) 


524 


Leishmania mexicana 


This patent 


tuf(EF-l) 


525 


Leishmania tarentolae 


This patent 


tuf(EF-l) 


526 


Leishmania tropica 


This patent 


fwf(EF-l) 


527 


Neospora caninum 


This patent 


ntf(EF-\) 


528 


Trichomonas vaginalis 


This patent 


tuf(EF-l) 


529 


Trypanosoma brucei subsp. brucei 


This patent 


rwjf(EF-l) 


530 


Crithidia fasciculata 


This patent 


atpD 


531 


Leishmania tropica 


This patent 


atpD 


532 


Leishmania aethiopica 


This patent 


atpD 


533 


Leishmania donovani 


This patent 


atpD 


534 


Leishmania infantum 


This patent 


atpD 


535 


Leishmania gerbilli 


This patent 


atpD 


536 


Leishmania hertigi 


This patent 


atpD 


537 


Leishmania major 


This patent 


atpD 


538 


Leishmania amazonensis 


This patent 


atpD 


607 


Enterococcus faecalis 


WO98/20157 


*f 


608 


Enterococcus faecium 


WO98/20157 


tuf 


609 


Enterococcus gallinarum 


WO98/20157 


tuf 


OIU 


naemopnuus influenzae 


wuyo/zuij / 


tUf 


611 


Staphylococcus epidermidis 


WO98/20157 


tuf 


612 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


ntf 




serotype Paratyphi A 




613 


Serratia ficaria 


This patent 


ntf 


614 


Enterococcus malodoratus 


This patent 


n/(Q 


615 


Enterococcus durans 


This patent 


titf(C) 


616 


Enterococcus pseudoavium 


This patent 


ntf(C) 
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SEQ ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


617 


Enterococcus dispar 


This patent 


tuf(Q 


618 


Enterococcus avium 


This patent 


wf(Q 


619 


Saccharomyces cerevisiae 


Database 


tuf(M) 


621 


Enterococcus faecium 


This patent 


tuf{Q) 


622 


Saccharomyces cerevisiae 


This patent 


W/(EF-1) 


623 


Cryptococcus neoformans 


This patent 


m/(EF-l) 


624 


Candida albicans 


-WO98/20157- - 


tuf (EF-1)" " 


662 


Corynebacterium diphtheriae 


WO98/20157 


tuf 


663 


Candida catenulata 


This patent 


atpD 


665 


Saccharomyces cerevisiae 


Database 


/w/(EF-l) 


666 


Saccharomyces cerevisiae 


Database 


atpD 
atpD 


667 


Trypanosoma cruzi 


This patent 


668 


Corynebacterium glutamicum 


Database 


tuf 


669 


Escherichia coli 


Database 


atpD 


670 


Helicobacter pylori 


Database 


atpD 


671 


Clostridium acetobutylicum 


Database 


atpD 


672 


Cytophaga fytica 


Database 


atpD 


673 


Ehrlichia risticii 


This patent 


atpD 


674 


Vibrio cholerae 


This patent 


atpD 


675 


Vibrio cholerae 


This patent 


tuf 


676 


Leishmania enriettii 


This patent 


atpD 


677 


Babesia microti 


This patent 


ft//(EF-l) 


678 


Cryptococcus neoformans 


This patent 


atpD 


679 


Cryptococcus neoformans 


This patent 


atpD 


680 


Cunninghamella bertholletiae 


This patent 


atpD 


684 


Candida tropicalis 


Database 


atpD(V) 


685 


Enterococcus hirae 


Database 


atpD(V) 


686 


Chlamydia pneumoniae 


Database 


atpD (V) 


687 


Halobacterium salinarum 


Database 


atpD(V) 


688 


Homo sapiens 


Database 


atpD(V) 


689 


Plasmodium falciparum 


Database 


atpD (V) 


690 


Saccharomyces cerevisiae 


Database 


atpD(V) 


691 


Schizosaccharomyces pombe 


Database 


atpD (V) 


692 


Trypanosoma congolense 


Database 


atpD (V) 


693 


Thermus thermophilus 


Database 


atpD (V) 


698 


Escherichia coli 


WO98/20157 


tuf 


709 


Borrelia burgdorferi 


Database 


atpD (V) 


710 


Treponema pallidum 


Database 


atpDQf) 


711 


Chlamydia trachomatis 


Genome project 


atpD{V) 


712 ' 


Enterococcus faecalis 


Genome project 


atpD(V) 


713 


Methanosarcina barkeri 


Database 


atpD (V) 


714 


Methanococcus jannaschii 


Database 


atpD (V) 


715 


Porphyromonas gingivalis 


Genome project 


atpD (V) 


716 


Streptococcus pneumoniae 


Genome project 


atpD (V) 


717 


Burkholderia mallei 


This patent 


tuf 


718 


Burkholderia pseudomallei 


This patent 


tuf 


719 


Clostridium beijerinckii 


This patent 


tuf 


720 


Clostridium innocuum 


This patent 


tuf 


721 


Clostridium novyi 


This patent 


tuf 


722 


Clostridium septicum 


This patent 


tuf 


723 


Clostridium tertium 


This patent 


tuf 


724 


Clostridium tetani 


This patent 


tuf 


725 


Enterococcus malodoratus 


This patent 


tuf 


726 


Enterococcus sulfureus 


This patent 


tuf 


727 


Lactococcus garvieae 


This patent 


tuf 


728 


Mycoplasma pirum 


This patent 


tuf 


729 


Mycoplasma salivarium 


This patent 


tuf 


730 


Neisseria polysaccharea 


This patent 


tuf 


731 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


tuf 




serotype Enteritidis 
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SEO ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


732 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


tuf 




serotype Gallinarum 


733 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


tuf 




serotype Paratyphi B 


734 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


tuf 




serotype Virchow 






735 


Serratia grimesii ~~ 


This patent 


tuf 
tuf 


736 


Clostridium difficile 


This patent 


737 


Burkholderia pseudomallei 


This patent 


aipD 


738 


Clostridium bifermentans 


This patent 


atnD 


739 


Clostridium beijerinckii 


This patent 


aipD 


740 


Clostridium difficile 


This patent 


atpD 


741 


Clostridium ramosum 


This patent 


awD 


742 


Clostridium septicum 


This patent 


atpD 


743 


Clostridium tertium 


This patent 


aipD 


744 


Comamonas acidovorans 


This patent 


atvD 


745 


Klebsiella pneumoniae subsp. rhinoscleromatis 


This patent 


atpD 


746 


Neisseria canis 


This patent 


atpD 


747 


Neisseria cinerea 


This patent 


atpD 


748 


Neisseria cuniculi 


This patent 


atpD 


749 


Neisseria elongata subsp. elongata 


This patent 


atpD 


750 


Neisseria flavescens 


This patent 


atpD 


751 


Neisseria gonorrhoeae 


This patent 


atnD 


752 


Neisseria gonorrhoeae 


This patent 


atpD 


753 


Neisseria lactamica 


This patent 


atpD 


754 


Neisseria meningitidis 


This patent 


atpD 


755 


Neisseria mucosa 


This patent 


atpD 


756 


Neisseria subflava 


This patent 


atpD 


757 


Neisseria weaveri 


This patent 


atpD 


758 


Neisseria animalis 


This patent 


atpD 


759 


Proteus penneri 


This patent 


atpD 


760 


Salmonella choleraesuis subsp. choleraesuis 


This patent 


atpD 




serotype Enteritidis 




761 


Yersinia pestis 


This patent 


atpD 


762 


Burkholderia mallei 


This patent 


atpD 


763 


Clostridium sordellii 


This patent 


atpD 


764 


Clostridium novyi 


This patent 


atpD 


765 


Clostridium botulinum 


This patent 


aipD 


766 


Clostridium histolyticum 


This patent 


atpD 


767 


Peptostreptococcus prevotii 


This patent 


atpD 


768 


Absidia corymbifera 


This patent 


atpD 


769 


Alternaria alternata 


This patent 


atpD 


770 


A spergillus flavus 


This patent 


atpD 


771 


Mucor circinelloides 


This patent 


atpD 


772 


Piedraia hortai 


This patent 


atpD 


773 


Pseudallescheria boydii 


This patent 


atpD 


774 


Rhizopus oryzae 


This patent 


atpD 


775 


Scopulariopsis koningii 


This patent 


atpD 


776 


Trichophyton mentagrophytes 


This patent 


atpD 


777 


Trichophyton tonsurans 


This patent 


atpD 


778 


Trichosporon cutaneum 


This patent 


atpD 


779 


Cladophialophora carrionii 


This patent 


tuf(EF-\) 


780 


Cunninghamella bertholletiae 


This patent 


to/(EF-l) 


781 


Curvularia lunata 


This patent 


r«/(EF-l) 


782 


Fonsecaea pedrosoi 


This patent 


tuf (EF-1) 


783 


Microsporum audouinii 


This patent 


rwf(EF-l) 


784 


Mucor circinelloides 


This patent 


rwf(EF-l) 


785 


Phialophora verrucosa 


This patent 


fn/(EF-l) 


786 


Saksenaea vasiformis 


This patent 


/u/(EF-l) 


787 


Syncephalastrum racemosum 


This patent 


f«/(EF-l) 


788 


Trichophyton tonsurans 


This patent 


m/(EF-l) 


789 


Trichophyton mentagrophytes 


This patent 


m/(EF-l) 
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ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


790 


Bipolaris hawaiiensis 


This patent 


fw/(EF-l) 


791 


Aspergillus fumigatus 


This patent 


tufQA) 


792 


Trichophyton mentagrophytes 


This patent 


tufQA) 


827 


Clostridium novyi 


This patent 


atpD (V) 


828 


Clostridium difficile 


This patent 


atpD (V) 


829 


Clostridium septicum 


This patent 


atoD CV) 


830 


Clostridium botulinum- 


" This patent 


atpD (V) 


831 


Clostridium perfringens 


This patent 


atpD (V) 


832 


Clostridium tetani 


This patent 


atpD (V) 


833 


Streptococcus pyogenes 


Database 


atpD (V) 


834 


Babesia bovis 


This patent 


atpD (V) 


835 


Cryptosporidium parvum 


This patent 


atpD (V) 


836 


Leishmania infantum 


This patent 


atpD (V) 


837 


Leishmania major 


This patent 


atpD (V) 


838 


Leishmania tarentolae 


This patent 


atvD (V) 


839 


Trypanosoma brucei 


This patent 


atpD (V) 


840 


Trypanosoma cruzi 


This patent 


tuf YEF-11 


841 


Trypanosoma cruzi 


This patent 




842 


Trypanosoma cruzi 


This patent 


tuf(EFA) 


843 


Babesia bovis 


This patent 


tuf(M) 


844 


Leishmania aethiopica 


This patent 


tufQA) 


845 


Leishmania amazonensis 


This patent 


tuf CM) 


846 


Leishmania donovani 


This patent 


tufQA) 


847 


Leishmania infantum 


This patent 


tufQA) 


848 


Leishmania enriettii 


This patent 


tufQA) 


849 


Leishmania gerbilli 


This patent 


tufQA) 


850 


Leishmania major 


This patent 


tufQA) 


851 


Leishmania mexicana 


This patent 


tufQA) 


852 


Leishmania tarentolae 


This patent 


tufQA) 


853 


Trypanosoma cruzi 


This patent 


tufQA) 


854 


Trypanosoma cruzi 


This patent 


tufQA) 


855 


Trypanosoma cruzi 


This patent 


tufQA) 


856 


Babesia bigemina 


This patent 


atpD 


857 


Babesia bovis 


This patent 


atpD 


858 


Babesia microti 


This patent 


atpD 


859 


Leishmania guyanensis 


This patent 


atpD 


860 


Leishmania mexicana 


This patent 


atpD 


861 


Leishmania tropica 


This patent 


atpD 


862 


Leishmania tropica 


This patent 


atpD 


863 


Bordetella pertussis 


Database 


tuf 


864 


Trypanosoma brucei brucei 


Database 


tuf(E¥A) 


865 


Cryptosporidium parvum 


This patent 


rw/(EF-l) 


866 


Staphylococcus saprophyticus 


This patent 


atpD 


867 


Zoogloea ramigera 


This patent 


atpD 


868 


Staphylococcus saprophyticus 


This patent 


tuf 


869 


Enterococcus casseliflavus 


This patent 


tuf 
j 


870 


Enterococcus casseliflavus 


This patent 


tuf 

J 


871 


Enterococcus flavescens 


This patent 


tuf 

J 


872 


Enterococcus gallinarum 


This patent 


tuf 

J 


873 


Enterococcus gallinarum 


This patent 


tuf 


874 


Staphylococcus haemolyticus 


This patent 


tuf 


875 


Staphylococcus epidermidis 


This patent 


tuf 


876 


Staphylococcus epidermidis 


This patent 


tuf 


877 


Staphylococcus epidermidis 


This patent 


tuf 


878 


Staphylococcus epidermidis 


This patent 


tuf 


879 


Enterococcus gallinarum 


This patent 


tuf 


880 


Pseudomonas aeruginosa 


This patent 


"t 


881 


Enterococcus casseliflavus 


This patent 


tuf 


882 


Enterococcus casseliflavus 


This patent 


tuf 


883 


Enterococcus faecalis 


This patent 


tuf 


884 


Enterococcus faecalis 


This patent 


tuf 


885 


Enterococcus faecium 


This patent 


tuf 
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SEQ ID NO. Archaeal. bacterial, fungal or parasitical species 


Source 


Gene* 


5 


886 


Enterococcus faecium 


This patent 


taf 




887 


Zoogloea ramigera 


This patent 


tuf 




888 


Enterococcus faecalis 


This patent 


tuf 




889 


Aspergillus fumigatus 


This patent 


atpD 


10 


890 


Penicillium marneffei 


This patent 


atpD 


891 


Paecllomyces lilacinus 


This patent 


atpD_ 




- 892- 


Penicillium marneffei 


"~ This patent 


atpD 




893 


Sporothrix schenckii 


This patent 


atpD 




894 


Malbranchea filamentosa 


This patent 


atpD 


15 


895 


Paecllomyces lilacinus 


This patent 


atpD 


896 


Aspergillus niger 


This patent 


atpD 




897 


Aspergillus fumigatus 


This patent 


ru/(EF-l) 




898 


Penicillium marneffei 


This patent 


fw/(EF-l) 




899 


Piedraia hortai 


This patent 


ruf (EF-1) 


20 


900 


Paecllomyces lilacinus 


This patent 


fw/(EF-l) 


901 


Paracoccidioides brasiliensis 


This patent 


ru/(EF-l) 




902 


Sporothrix schenckii 


This patent 


m/(EF-l) 




903 


Penicillium marneffei 


This patent 


tuf (EF-1) 




904 


Curvularia lunata 


This patent 


tufQA) 


25 


905 


Aspergillus niger 


This patent 


tuf(M) 


906 


Bipolaris hawaiiensis 


This patent 


tufQA) 




907 


Aspergillus flavus 


This patent 


tufQA) 




908 


Alternaria aUernata 


This patent 


tufQA) 




909 


Penicillium marneffei 


This patent 


tufQA) 


30 


910 


Penicillium marneffei 


This patent 


tufQA) 




918 Escherichia coli 


Database 


recA 






929 Bacteroides fragilis 


This patent 


atpD(V) 






930 Bacteroides distasonis 


This patent 


atpD (V) 






931 Porphyromonas asaccharofytica 


This patent 


atpD (V) 


35 




932 Listeria monocytogenes 


This patent 


tuf 




939 Saccharomyces cerevisiae 


Database 


recA (Rad51> 






940 Saccharomyces cerevisiae 


Database 


recA (DmclJ 






941 Cryptococcus humicolus 


This patent 


atpD 






942 Escherichia coli 


This patent 


atpD 


40 




943 Escherichia coli 


This patent 


atpD 




944 Escherichia coli 


This patent 


atpD 






945 Escherichia coli 


This patent 


atpD 






946 Neisseria polysaccharea 


This patent 


atpD 






947 Neisseria sicca 


This patent 


atpD 


45 




948 Streptococcus mitis 


This patent 


atpD 




949 Streptococcus mitis 


This patent 


atpD 






950 Streptococcus mitis 


This patent 


atpD 






951 Streptococcus oralis 


This patent 


atpD 






952 Streptococcus pneumoniae 


This patent 


atpD 


50 




953 Streptococcus pneumoniae 


This patent 


atpD 




954 Streptococcus pneumoniae 


This patent 


atpD 






955 Streptococcus pneumoniae 


This patent 


atpD 






956 Babesia microti 


This patent 


atpDQJ) 






957 Entamoeba histolytica 


This patent 


atpD (V) 


55 




958 Fusobacterium nucleatum subsp. polymorphum 


This patent 


atpD(V) 




959 Leishmania aethiopica 


This patent 


atpD (V) 






960 Leishmania tropica 


This patent 


atpDQJ) 






961 Leishmania guyanensis 


This patent 


atpD(V) 






962 Leishmania donovani 


This patent 


atpD(V) 


60 




963 Leishmania hertigi 


This patent 


atpD(V) 




964 Leishmania mexicana 


This patent 


atpD (V) 






965 Leishmania tropica 


This patent 


atpDQT) 






966 Peptostreptococcus anaerobius 


This patent 


atpD (V) 






967 Bordetella pertussis 


This patent 


tuf 


65 




968 Bordetella pertussis 


This patent 


tuf 




969 Enterococcus columbae 


This patent 


tuf 
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CCA TT\ VTi°V 

bt\j ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source Gene* 




970 


ErUerococcus flavescens 


This patent 


tuf 


971 


Streptococcus pneumoniae 


This patent 


tuf 


972 


Escherichia coli 


This patent 


tuf 


973 


Escherichia coli 


This patent 


tuf 


974 


Escherichia coli 


This patent 


tuf 


975 


Escherichia coli 


This patent 


tuf 


976 


Mycobacterium avium 


This patent 


tuf 


977 


Streptococcus pneumoniae 


TOs patent 


~tuf — 


978 


Mycobacterium gordonae 


This patent 


tuf 


979 


Streptococcus pneumoniae 


This patent 


tuf 


980 


Mycobacterium tuberculosis 


This patent 


tuf 


981 


Staphylococcus warneri 


This patent 


tuf 


982 


Streptococcus mitis 


This patent 




983 


Streptococcus mitis 


This patent 


tuf 


984 


Streptococcus mitis 


This patent 


tuf 


985 


Streptococcus oralis 


This patent 


tuf 


986 


Streptococcus pneumoniae 


This patent 


tuf 


987 


Enterococcus hirae 


This patent 


turn 


988 


Enterococcus mundtii 


This patent 


tuf{C) 


989 


Enterococcus raffinosus 


This patent 


tufip) 


990 


Bacillus anthracis 


This patent 


recA 


991 


Prevotella melaninogenica 


This patent 


recA 


992 


Enterococcus casseliflavus 


This patent 


tuf 


993 


Streptococcus pyogenes 


Database 


speA 


1002 


Streptococcus pyogenes 


WO98/20157 


tuf 


1003 


Bacillus cereus 


This patent 


recA 


1004 


Streptococcus pneumoniae 


This patent 


pbpla 


1005 


Streptococcus pneumoniae 


This patent 


pbpla 


1006 


Streptococcus pneumoniae 


This patent 


pbpla 


1007 


Streptococcus pneumoniae 


This patent 


pbpla 


1008 


Streptococcus pneumoniae 


This patent 


pbpla 


1009 


Streptococcus pneumoniae 


This patent 


pbpla 


1010 


Streptococcus pneumoniae 


This patent 


pbpla 


1011 


Streptococcus pneumoniae 


This patent 


pbpla 


1012 


Streptococcus pneumoniae 


This patent 


pbpla 


1013 


Streptococcus pneumoniae 


This patent 


pbpla 


1014 


Streptococcus pneumoniae 


This patent 


pbpla 


1015 


Streptococcus pneumoniae 


This patent 


pbpla 


1016 


Streptococcus pneumoniae 


This patent 


pbpla 


1017 


Streptococcus pneumoniae 


This patent 


pbpla 


1018 


Streptococcus pneumoniae 


This patent 


pbpla 


1019 


Streptococcus pneumoniae 


This patent 


pbp2b 


1020 


Streptococcus pneumoniae 


This patent 


pbplb 


1021 


Streptococcus pneumoniae 


This patent 


pbplb 


1022 


Streptococcus pneumoniae 


This patent 


pbplb 


1023 


Streptococcus pneumoniae 


This patent 


pbplb 


1024 


Streptococcus pneumoniae 


This patent 


pbplb 


1025 


Streptococcus pneumoniae 


This patent 


pbplb 


1026 


Streptococcus pneumoniae 


This patent 


pbplb 


1027 


Streptococcus pneumoniae 


This patent 


pbplb 


1028 


Streptococcus pneumoniae 


This patent 


pbplb 


1029 


Streptococcus pneumoniae 


This patent 


pbplb 


1030 


Streptococcus pneumoniae 


This patent 


pbplb 


1031 


Streptococcus pneumoniae 


This patent 


pbplb 


1032 


Streptococcus pneumoniae 


This patent 


pbplb 


1033 


Streptococcus pneumoniae 


This patent 


pbplb 


1034 


Streptococcus pneumoniae 


This patent 


pbplx 


1035 


Streptococcus pneumoniae 


This patent 


pbplx 


1036 


Streptococcus pneumoniae 


This patent 


pbplx 


1037 


Streptococcus pneumoniae 


This patent 


pbplx 
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seo ro NO 


ArrViiipol hapfpria! fnnoal or rtitmcit iphI cnpripc 
niUlaCdl, UaWlCIIal, lUIlgaJ Ul paladlUWU opCL»lCd 


oource 


uene 


5 


1038 


Streptococcus pneumoniae 


This patent 


pbp2x 




1039 


Streptococcus pneumoniae 


This patent 


pbplx 




1040 


Streptococcus pneumoniae 


This patent 


pbp2x 




1041 


Streptococcus pneumoniae 


This patent 


pbp2x 




1042 


Streptococcus pneumoniae 




— pbplx 






-This patent - 


"To" 


1043 


Streptococcus pneumoniae 


This patent 


pbplx 




1044 


Streptococcus pneumoniae 


This patent 


pbplx 




1045 


Streptococcus pneumoniae 


This patent 


pbp2x 




1046 


Streptococcus pneumoniae 


This patent 


pbplx. 


15 


1047 


Streptococcus pneumoniae 


This patent 


pbplx 


1048 


Streptococcus pneumoniae 


This patent 


pbplx 




1049 


Enterococcus faecium 


This patent 


vanA 




1050 


Enterococcus gallinarum 


This patent 


vanA 




1051 


Enterococcus faecium 


This patent 


vanA 


20 


1052 


Enterococcus faecium 


This patent 


vanA 


1053 


Enterococcus faecium 


This patent 


vanA 




1054 


Enterococcus faecalis 


This patent 


vanA 




1055 


Enterococcus gallinarum 


This patent 


vanA 




1056 


Enterococcus faecium 


This patent 


vanA 


25 


1057 


Enterococcus flavescens 


This patent 


vanA 


1058 


Enterococcus gallinarum 


This patent 


vanCl 




1059 


Enterococcus gallinarum 


This patent 


vanCl 




1060 


Enterococcus casseliflavus 


This patent 


vanC2 




1061 


Enterococcus casseliflavus 


This patent 


vanCl 




1062 


Enterococcus casseliflavus 


This patent 


vanCl 


30 


1063 


Enterococcus casseliflavus 


This patent 


vanC2 




1064 


Enterococcus flavescens 


This patent 


vanC3 




1065 


Enterococcus flavescens 


This patent 


vanC3 




1066 


Enterococcus flavescens 


This patent 


vanC3 


35 


1067 


Enterococcus faecium 


This patent 


vanXY 


1068 


Enterococcus faecium 


This patent 


vanXY 




1069 


Enterococcus faecium 


This patent 


vanXY 




1070 


Enterococcus faecalis 


This patent 


vanXY 




1071 


Enterococcus gallinarum 


This patent 


vanXY 


40 


1072 


Enterococcus faecium 


This patent 


vanXY 


1073 


Enterococcus flavescens 


This patent 


vanXY 




1074 


Enterococcus faecium 


This patent 


vanXY 




1075 


Enterococcus gallinarum 


This patent 


vanXY 




1076 


Escherichia coli 


Database 


stx t 




1077 


Escherichia coli 


Database 


stx 2 


45 


1093 


Staphylococcus saprophyticus 


This patent 


unknown 




1117 


Enterococcus faecium 


Database 


vanB 




1138 


Enterococcus gallinarum 


Database 


vanCl 




1139 


Enterococcus faecium 


Database 


vanA 




1140 


Enterococcus casseliflavus 


Database 


vanCZ 


50 


1141 


Enterococcus faecium 


Database 


vanHAXY 




1169 


Streptococcus pneumoniae 


Database 


pbpla 




1172 


Streptococcus pneumoniae 


Database 


pbplb 




1173 


Streptococcus pneumoniae 


Database 


pbplx 




1178 


Staphylococcus aureus 


Database 


meek 


55 


1183 


Streptococcus pneumoniae 


Database 


hexA 




1184 


Streptococcus pneumoniae 


This patent 


hexA 




1185 


Streptococcus pneumoniae 


This patent 


hexA 




1186 


Streptococcus pneumoniae 


This patent 


hexA 




1187 


Streptococcus pneumoniae 


This patent 


hexA 



60 
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5 


1188 


Streptococcus oralis 


This patent 


hexA 




1189 


Streptococcus mitis 


This patent 


hexA 




1190 


Streptococcus mitis 


This patent 


hexA 




1191 


Streptococcus mitis 


This patent 


hexA 





1198 


_ . Staphylococcus saprophytics 


This patent 


_ junknown_ 


10 


1215 


Streptococcus pyogenes 


Database 


pep 




1230 


Escherichia coli 


Database 


fu/(EF-G) 




1242 


Enterococcus faecium 


Database 


ddl 




1243 


Enterococcus faecalis 


Database 


mtiF, mtID 




1244 


Staphylococcus aureus subsp. aureus 


This patent 


unknown 


15 


1245 


Bacillus anthracis 


This nfltftnt 

1 1 1 lw UOIOI II 


atpD 




1246 


Bacillus mycoides 


This patent 


atpD 




1247 


Bacillus thuringiensis 


This detent 


atpD 




1248 


Bacillus thuringiensis 


This Datent 


atpD 




1249 


Bacillus thuringiensis 


This Datent 


atpD 


20 


1250 


Bacillus weihenstephanensis 


This oatent 


atpD 




1251 


Bacillus thuringiensis 


This Datent 

• ■ liW WW IW 1 It 


atpD 




1252 


Bacillus thuringiensis 


This Datent 


atpD 




1253 


Bacillus cereus 

t*/WW*'»WW WW* WWW 


Thie Dfltfint 


atpD 




1254 


Bacillus cereus 

WdVlfl WW WW www 


Thie nfltfint 


atpD 


25 


1255 


Staphylococcus aureus 


This oatent 

■ i iiw puiv' ii 


QvrA 




1256 


Bacillus weihenstephanensis 


This oatent 


atpD 




1257 


Bacillus anthracis 


This Datent 


atpD 




1258 


Bacillus thuringiensis 


This patent 


atpD 




1259 


Bacillus cereus 


This patent 


atpD 


30 


1260 


Bacillus cereus 


This Datent 


atpD 




1261 


Bacillus thurinaiensis 


This oatent 

l lllw puiwi 11 


atpD 




1262 


Bacillus thurinaiensis 

WdU'lI WW *• * W 9 f t 'U'Vf *WIW 


This natent 

• l llw (JUID 1 It 


atpD 




1263 


Bacillus thurinaiensis 

WMVff'W«1 If IWf If I UIV* JWJW 


This oatent 


atpD 




1264 


Bacillus thuringiensis 


This patent 


atpD 


35 


1265 


Bacillus anthracis 


This patent 


atpD 




1266 


Paracoccidioides brasiliensis 

9 W# W W WW W* W t W' WW W W# WW" • W F FWf W 


This Datent 

■ 1 iiw w w> t> W 1 It 


fur* (EF-1) 




1267 


Blastomyces dermatitidis 


This patent 


tut (EF-1) 




1268 


Histoplasma capsulatum 


This patent 


ft/' (EF-1) 




1269 


Trichophyton rubrum 


This patent 


ft/r*(EF-1) 


40 


1270 


Microsporum canis 


This patent 


ft/' (EF-1) 




1271 


Aspergillus versicolor 


This patent 


ft/'(EF-1) 




1272 


Exophiala moniliae 


This patent 


ft/' (EF-1) 




1273 


Hortaea wemeckii 


This patent 


tut (EF-1) 




1274 


Fusarium solani 


This patent 


tuf (EF-1) 


45 


1275 


Aureohasidium pullulans 


This patent 


ft/f(EF-1) 




1276 


Blastomyces dermatitidis 


This patent 


tuf (EF-1) 




1277 


Exophiala dermatitidis 


This patent 


ft/f (EF-1) 




1278 


Fusarium moniliforme 


This patent 


faf (EF-1) 




1279 


Aspergillus terreus 


This patent 


ft/' (EF-1) 


50 


1280 


Aspergillus fumigatus 


This patent 


fur* (EF-1) 




1281 


Cryptococcus laurentii 


This patent 


ft/' (EF-1) 




1282 


Emmonsia parva 


This patent 


ft;' (EF-1) 




1283 


Fusarium solani 


This patent 


ft/' (EF-1) 




1284 


Sporothrix schenckii 


This patent 


ft//* (EF-1) 


55 


1285 


Aspergillus nidulans 


This patent 


ft/' (EF-1) 




1286 


Cladophialophora canionii 


This patent 


faf (EF-1) 




1287 


Exserohilum rostratum 


This patent 


ft/' (EF-1) 




1288 


Bacillus thuringiensis 


This patent 


recA 




1289 


Bacillus thuringiensis 


This patent 


recA 


60 


1299 


Staphylococcus aureus 


Database 


gyrA 




1300 


Escherichia coli 


Database 


gyrA 




1307 


Staphylococcus aureus 


Database 


gyrB 




1320 


Escherichia coli 


Database 


parC (griA) 


65 


1321 


Staphylococcus aureus 


Database 


parC (griA) 


1328 


Staphylococcus aureus 


Database 


parE (grIB) 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 





SEQ ID NO. 


Archaeal bacterial furiosi or Darasrttcal SDecies 


Source 


ID 


5 


1348 


unidentified bacterium 


Database 


aac2la 




1351 


Pseudomonas aeruginosa 


Database 


aac3lb 




1356 


Serratia marcescens 


Database 


aacSllb 




1361 


Escherichia coli 


Database 


aac3IVa 




1366 


Enterobacter cloacae 


Database 


aacSVla 


—10- 


1371 


Citrobacterkoseri- 


Database 


aacSIa 




1376 


Serratia marcescens 


Database 


aac6lc 




1381 


Escherichia coli 


Database 


ant3la 




1386 


Staphylococcus aureus 


Database 


antAla 




1391 


Escherichia coli 


Database 


aph3la 


15 


1396 


Escherichia coli 


Database 


aph3lla 




1401 


Enterococcus faecalis 


Database 


aphSllla 




1406 


Acinetobacter baumannii 


Database 


aph3Vla 




1411 


Pseudomonas aeruginosa 


Database 


blaCARB 




1416 


Klebsiella pneumoniae 


Database 


blaCMY-2 


20 


1423 


Escherichia coli 


Database 


blaCTX-M-1 




1428 


Salmonella choleraesuis subsp. choleraesuis serotype 
Typhimurium 


Database 


blaCTX-M-2 




1433 


Pseudomonas aeruginosa 


Database 


blaIMP 




1438 


Escherichia coli 


Database 


blaOXA2 


25 


1439 


Pseudomonas aeruginosa 


Database 


biaOXAW 




1442 


Pseudomonas aeruginosa 


Database 


blaPERI 




1445 


Salmonella choleraesuis subsp. choleraesuis serotype 
Typhimurium 


Database 


blaPER2 




1452 


Staphylococcus epidermidis 


Database 


dfrA 


30 


1461 


Escherichia coli 


Database 


dhfrla 




1470 


Escherichia coli 


Database 


dhfrib 




1475 


Escherichia coli 


Database 


dhfrV 




1480 


Proteus mirabilis 


Database 


dhfrVI 




1489 


Escherichia coli 


Database 


dhfrVII 


35 


1494 


Escherichia coli 


Database 


dhfrVIII 




1499 


Escherichia coli 


Database 


dhfrIX 




1504 


Escherichia coli 


Database 


dhfrXIt 




1507 


Escherichia coli 


Database 


dhfrXIII 




1512 


Escherichia coli 


Database 


dhfrXV 


40 


1517 


Escherichia coli 


Database 


dhfrXVII 




1518 


Acinetobacter Iwoffii 


This patent 


fusA 




1519 


Acinetobacter Iwoffii 


This patent 


fusA-tuf spacer 




1520 


Acinetobacter Iwoffii 


This patent 


tuf 




1521 


Haemophilus influenzae 


This patent 


fusA 


45 


1522 


Haemophilus influenzae 


This patent 


fusA-tuf spacer 




1523 


Haemophilus influenzae 


This patent 


tuf 




1524 


Proteus mirabilis 


This patent 


fusA 




1525 


Proteus mirabilis 


This patent 


fusA-tuf spacer 




1526 


Proteus mirabilis 


This patent 


tuf 


50 


1527 


Campylobacter curvus 


This patent 


atpD 




1530 


Escherichia coli 


Database 


ereA 




1535 


Escherichia coll 


Database 


ereB 




1540 


Staphylococcus haemolyticus 


Database 


HnA 




1545 


Enterococcus faecium 


Database 


iinB 


55 


1548 


Streptococcus pyogenes 


Database 


mefA 




1551 


Streptococcus pneumoniae 


Database 


mefE 




1560 


Escherichia coli 


Database 


mphA 




1561 


Candida albicans 


This patent 


M(EM) 




1562 


Candida dubliniensis 


This patent 


M(EF-1) 


60 


1563 


Candida famata 


This patent 


fu/(EM) 




1564 


Candida glabrata 


This patent 


fu/(EM) 




1565 


Candida guilliermondii 


This patent 


to/ (EM) 




1566 


Candida haemulonii 


This patent 


M(EF-t) 


65 


1567 


Candida kefyr 


This patent 


M(EF-1) 


1568 


Candida lusitaniae 


This patent 


M(EF-1) 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 



SEQ ID NO. Archaeal, bacterial, fungal or parasitical species Source Gene* 



c 
3 


1 KAO 
1009 


uanaiuQ spnaenca 


i nis paieru 


TUI tfcr-1 ) 




1570 


Candida tropicslis 


i nis patent 


tW(EF-1) 




4 C*74 

1571 


Candida viswanathii 


i nis patent 


*t ./ /CP 4 \ 

rur(EF-1) 




1572 


Alcaligenes faecalis subsp. faecalis 


This patent 


tuf 




1573 


Prevotella buccsfis 


This patent 


tuf 


1U 


- — 4 C7A 

10/4 


Succinivibrio dextrinosolvens 


This patent 


tut 




1575 


Tetragenococcus halophitus 


Tibia. maAaiaI 

This patent 


tuf 




1576 


Campylobacter jejuni su bsp . jejuni 


i nis patent 


atpD 




1577 


Campylobacter rectus 


This patent 


atpu 




lO/O 


Enterococcus casseliflavus 


This patent 


TUSA 


15 


1579 


Enterococcus gall in arum 


This patent 






4 COft 

1580 


Streptococcus mitis 


i nis patent 


tUSA 




4 CDC 

1585 


Enterococcus faecium 


Database 


satG 




1590 


Cloning vector pFW16 


Database 


tetM 




1594 


Enterococcus faecium 


uaiaDase 


vanD 


20 


* coo 

1599 


Enterococcus faecalis 


uaiaDase 


vanE 




lOUU 


Campylobacter jejuni subsp. doylei 


This patent 


atpu 




1 CA4 

lOOl 


Enterococcus sulfureus 


This patent 


atpu 




1 CftO 

loQZ 


enterococcus sot nanus 


i nis patent 


atpD 




1603 


Campylobacter sputorum subsp. sputorum 


This patent 


atpD 


25 


1604 


Enterococcus pseudoavium 


I his patent 


atpD 




1607 


Klebsiella omithinolytica 


This patent 


gyrA 




4 CAB 

1606 


Klebsiella oxytoca 


i his patent 


gyrA 




1613 


Staphylococcus aureus 


Database 


varo 




1618 


Staphylococcus cohnii 


Database 


vafC 


30 


1623 


Staphylococcus aureus 


Database 


wja 




4 COO 

1628 


Staphylococcus aureus 


Database 






1633 


Staphylococcus aureus 


Database 






4 MO 

1638 


Aspergillus fumigatus 


This patent 


atpD 




1639 


Aspergillus fumigatus 


tl; ^ — * a 

This patent 


atpD 


35 


1640 


Bacillus mycoides 


This patent 


atpD 




1641 


Bacillus mycoides 


This patent 


atpD 




1642 


Bacillus mycoides 


This patent 


atpD 




1643 


Bacillus pseudomycoides 


This patent 


atpD 




1644 


Bacillus pseudomycoides 


This patent 


atpD 


AC\ 

40 


4 GA C 

1645 


Budvicia aquatica 


This patent 


atpu 




1646 


Buttiauxella agrestis 


This patent 


atpu 




1647 


Candida norvegica 


This patent 


atpD 




1648 


Streptococcus pneumoniae 


This patent 


pop i a 




1649 


Campylobacter lari 


I his patent 


atpD 




JOOU 


Coccidioides immitis 


i nis patent 


atpD 




lOOl 


Emmonsia parva 


i nis patent 


atpD 


• 


1652 


Erwinia amylovora 


This patent 


atpD 




1 ceo 


ronsecaea peorosoi 


i nis patent 


atpD 




4 CCA 

1654 


Fusarium moniliforme 


This patent 


atpu 


50 


1655 


Klebsiella oxytoca 


This patent 


atpu 




1656 


Microsporum audouinii 


This patent 


atpu 




ice? 
io57 


UDesumDacienum proteus 


i nis patent 


atpD 




looo 


Paracoccidioides brasiliensis 


This patent 


atpD 




1 CEO 

loo9 


riosiomonas snigeiioiues 


i nis patent 


atpD 




1 oou 




1 1 HO yJB Iwl I* 


atpD 




1662 


Campylobacter curvus 


This patent 


tuf 




1663 


Campylobacter rectus 


This patent 


tuf 




1664 


Fonsecaea pedrosoi 


This patent 


tuf 




1666 


Microsporum audouinii 


This patent 


tuf 


60 


1667 


Piedraia hortai 


This patent 


tuf 




1668 


Escherichia coli 


Database 


tuf 




1669 


Saksenaea vasiformis 


This patent 


tuf 




1670 


Trichophyton tonsurans 


This patent 


tuf 




1671 


Enterobacter aerogenes 


This patent 


atpD 


65 


1672 


Bordetella pertussis 


Database 


atpD 




1673 


Arcanobacterium haemolyticum 


This patent 


tuf 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 





SEQ ID NO. 


Archaeal bacterial funaal or Darasitical soecias 


Source 


Gene* 


5 


1674 


Butyrivibrio fibrisoivens 


This patent 


tuf 




1675 


Campylobacter jejuni subsp. doylei 


This patent 


tuf 




1676 


Campylobacter lari 


This patent 


tuf 




1677 


Campylobacter sputorum subsp. sputorum 


This patent 


tuf 




1678 


Campylobacter upsaiiensis 


This patent 


tuf 


10 


1679 


- -Globicatetia sanguis 


— This patent- - 


—tuf 




1680 


Lactobacillus acidophilus 


This patent 


tuf 




1661 


Leuconostoc mesenteroides subsp. dextranicum 


This patent 


tuf 




1682 


Prevotella buccalis 


This patent 


tuf 




1683 


Ruminococcus bromii 


This patent 


tuf 


15 


1684 


Paracoccidioides brasilierisis 


This patent 


atpD 




1685 


Candida norvegica 


This patent 


M(EF-1) 




1686 


Aspergillus nidulans 


This patent 


tuf 




1687 


Aspergillus terreus 


This patent 


tuf 




1688 


Candida norvegica 


This patent 


tuf 


20 


1689 


Candida parapsilosis 


This patent 


tuf 




1702 


Streptococcus gordonii 


WO98/20157 


recA 




1703 


Streptococcus mutans 


WO98/20157 


recA 




1704 


Streptococcus pneumoniae 


WO98/20157 


recA 




1705 


Streptococcus pyogenes 


WO98/20157 


recA 


25 


1706 


Streptococcus salivarius subsp. thermophilus 


WO98/20157 


recA 




1707 


Escherichia coli 


WO98^0157 


oxa 




1708 


Enterococcus faecalis 


WO98/20157 


blaZ 




1709 


Pseudomonas aeruginosa 


WO98/20157 


aac6'-lla 




1710 


Staphylococcus aureus 


WO98/20157 


ermA 


30 


1711 


Escherichia coli 


WO98/20157 


ermB 




1712 


Staphylococcus aureus 


WO98/20157 


ermC 




1713 


Enterococcus faecalis 


WO98/20157 


vanB 




1714 


Campylobacter jejuni subsp. jejuni 


This patent 


recA 




1715 


Abiotrophia adiacens 


WO98/20157 


tuf 


35 


1716 


Abiotrophia defectiva 


WO98/20157 


tuf 




1717 


Corynebacterium accolens 


WO98/20157 


tuf 




1718 


Corynebacterium genitatium 


WO98/20157 


tuf 




1719 


Corynebacterium jeikeium 


WO98/20157 


tuf 




1720 


Corynebacterium pseudodiphtheriticum 


WO98/20157 


tuf 


40 


1721 


Corynebacterium striatum 


WO98/20157 


tuf 




1722 


Enterococcus avium 


WO98/20157 


tuf 




1723 


Gardnerella vaginalis 


WO98/20157 


tuf 




1724 


Listeria innocua 


WO98/20157 


tuf 




1725 


Listeria ivanovii 


WO98/20157 


tuf 


45 


1726 


Listeria monocytogenes 


WO98/20157 


tuf 




1727 


Listeria seeligeri 


WO98/20157 


tuf 




1728 


Staphylococcus aureus 


WO98/20157 


tuf 




1729 


Staphylococcus saprophytics 


WO98/20157 


tuf 




1730 


Staphylococcus simulans 


WO98/20157 


tuf 


50 


1731 


Streptococcus agalactiae 


WO98/20157 


tuf 




1732 


Streptococcus pneumoniae 


WO98/20157 


tuf 




1733 


Streptococcus salivarius 


WO98/20157 


tuf 




1734 


Agrobacterium radiobacter 


WO98/20157 


tuf 




1735 


Bacillus subtilis 


WO98/20157 


tuf 


55 


1736 


Bacteroides fragilis 


WO98/20157 


tuf 




1737 


Borrelia burgdorferi 


WO98/20157 


tuf 




1738 


Brevibacterium linens 


WO98/20157 


tuf 




1739 


Chlamydia trachomatis 


WO98/20157 


tuf 




1740 


Fibrobacter succinogenes 


WO98/20157 


tuf 


60 


1741 


Flavobacterium ferrugineum 


WO98/20157 


tuf 




1742 


Helicobacter pylori 


WO98/20157 


tuf 




1743 


Micrococcus iuteus 


WO98/20157 


tuf 




1744 


Mycobacterium tuberculosis 


WO98/20157 


tuf 


65 


1745 


Mycoplasma genitatium 


WO98/20157 


tuf 


1746 


Neisseria gonorrhoeae 


WO98/20157 


tuf 



204 



WO 01/23604 PCT/CAOO/01150 
Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 





SEQ ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


5 


1747 


Rickettsia prowazekii 


WO98/20157 


tuf 




1748 


Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 


WO98/20157 


tuf 




1749 


Shewanella putrefaciens 


WO98/20157 


tuf 




1750 


Stigmatella aurantiaca 


WO98/20157 


tuf 


_10_ 


.1.751 


Thiomonas cuprina 


-WO98/20157- 


-tuf 




1752 


Treponema pallidum 


WO98/20157 


tuf 




1753 


Ureaplasma ureatyticum 


WO98/20157 


tuf 




1754 


Wolinella succinogenes 


WO98/20157 


tuf 




1755 


Burkholderia cepacia 


WO98/20157 


tuf 


15 


1756 


Bacillus anthracis 


This patent 


recA 




1757 


Bacillus anthracis 


This patent 


reck 




1758 


Bacillus cereus 


This patent 


reck 




1759 


Bacillus cereus 


This patent 


reck 




1760 


Bacillus mycoides 


This patent 


reck 


20 


1761 


Bacillus pseudomycoides 


This patent 


reck 




1762 


Bacillus thuringiensis 


This patent 


reck 




1763 


Bacillus thuringiensis 


This patent 


reck 




1764 


Klebsiella oxytoca 


This patent 


gyrA 




1765 


Klebsiella pneumoniae subsp. ozaenae 


This patent 


gyrA 


25 


1766 


Klebsiella planticola 


This patent 


gyrA 




1767 


Klebsiella pneumoniae 


This patent 


gyrA 




1768 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


gyrA 




1769 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


gyrA 




1770 


Klebsiella pneumoniae subsp. rhinoscleromatis 


This patent 


gyrA 


30 


1771 


Klebsiella terrigena 


This patent 


gyrA 




1772 


Legionella pneumophila subsp. pneumophila 


This patent 


gyrA 




1773 


Proteus mirabilis 


This patent 


gyrA 




1774 


Providencia rettgeri 


This patent 


gyrA 




1775 


Proteus vulgaris 


This patent 


gyrA 


35 


1776 


Yersinia enterocolitica 


This patent 


gyrA 




1777 


Klebsiella oxytoca 


This patent 


parC (griA) 




1778 


Klebsiella oxytoca 


This patent 


parC (griA) 




1779 


Klebsiella pneumoniae subsp. ozaenae 


This patent 


parC (griA) 




1780 


Klebsiella planticola 


This patent 


parC (griA) 


40 


1781 


Klebsiella pneumoniae 


This patent 


parC (griA) 




1782 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


parC (griA) 




1783 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


parC (griA) 




1784 


Klebsiella pneumoniae subsp. rhinoscleromatis 


This patent 


parC (griA) 




1785 


Klebsiella terrigena 


This patent 


parC (griA) 


45 


1786 


Bacillus cereus 


This patent 


fusA 




1787 


Bacillus cereus 


This patent 


fusA 




1788 


Bacillus anthracis 


This patent 


fusA 




1789 


Bacillus cereus 


This patent 


fusA 




1790 


Bacillus anthracis 


This patent 


fusA 


50 


1791 


Bacillus pseudomycoides 


This patent 


fusA 




1792 


Bacillus cereus 


This patent 


fusA 




1793 


Bacillus anthracis 


This patent 


fusA 




1794 


Bacillus cereus 


This patent 


fusA 




1795 


Bacillus weihenstephanensis 


This patent 


fusA 


55 


1796 


Bacillus mycoides 


This patent 


fusA 




1797 


Bacillus thuringiensis 


This patent 


fusA 




1798 


Bacillus weihenstephanensis 


This patent 


fusA-tuf spacer 




1799 


Bacillus thuringiensis 


This patent 


fusA-tuf spacer 




1800 


Bacillus anthracis 


This patent 


fusA-tuf spacer 


60 


1801 


Bacillus pseudomycoides 


This patent 


fusA-tuf spacer 




1802 


Bacillus anthracis 


This patent 


fusA-tuf spacer 




1803 


Bacillus cereus 


This patent 


fusA-tuf spacer 




1804 


Bacillus cereus 


This patent 


fusA-tuf spacer 


65 


1805 


Bacillus mycoides 


This patent 


fusA-tuf spacer 


1806 


Bacillus cereus 


This patent 


fusA-tuf spacer 
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Table 7. Origin f the nucleic acids and/or sequences in the sequence listing (continued). 





btU \V IMU. 


Archaeat, bacterial, fungal or parasitical species 


Source 


Gene 




1A07 


PtunHtuc Karaite 


1 1119 pcilCl 11 


// iCo.ti if cnanof 
lUSM'iui opawd 




1A0A 

1 QUO 


Raniiluc parotic 


This natAnt 
1 1 no poioui 


fitcA.tiif cnanor 






Ranilh ic anthranic 


Thic nfltont 

1 1 lid JJCHtJI 11 


/ifco.l/if cnanor 
lUbM-lUI opdCol 




ifiin 


Ranilfi ic mi/A^iWfle 

Dacmus niycoiues 


Thic nntpnt 
I iiio paiciil 


htf 
luf 




1011 

loll 


DaCiuus inunnyiensis 


i iiio paioni 


hit 
lUf 


10 


1 0 It 


P, a nil ft ic narai ic 


Thic nntant 


tuf 




1 m o 

1 010 


Da nil It ip usaffaortcfonfianoncic 

DaCiuus Wuinenswpnanonsis 


Thic natont 
1 illo JJalcill 


lUf 




1 A14 
1 0 l*» 


Ro ft lit ic anthranic 
DtACIItUS al ill II aCIS 


Thic nnf Ant 
i mo paid ii 


tiff 

IUI 




1 A1 R 
1 0 1 o 


Oa nilli rc r/)n)i ic 


Thic nofont 
i iiio |Jait?i ii 


tut 




1 A1 A* 


Ranilli ic narai ic 
DaLlHUS LU la US 


Thic natpnt 
I IIIO paici 11 


tuf 

IUI 


u 
10 


1H17 
1 0 I / 


Ranilli ic anthranic 

DaCiuus aniniacis 


Thic nfltont 
1 iiio paiBi 11 


tiff 

IUI 




1 A1 A 

I o io 


Rani Hi ic naranc 
Da CI II US Off a US 


T>lic nafpnt 
1 1 no |Ja lei ii 


tuf 
tut 




1 Ol9 


Dn sv'ffi ic a nth ranic 
DaCIIIUS at llf If aCtS 


Thic nfltpnt 
1 1 no jjaici ii 


tuf 

IUI 




1 A°n 

1 0£U 


Rani Hi ic nco/ /Wrtmt//v"wWpc 
Dacuius uoauuui 1 lycuiutjo 


Thic natpnt 
1 1 no fjaiwi ii 


tuf 




1 A91 
lO£l 


O on! Hi ic na ra i tc 
DaCIIIUS Co f& US 


Thic nfltpnt 

I IIIO paid ll 


tuf 

IUI 


on 




dtrantnnnnm ic Ara/iV 
OirupWCOCCUS OiailS 


Thic natonf 
i iiio paid ll 


fi tcA 

IUSM 




I O^O 


Di iH\/inia ami a tina 
DUUVICIa aUUctllCa 


Thic nntpnt 
i iiio paiciu 


fue A 




1 Ot** 


Ri tttiai ivolia an roc tic 
DuiiidUADi'd ayioSllo 


Thic nfltpnt 
i iiio paiciii 


flKA 




1 ft9Ci 


k'/ohcio/Za nYi/t/tna 

f\ieusiwia oxyivcs 


Thic nntont 
I iiio [Jaioi ii 


fin: A 

IUSM 




1 ftOfi 
I O^D 


rlcolVTTlQnaS Suiy&IIUiuGS 


Thic natpnt 
1 1 no LJaici ll 


ft IK A 

fuo/1 


o^ 


1 A97 


0/ luwafiaiia puuciauioiio 


Thic nntsnt 
1 1 no paid ii 


fijc A 

1 uOfl 




1 A9A 


Ohoci imhsntarii im nmtGtio. 


Thic nat^nt 
i iiio paid ii 


fiic A 




1 A9Q 

I 0«vO 


rWVUSItilia UAyivGa 


Thic nfltpnt 
1 1 no paid it 


fu^A-tuf enflnor 
iujn iu/ opawd 




1 A^n 

lOOU 


P,nH\ /In! a ami istino 
DUUVICIa aUUaiICa 


Thic natpnt 
1 1 no paid ii 


fitcA-ti if cnanor 
iusr\-iui opauci 




loo » 


Plac issmnrtac c h in otln trio c 

ricSiornonas snigwioiuas 


Thic natpnt 
i iiio paid u 


fiicAmtnf cnanor 




1 00£ 


wuvsurnuaCwriufn pioiuuo 


Thic nnfont 
i iiio paid ii 


ft icA-ti if cnorof 
lusrA nut opavci 




i ooo 


one Wo/ie/ia puirciaCiyrio 


Thic natpnt 
l lilo paid u 


ft tcA-tt if cnaror 
ruori'iuf spaUd 




1 004 


DUWaUXOIId ayr&suS 


I nio paieni 


fiic A ti if ennror 
IU9r\-lui spdeer 




i QIC 


i^antpyiOuaCWi Cull 


Thic natont 
1 lllo paiuiu 


tiff 
luf 




1 QIC 
1 OOO 


l^aiTipytODaCWf WluS SUDS p. T&IUS 


Thic natont 

i nio paiem 


it if 
luf 




loo/ 


U3rnpyiOD3Ci9r T91US SUDS p. VenBrGallS 


Thic natpnt 
1 nio pdieni 


tuf 
luf 




I OOO 


DUWsUXBiiB ayreSllS 


T~Kic natonf 

I nis paierii 


tuf 
luf 






t\iGDSl6ll3 OXyiOCa 


Thic natont 

I nio paicDi 


tuf 
luf 




1 A4fi 
I 0*tU 


rlGSIUrnunaS SfliyUlluiU&S 


Thic natpnt 
i nio paid ii 


tuf 

IUI 




1A41 


Qha\A/an&fla ni itrafanionG 
oncvvautjiia puUKSiavffou9 


Thic nfltfint 
1 1 no paid ii 


tuf 




1 0*Tt 




Thic nntflnt 
• nio paid ii 


tuf 




i o**o 


fli iri\/iniQ ant tat'ma 
DUUVIUla aLfuaHl^a 


Thic natpnt 
i iiio paid ii 


tuf 

IUI 




1A44 

1 Q***T 


Ahintmnhia a/iiarxinc 

mjivllvfsl lia aLitaUBl to 


Thic nfltflnt 


atpD 




1 £MC 


AMUaliUUaClOl tut II 1 lav It lUiyilLUl 1 1 


Thic natpnt 
1 1 no paici ii 


atnn 




1 AAA 


Onr IHinhmtt i c ra nan /m 
DaSluluuVlUS I alia fun I 


Thic natpnt 
l iiio paid ii 


aipLs 




lO**/ 


DiaSiornycBs oermauiiois 


Thic natont 


aipu 




1 AdA 

io**o 


DiaSiornycus ovfTnauiiuis 


Thic nntont 
i nio paidll 






1 AAQ 


UBmpyiODaCWr COM 


Thic natont 
i nio paidii 






1 AAn 


uarnpyiODaCiBi wius suds p. jqius 


Thic natont 

i nio parent 


atnn 
aipu 




IOD 1 


l^ampyiODaCWT WIUS SUDS p. V&n&i&aliS 


This natont 

1 nis pdieni 


aipu 


*;n 

jU 




fT*onftr\ttl/\nantar n rani lie 
[^auipyiOVaClaf yiaCUIS 


Thic natpnt 
i nio paid ii 


Atnn 
aipu 




IOOO 


KsanipyiUDaCloi J&JUni o\JUSU. JUJUfil 


Thic natont 
I iiio paid ll 


atnn 
aipu 




10O** 


CniafUCOCCUS CaCvfufii 


Thic natont 
1 mo paidii 


atnn 
aipu 




1 A*^ 


t-nt&mns\nnttc nnli imhso 

cnwrucuccus Goiurnoau 


Thio natont 
i nio paivni 


atnn 
aipu 




1 occ 

1 Ov/O 


Pn to mnnnm ic W/c rvo /■ 
ciuuivcviAAJo visual 


Thic natont 
1 1 no paid ii 


atnn 
aipu 




1857 


FntfirfiPnffijG. mslcirifirzitmz 

l. i not L/oi/Owuo 1 1 laiuuui at(j& 


This Datent 

• 1110 puivlll 


atpD 




1PKA 


L/iICi fUvvvUd 1 IIUIIUUI 


Thic natpnt 

1 iiio paid n 


atpD 






t iiiai LfisLriAsUo l at III iu&Mj 


Thic natont 

1 1 no paid ii 


atoD 




1860 


Gfobicstella sanguis 


This patent 


atpD 




1861 


Lactococcus garvieae 


This patent 


atpD 


60 


1862 


Lactococcus lactis 


This patent 


atpD 




1863 


Listeria ivanovii 


This patent 


atpD 




1864 


Succinivibrio dextrinosolvens 


This patent 


atpD 




1865 


Tetragenococcus halophilus 


This patent 


atpD 




1866 


Campylobacter fetus subsp. fefus 


This patent 


recA 


65 


1867 


Campylobacter fetus subsp. venerealis 


This patent 


recA 




1868 


Campylobacter jejuni subsp. je/un/' 


This patent 


recA 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 



SEO ID NO. Archaeal, bacterial, fungal or parasitical species Source Gene* 



c 


lob? 


Enterococcus avium 


in is patent 


recA 




1870 


Enterococcus faecium 


This patent 


reck 




4 0*71 

1871 


Listeria monocytogenes 


i nis patent 


reck 




1872 


Streptococcus mitis 


This patent 


reck 




1873 


Streptococcus oralis 


This patent 


reck 


-10- 


1874 — 


Aspergillus fumigatus 


~ "This patent" 


tuf{M) 




1875 


Aspergillus versicolor 


This patent 


tuf[M) 




1876 


Basidiobolus ranarum 


This patent 


tV/(M) 




1877 


Campylobacter gracilis 


This patent 


tuf 




1878 


Campylobacter jejuni subsp. jejuni 


This patent 


tuf 


15 


1879 


Coccidioides immitis 


This patent 


tuf(M) 




1880 


Erwinia amylovora 


This patent 


tuf 




1881 


Salmonella choleraesuis subsp. choleraesuts serotype 
Typhimurium 


This patent 


tuf 




1899 


Klebsiella pneumoniae 


Database 


blaSHV 


20 


1900 


Klebsiella pneumoniae 


Database 


blaSHV 




1901 


Escherichia coli 


Database 


blaSHV 




1902 


Klebsiella pneumoniae 


Database 


blaSHV 




1903 


Klebsiella pneumoniae 


Database 


blaSHV 




1904 


Escherichia coli 


Database 


blaSHV 


25 


1905 


Pseudomonas aeruginosa 


Database 


blaSHV 




1927 


Neisseria meningitidis 


Database 


blaTEM 




1928 


Escherichia coll 


Database 


blaTEM 




1929 


Klebsiella oxytoca 


Database 


blaTEM 




1930 


Escherichia coli 


Database 


blaTEM 


30 


1931 


Escherichia coli 


Database 


blaTEM 




1932 


Escherichia coli 


Database 


blaTEM 




1933 


Escherichia coli 


Database 


blaTEM 




1954 


Klebsiella pneumoniae subsp. pneumoniae 


Database 


gyrA 




1956 


Candida inconspicua 


This patent 


fu/(M) 


35 


1957 


Candida utilis 


This patent 


tuf (M) 




1958 


Candida zeylanoides 


This patent 


tuf (M) 




1959 


Candida catenuiata 


This patent 


tuf (M) 




1960 


Candida krusei 


This patent 


tuf (M) 




1965 


Plasmid pGSOS 


Database 


sulll 


Af\ 

40 


1970 


Transposon Tn10 


Database 


tetB 




1985 


Cryptococcus neoformans 


Database 


f<v/(EF-1) 




1986 


Cryptococcus neoformans 


Database 


tuf (EF-1) 




1987 


Saccharomyces cerevisiae 


Database 


ftvf(EF-l) 




1988 


Saccharomyces cerevisiae 


Database 


fuf (EF-1) 


45 


1989 


Eremothecium gossypii 


Database 


ft// (EF-1) 




4 AAA 

1990 


Eremothecium gossypii 


Database 


tut (cr-1) 




1991 


Aspergillus oryzae 


Database 


for (EF-1) 




1992 


Aureobasidium pullulans 


Database 


tuf (EF-1) 




1993 


Histoplasma capsulatum 


Database 


fU/ (cr-1) 


50 


1994 


Neurospora crassa 


Database 


tuf (cr-1) 




4 AAC 

1995 


Podospora anserina 


Database 


tUI (cr-1) 






P/v/ncnnro f*t irvj/YiZ/fl 
rOUOSpOia LUfVfUUIla 


notohaco 
L/alaUaoe 


lui ^cr- 1 ) 




1997 


Sordaria macrospora 


Database 


M(EF-1) 




1998 


Trichoderma reesei 


Database 


tuf (EPA) 


55 


2004 


Candida albicans 


Database 


tufm 




2005 


Schizosaccharomyces pombe 


Database 


tufM 




2010 


Klebsiella pneumoniae 


Database 


blaTEM 




2011 


Klebsiella pneumoniae 


Database 


blaTEM 


60 


2013 


Kluyvera ascorbata 


This patent 


gyrA 


2014 


Kluyvera georgiana 


This patent 


gyrA 




2047 


Streptococcus pneumoniae 


Database 


pbp\k 




2048 


Streptococcus pneumoniae 


Database 


pbp\k 




2049 


Streptococcus pneumoniae 


Database 


pbp\k 



65 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 



SEQ ID NO. Archaeal, bacterial, fungal or parasitical species Source Gene* 



5 


2050 


Streptococcus pneumoniae 


Database 


pbp^A 




2051 


Streptococcus pneumoniae 


Database 


pbp\k 




2052 


Streptococcus pneumoniae 


Database 


pbp^A 




2053 


Streptococcus pneumoniae 


Database 


pPplA 




2054 


Streptococcus pneumoniae 


Database 


gyrA 


_10 


_2055_ 


Streptococcus pneumoniae 


Database 


parC 




2056 


Streptococcus pneumoniae 


This patent 


pfcplA 




2057 


Streptococcus pneumoniae 


This patent 


pPplA 




2058 


Streptococcus pneumoniae 


This patent 


pbplA 




2059 


Streptococcus pneumoniae 


This patent 


pPplA 


15 


2060 


Streptococcus pneumoniae 


This patent 


pPplA 




2061 


Streptococcus pneumoniae 


This patent 


pPplA 




2062 


Streptococcus pneumoniae 


This patent 


pPplA 




2063 


Streptococcus pneumoniae 


This patent 


pbplA 




2064 


Streptococcus pneumoniae 


This patent 


pbplA 


20 


2072 


Mycobacterium tuberculosis 


Database 


rpoB 




2097 


Mycoplasma pneumoniae 


Database 


tuf 




2101 


Mycobacterium tuberculosis 


Database 


inhA 




2105 


Mycobacterium tuberculosis 


Database 


embB 




2129 


Clostridium difficile 


Database 


cdtA 


25 


2130 


Clostridium difficile 


Database 


cdtB 




2137 


Pseudomonas outida 

f w w UUV/f f Ivl 'ww puUUU 


Genome project 


tuf 




2138 


Pseudomonas aeruginosa 


Genome project tui 




2139 


Campylobacter jejuni 


uataoase 


sipu 




2140 


Streotococcus oneumoniae 


Database 


pbpla 


30 


2144 


Staphylococcus aureus 


Database 


mupA 




2147 


Escherichia coli 


Database 


cat/ 




2150 


Escherichia coli 


Database 


catll 




2153 


Shicella flexneri 


Database 


catlll 




2156 


Clostridium perfringens 


Database 


catP 


35 


2159 


Staohvlococcus aureus 


Database 


cat 




2162 


Staohvlococcus aureus 


Database 


cat 




2165 


Salmonella typhimurium 


Database 


ppff>\\ke 




2183 


Alcaligenes faecal is subs p. faecalis 


This patent 


tuf 




2184 


Campylobacter coli 


This patent 


fusA 


40 


2185 


Succinivibrio dextrinosolvens 


This patent 


tuf 




2186 


Tetraoenococcus halODhilus 


This patent 


tuf 




2187 


CamDvlobacter ieiuni subsD ieiuni 


This patent 


fusA 




2188 


Campylobacter jejuni subsp. jejuni 


This patent 


fusA 




2189 


Leishmania guyanensis 


This patent 


atpD 


45 


2190 


Trvn&nnsinmR hmctQi brucei 

f 1 Jf km/Gr i w*0 w*f 1 IU t/i uww* fc^l Wvv I 


This patent 


atpD 




2191 


Aspergillus nidulans 


This patent 


atpD 




2192 


Leishmania panamensis 


This patent 


atpD 




2193 


AsDeroiltus nidulans 


This patent 


fuf(M) 




2194 


Aureobasidium pullulans 


This patent 


tuf(M) 


50 


2195 


Emmonsia parva 


This patent 


fuf(M) 




2196 


Exserohilum rostratum 


This patent 


frf(M) 




2197 


Fusarium moniiiforme 


This patent 


n/f(M) 




2198 


Fusarium solani 


This patent 


tuf(M) 




2199 


Histoplasma capsulatum 


This patent 


tuf(M) 


55 


2200 


Kocuria kristinae 


This patent 


tuf 




2201 


Vibrio mimicus 


This patent 


tuf 




2202 


Citrobacter freundii 


This patent 


recA 




2203 


Clostridium botulinum 


This patent 


recA 




2204 


Francisella tularensis 


This patent 


recA 


60 


2205 


Peptostreptococcus anaerobius 


This pat nt 


recA 




2206 


Peptostreptococcus asaccharolyticus 


This patent 


recA 




2207 


Providencia stuariii 


This patent 


recA 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 





SEQ ID NO. 


Archaeal. bacterial, funaal or parasitical SDecies 


Soured 


Gene* 


5 


2208 


Salmonella choteraesuis subsp. choleraesuis 










serotype Paratyphi A 


This patent 


recA 




2209 


Salmonella choleraesuis subsp. choleraesuis 








serotype Typhimurium 


This patent 


recA 




2210 


Staphylococcus saprophytics 


This patent 


recA 


"id - 


2211 


Yersinia pseudotuberculosis 


This patent 


recA 




2212 


Zoogloea ramigera 


This patent 


recA 




2214 


Abiotrophia adiacens 


This patent 


fusA 




2215 


Acinetobacter baumannii 


This patent 


fusA 




2216 


Actinomyces meyeri 


This patent 


fusA 


15 


2217 


Clostridium difficile 


This patent 


fusA 




2218 


Corynebacterium diphtheriae 


This patent 


fusA 




2219 


Enterobacter cloacae 


This patent 


fusA 




2220 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


fusA 




2221 


Listeria monocytogenes 


This patent 


fusA 


20 


2222 


Mycobacterium avium 


This patent 


fusA 




2223 


Mycobacterium gordonae 


This patent 


fusA 




2224 


Mycobacterium kansasii 


This patent 


fusA 




2225 


Mycobacterium terrae 


This patent 


fusA 




2226 


Neisseria polysaccharea 


This patent 


fusA 


25 


2227 


Staphylococcus epidermidis 


This patent 


fusA 




2228 


Staphylococcus haemolyticus 


This patent 


fusA 




2229 


Succinivibrip dextrinosolvens 


This patent 


fusA 




2230 


Tetragenococcus halophilus 


This patent 


fusA 




2231 


Veillonella parvula 


This patent 


fusA 


30 


2232 


Yersinia pseudotuberculosis 


This patent 


fusA 




2233 


Zoogloea ramigera 


This patent 


fusA 




2234 


Aeromonas hydrophila 


This patent 


fusA 




2235 


Abiotrophia adiacens 


This patent 


fusA~tuf spacer 




2236 


Acinetobacter baumannii 


This patent 


fusA-tut spacer 


35 


2237 


Actinomyces meyeri 


This patent 


fusA-tuf spacer 




2238 


Clostridium difficile 


This patent 


fusA-tuf spacer 




2239 


Corynebacterium diphtheriae 


This patent 


fusA-tut spacer 




2240 


Enterobacter cloacae 


This patent 


fusA-tuf spacer 




2241 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


fusA- tuf spacer 


40 


2242 


Listeria monocytogenes 


This patent 


fusA-tuf spacer 




2243 


Mycobacterium avium 


This patent 


fusA-tuf spacer 




2244 


Mycobacterium gordonae 


This patent 


fusA -tuf spacer 




2245 


Mycobacterium kansasii 


This patent 


fusA-tuf spacer 




2246 


Mycobacterium terrae 


This patent 


fusA-tuf spacer 


45 


2247 


Neisseria polysaccharea 


This patent 


fusA-tuf spacer 




2248 


Staphylococcus epidermidis 


This patent 


fusA-tuf spacer 




2249 


Staphylococcus haemolyticus 


This patent 


fusA-tuf spacer 




2255 


Abiotrophia adiacens 


This patent 


tuf 




2256 


Acinetobacter baumannii 


This patent 


tuf 


50 


2257 


Actinomyces meyeri 


This patent 


tuf 




2258 


Clostridium difficile 


This patent 


tuf 




2259 


Corynebacterium diphtheriae 


This patent 


tuf 




2260 


Enterobacter cloacae 


This patent 


tuf 




2261 


Klebsiella pneumoniae subsp. pneumoniae 


This patent 


tuf 


55 


2262 


Listeria monocytogenes 


This patent 


tuf 




2263 


Mycobacterium avium 


This patent 


tuf 




2264 


Mycobacterium gordonae 


This patent 


tuf 




2265 


Mycobacterium kansasii 


This patent 


tuf 




2266 


Mycobacterium terrae 


This patent 


tuf 


60 


2267 


Neisseria polysaccharea 


This patent 


tuf 




2268 


Staphylococcus epidermidis 


This patent 


tuf 




2269 


Staphylococcus haemolyticus 


This patent 


tuf 




2270 


Aeromonas hydrophila 


This patent 


tuf 




2271 


Bilophila wadsworthia 


This patent 


tuf 


65 


2272 


Brevundimonas diminuta 


This patent 


tuf 




2273 


Streptococcus mitis 


This patent 


pbpla 
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Table 7. Origin of the nucleic acids and/or sequences in the sequence listing (continued). 



SEQ ID NO. 


Archaeal, bacterial, fungal or parasitical species 


Source 


Gene* 


5 2274 


Streptococcus mitts 


This patent 


pbpla 


2275 


Streptococcus mitis 


This patent 


pbpla 


2276 


Streptococcus oralis 


This patent 


pbpla 


2277 


Escherichia coti 


This patent 


gyrA 


2278 


Escherichia coli 


This patent^ 


gy/SL. 


10 2279 


Escherichia coli 


~ This patent 


gyrA 


2280 


Escherichia coli 


This patent 


gyrA 


2288 


Enterococcus faecium 


Database 


ddl 


2293 


Enterococcus faecium 


Database 


vanA 


2296 


Enterococcus faecalis 


Database 


vanB 



15 



* M indicates fuf sequences, tuf (C) indicates tuf sequences divergent from main (usually A and B) copies of the 
elongation factor-Tu, tuf (EF-1) indicates M sequences of the eukaryotic type (elongation factor 1a), tuf(M) indicates 
20 tuf sequences from organellar (mostly mitochondrial) origin. 

fusA indicates fusA sequences; fusA-tuf spacer indicates the intergenic region between fusA and tuf. 
atpD indicates apD sequences ot the F-type, atpD (V) indicates atpD sequences of the V-type. 
recA indicates recA sequences, rec4(Rad51) indicates rad51 sequences or homologs and recA(Dmd) indicates dmd 
sequences or homologs. 
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40 



Table 8. Bacterial species used to test the specificity of the Streptococcus agalactlae-speclllc 
amplification primers derived from tuf sequences. 



Strain 



Reference number 



Strain 



Reference number 



10 



15 



20 



25 



30 



35 



Streptococcus acidominimus ATCC 51 726 

Streptococcus agalactiae ATCC 1 2403 

Streptococcus.agalactiae. ATCC 12973 

Streptococcus agalactiae ATCC 13813 

Streptococcus agalactiae ATCC 2759 1 

Streptococcus agalactiae CDCs 1 073 

Streptococcus anginosus ATCC 27335 

Streptococcus anginosus ATCC 33397 

Streptococcus bovis ATCC 3331 7 

Streptococcus anginosus ATCC 27823 

Streptococcus cricetus ATCC 1 9642 

Streptococcus cristatus ATCC 51 1 00 

Streptococcus downei ATCC 33748 

Streptococcus dysgalactiae ATCC 43078 
Streptococcus equi s ubsp. equi ATCC 9528 

Streptococcus ferus ATCC 33477 

Streptococcus gordonii ATCC 1 0558 

Streptococcus macacae ATCC 359 1 1 

Streptococcus mitis ATCC 49456 

Streptococcus mutans ATCC 251 75 

Streptococcus oralis ATCC 35037 

Streptococcus parasanguinis ATCC 1 591 2 
Streptococcus parauberis DSM 6631 

Streptococcus pneumoniae ATCC 27336 

Streptococcus pyogenes ATCC 19615 

Streptococcus ratti ATCC 1 9645 

Streptococcus salivarius ATCC 7073 

Streptococcus sanguinis ATCC 1 0556 

Streptococcus sobrinus ATCC 27352 

Streptococcus suis ATCC 43765 

Streptococcus uberis ATCC 1 9436 

Streptococcus vestubularis ATCC 49124 



Bacteroides caccae ATCC 431 85 

Bacteroides vulgatus ATCC 8482 

Bacteroides fragilis_ ATCC 25285 

Candida albicans ATCC11006 

Clostridium innoculum ATCC 14501 

Clostridium ramosum ATCC 25582 

Lactobacillus casei subsp. easel ATCC 393 

Clostridium septicum ATCC 1 2464 

Corynebacterium cervicis NCTC 1 0604 

Corynebacterium genitalium ATCC 3303 1 

Corynebacterium urealyticum ATCC 43042 

Enterococcus faecalis ATCC 2921 2 

Enterococcus faecium ATCC 1 9434 

Eubacterium lentum ATCC 43055 

Eubacterium nodutum ATCC 33099 

Gardnerella vaginalis ATCC 1 401 8 

Lactobacillus acidophilus ATCC 4356 

Lactobacillus crispatus ATCC 33820 

Lactobacillus gasseri ATCC 33323 

Lactobacillus johnsonii ATCC 33200 

Lactococcus lactis subsp. lactis ATCC 1 9435 

Lactococcus lactis subsp. lactis ATCC 1 1 454 

Listeria innocua ATCC 33090 

Micrococcus luteus ATCC 9341 

Escherichia coli ATCC 25922 

Micrococcus lylae ATCC 27566 

Porphyromonas asaccharolytica ATCC 25260 

Prevotella corporis ATCC 33547 

Prevotella melanogenica ATCC 25845 

Staphylococcus aureus ATCC 1 3301 

Staphylococcus epidermidis ATCC 1 4990 

Staphylococcus saprophytics ATCC 1 5305 
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Table 9. Bacterial species used to test the specificity f the Streptococcus agalactiae*$pec\1ic 
amplification primers derived from atpD sequences. 



Strain 



Reference number 



Strain 



Reference number 



10 



15 



20 



Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 



acidominimus 
agalactiae 

agalactiae 

agalactiae 

agalactiae 

agalactiae 

agalactiae 

anginosus 

anginosus 

bovis 

cricetus 

cristatus 

downei 

dysgalactiae 

equi subsp. equi 

ferus 



ATCC 51726 
ATCC 12400 
ATCC 12403 
ATCC 12973 
ATCC 13813 
ATCC 27591 
CDCs-1073 
ATCC 27335 
ATCC 27823 
ATCC 33317 
ATCC 19642 
ATCC 51100 
ATCC 33748 
ATCC 43078 
ATCC 9528 
ATCC 33477 



Streptococcus 
Streptococcus 
-Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 
Streptococcus 



gordonii 
macacae 

mitis 

mutans 
oralis 

parasanguinis 

parauberis 

pneumoniae 

pyogenes 

raff/ 

salivarius 

sanguinis 

sobrinus 

suis 

uberis 

vestibularis 



ATCC 10558 
ATCC 35911 
ATCC49456- 
ATCC 25175 
ATCC 35037 
ATCC 15912 

DSM 6631 
ATCC 27336 
ATCC 19615 
ATCC 19645 
ATCC 7073 
ATCC 10556 
ATCC 27352 
ATCC 43765 
ATCC 19436 
ATCC 49124 
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Table 10. Bacterial species used to test the specificity f the Enferococcus-speciflc amplification 
primers derived fr m tuf sequences. 



Strain 



Reference number 



Strain 



Reference number 



5 Gram-positive species (n=74) 

Abiglrqf)hiaadiacen$ ATCC49176 

Abiotrophia defectiva" ATCC49T75 - 

Bacillus cereus ATCC 1 4579 

Bacillus subtilis ATCC 27370 

10 Bifidobacterium adolescentis ATCC 27534 

Bifidobacterium breve ATCC 1 5700 

Bifidobacterium dentium ATCC 27534 

Bifidobacterium longum ATCC 1 5707 

Clostridium perfringens ATCC 3 1 24 

15 Clostridium septicum ATCC 1 2464 

Corynebacterium aquaticus ATCC 1 4665 

Corynebacterium ATCC 10700 
pseudodiphtheriticum 

Enterococcus avium ATCC 14025 

20 Enterococcus casseliflavus ATCC 25788 

Enterococcus cecorum ATCC 43 1 99 

Enterococcus columbae ATCC 51 263 

Enterococcus dispar ATCC 51 266 

Enterococcus durans ATCC 19432 

25 Enterococcus faecalis ATCC 292 1 2 

Enterococcus faecium ATCC 1 9434 

Enterococcus flavescens ATCC 49996 

Enterococcus gallinarum ATCC 49573 

Enterococcus hirae ATCC 8044 

30 Enterococcus matodoratus ATCC 43 1 97 

Enterococcus mundtii ATCC 43 1 86 

Enterococcus pseudoavium ATCC 49372 

Enterococcus raffinosus ATCC 49427 

Enterococcus saccharolyticus ATCC 43076 

35 Enterococcus solitarius ATCC 49428 

Enterococcus sulfureus ATCC 49903 

Eubacterium lentum ATCC 49903 

Gemella haemolysans ATCC 1 0379 

Gemella morbillorum ATCC 27842 

40 Lactobacillus acidophilus ATCC 4356 

Leuconostoc mesenteroides ATCC 1 9225 

Listeria grayi ATCC 1 91 20 

Listeria grayi ATCC 1 91 23 



Listeria innocua ATCC 33090 
'Listeria ivanovii ~~ ATCC 19119 

Listeria monocytogenes ATCC 1 531 3 

Listeria seeligeri ATCC 35967 

Micrococcus luteus ATCC 9341 

Pediococcus acidilacti ATCC 3331 4 

Pediococcus pentosaceus ATCC 333 1 6 

Peptococcus niger ATCC 27731 
Peptostreptococcus anaerobius ATCC 27337 

Peptostreptococcus indolicus ATCC 29247 

Peptostreptococcus micros ATCC 33270 

Propionibacterium acnes ATCC 691 9 

Staphylococcus aureus ATCC 43300 

Staphylococcus capitis ATCC 27840 

Staphylococcus epidermidis ATCC 1 4990 

Staphylococcus haemolyticus ATCC 29970 

Staphylococcus hominis ATCC 27844 

Staphylococcus lugdunensis ATCC 43809 

Staphylococcus saprophytics ATCC 1 5305 

Staphylococcus simulans ATCC 27848 

Staphylococcus warneri ATCC 27836 

Streptococcus agalactiae ATCC 1 381 3 

Streptococcus anginosus ATCC 33397 

Streptococcus bovis ATCC 3331 7 

Streptococcus consteliatus ATCC 27823 

Streptococcus cristatus ATCC 51 1 00 

Streptococcus intermedius ATCC 27335 

Streptococcus mitis ATCC 49456 

Streptococcus mitis ATCC 3639 

Streptococcus mutans ATCC 27 1 75 

Streptococcus parasanguinis ATCC 1 591 2 

Streptococcus pneumoniae ATCC 27736 

Streptococcus pneumoniae ATCC 6303 

Streptococcus pyogenes ATCC 1 961 5 

Streptococcus salivarius ATCC 7073 

Streptococcus sanguinis ATCC 1 0556 

Streptococcus suis ATCC 43765 
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Table 10. Bacterial species used to test the specificity of the E nterococcus*spec\1\c amplification 
primers derived from tuf sequences (continued). 



Strain 



Reference number 



Strain 



Reference number 



5 Gram-negative species (n=39) 

Acidominococcus fermentans ATCC 2508 

Acinetobacter^ baumannii ATCC 1 9606 

Alcaligenes faecalis ATCC 8750" 

Anaerobiospirillum ATCC 29305 

10 succiniproducens 

Anaerorhabdus furcosus ATCC 25662 

Bacteroides distasonis ATCC 8503 
Bacteroides thetaiotaomicron ATCC 29741 

Bacteroides vulgatus ATCC 8482 

15 Bordetella pertussis LSPQ 3702 

Bulkholdeha cepacia LSPQ 221 7 

Butyvibrio fibrinosotvens ATCC 1 91 71 

Cardiobacterium hominis ATCC 1 5826 

Citrobacter freundii ATCC 8090 

20 Desulfovibrio vulgaris ATCC 29579 

Edwardsiellae tarda ATCC 1 5947 

Enterobacter cloacae ATCC 1 3047 

Escherichia coli ATCC 25922 

Fusobacterium russii ATCC 25533 

25 Haemophilus influenzae ATCC 9007 



Hafnia alvei 
Klebsiella oxytoca 
~Meganomonas hypermegas~ 
Mitsukoella multiacidus 
Moraxella catarrhalis 
Morganella morganii 
Neisseria meningitidis 
Pasteurella aerogenes 
Proteus vulgaris 
Providencia alcalifaciens 
Providencia rettgeri 
Pseudomonas aeruginosa 
Salmonella typhimurium 
Serratia marcescens 
Shigella flexneri 
Shigella sonnei 
Succinivibrio dextrinosolvens 
Tissierella praeacuta 
Veillonella parvula 
Yersinia enterocolitica 



ATCC 13337 
ATCC 13182 
-ATCC 25560 
ATCC 27723 
ATCC 43628 
ATCC 25830 
ATCC 13077 
ATCC 27883 
ATCC 13315 
ATCC 9886 
ATCC 9250 
ATCC 27853 
ATCC 14028 
ATCC 13880 
ATCC 12022 
ATCC 29930 
ATCC 19716 
ATCC 25539 
ATCC 10790 
ATCC 9610 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases. 



Species Strain Accession number Coding gene* 



tuf sequences 

Bacteria 



5 


Actinobacillus actinomycetemcomitans 


HK1651 


Genome project 2 


• tuf 




Actinobacillus actinomycetemcomitans 


HK1651 


Genome project 2 


tuf (EF-G) 




Agrobacterium tumefaciens 




X99673 


tuf 




Agrobacterium tumefaciens 




X99673 


tuf (EF-G) 




Agrobacterium tumefaciens 




X99674 


tuf 


10 


Anacystis nidulans 


PCC 6301 


X17442 


tuf 




Aquifex aeolicus 


VF5 


AE000669 


tuf 




Aquifex aeolicus 


VF5 


AE000669 


tuf (EF-G) 




Aquifex pyrophilus 




Genome project 2 


fuf(EF-G) 




Aquifex pyrophilus 




Y15787 


tuf 


15 


Bacillus anthracis 


Ames 


Genome project 2 


tuf 




Bacillus anthracis 


Ames 


Genome project 2 


tuf (EF-G) 




Bacillus halodurans 


C-125 


AB017508 


tuf 




Bacillus halodurans 


C-125 


AB017508 


tuf (EF-G) 




Bacillus stearothermophilus 


CCM2184 


AJ000260 


tuf 


20 


Bacillus subtilis 


168 


D64127 


tuf 




Bacillus subtilis 


168 


D64127 


tuf (EF-G) 




Bacillus subtilis 


DSM10 


Z99104 


tuf 




Bacillus subtilis 


DSM10 


Z99104 


tuf (EF-G) 




Bacteroides forsythus 


ATCC 43037 


AB035466 


tuf 


25 


Bacteroides fragilis 


DSM 1151 


_i 


tuf 




Bordetella bronchiseptica 


RB50 


Genome project 2 


tuf 




Bordetella pertussis 


Tohama 1 


Genome project 2 


tuf 




Bordetella pertussis 


Tohama 1 


Genome project 2 


tuf (EF-G) 




Borrelia burdorgferi 


B31 


U78193 


tuf 


30 


Borrelia burgdorferi 




AE001155 


tuf (EF-G) 




Brevibacterium linens 


DSM 20425 


X76863 


tuf 




Buchnera aphidicola 


Ap 


Y12307 


tuf 




Burkholderia pseudomallei 


K96243 


Genome project 2 


tuf (EF-G) 




Campylobacter jejuni 


NCTC 11168 


Y17167 


tuf 


35 


Campylobacter jejuni 


NCTC 11168 


CJ11168X2 


tuf (EF-G) 




Chlamydia pneumoniae 


CWL029 


AE001592 


tuf 




Chlamydia pneumoniae 


CWL029 


AE001639 


tuf (EF-G) 




Chlamydia trachomatis 




M74221 


tuf 




Chlamydia trachomatis 


D/UW-3/CX 


AE001317 


tuf (EF-G) 


40 


Chlamydia trachomatis 


D/UW-3/CX 


AE001305 


tuf 




Chlamydia trachomatis 


F/IC-Cal-13 


L22216 


tuf 




Chlorobium vibrioforme 


DSM 263 


X77033 


tuf 




Chloroflexus aurantiacus 


DSM 636 


X76865 


tuf 




Clostridium acetobutylicum 


ATCC 824 


Genome project 2 


tuf 


45 


Clostridium difficile 


630 


Genome project 2 


tuf 




Clostridium difficile 


630 


Genome project 2 


tuf (EF-G) 




Corynebacterium diphtheriae 


NCTC 13129 


Genome project 2 


tuf 




Corynebacterium diphtheriae 


NCTC 13129 


Genome project 2 


tuf (EF-G) 




Corynebacterium glutamicum 


AS019 


X77034 


tuf 


50 


Corynebacterium glutamicum 


MJ-233 


E09634 


tuf 




Coxiella burnetii 


Nine Mile phase I 


AF1 36604 


tuf 




Cytophaga lytica 


DSM 2039 


X77035 


tuf 




Deinococcus radiodurans 


R1 


AE001891 


tuf (EF-G) 




Deinococcus radiodurans 


R1 


AE1 80092 


tuf 
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Table Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 



Species Strain Accession number Coding gene* 





Deinococcus radiodurans 


R1 


AE002041 


tuf 




Deinonema sp. 






tuf 




_EikeneJla corrodens 


ATCC 23834 


Z12610 


tuf 




Eikenella corrodens 


ATCC 23834 


_ -212610 


firf(EF-G) 


5 


Enterococcus faecalis 




Genome project 2 


tuf (EF-G) 




Escherichia coli 




J01690 


tuf 




Escherichia coli 




J01717 


tuf 




Escherichia coli 




X00415 


tuf (EF-G) 




Escherichia coli 




X57091 


tuf 


10 


Escherichia coli 


K-12 MG1655 


U00006 


tuf 




Escherichia coli 


K-12 MG1 655 


U00096 


tuf 




Escherichia coli 


K-12 MG1655 


AE000410 


tuf (EF-G) 




Fervidobacterium islandicum 


DSM 5733 


Y15788 


tuf 




Fibrobacter succinogenes 


S85 


X76866 


tuf 


15 


Flavobacterium fem'geneum 


DSM 13524 


X76867 


tuf 




Flexistipes sinusarabici 




X59461 


tuf 




Gloeobacter violaceus 


PCC 7421 


U09433 


tuf 




Gloeothece sp. 


PCC 6501 


U09434 


tuf 




Haemophilus actinomycetemcomitans 


HK1651 


Genome project 2 


tuf 


20 


Haemophilus ducreyi 


35000 


AF087414 


tuf (EF-G) 




Haemophilus influenzae 


Rd 


U32739 


tuf 




Haemophilus influenzae 


Rd 


U32746 


tuf 




Haemophilus influenzae 


Rd 


U32739 


tuf (EF-G) 




Helicobacter pylori 


26695 


AE000511 


tuf 


25 


Helicobacter pylori 


J99 


AE001539 


tuf {EF-G) 




Helicobacter pylori 


J99 


AE001541 


tuf 




Herpetosiphon aurantiacus 


Hpgal 


X76868 


tuf 




Klebsiella pneumoniae 


M6H 78578 


Genome project 2 


tuf 




Klebsiella pneumoniae 


M6H 78578 


Genome project 2 


tuf (EF-G) 


30 


Lactobacillus paracasei 




E13922 


tuf 




Legionella pneumophila 


Philadelphia-1 


Genome project 2 


tuf 




Leptospira interrogans 




AF1 15283 


tuf 




Leptospira interrogans 




AF1 15283 


tuf (EF-G) 




Micrococcus luteus 


IFO 3333 


M17788 


tuf (EF-G) 


35 


Micrococcus luteus 


IFO 3333 


M17788 


tuf 




Moraxella sp. 


TAC II 25 


AJ249258 


tuf 




Mycobacterium avium 


104 


Genome project 2 


tuf 




Mycobacterium avium 


104 


Genome project 2 


tuf (EF-G) 




Mycobacterium bovis 


AF2122/97 


Genome project 2 


tuf 


40 


Mycobacterium bovis 


AF21 22/97 


Genome project 2 


tut (EF-G) 




Mycobacterium leprae 




L13276 


tuf 




Mycobacterium leprae 




Z14314 


tuf 




Mycobacterium leprae 




Z14314 


tuf (EF-G) 




Mycobacterium leprae 


Thai 53 


D13869 


tuf 


45 


Mycobacterium tuberculosis 


Erdmann 


S40925 


tuf 




Mycobacterium tuberculosis 


H37Rv 


AL021943 


tuf (EF-G) 




Mycobacterium tuberculosis 


H37Rv 


Z84395 


tuf 




Mycobacterium tuberculosis 


y42 


AD000005 


tuf 




Mycobacterium tuberculosis 


CSU#93 


Genome project 2 


tuf 


50 


Mycobacterium tuberculosis 


CSU#93 


Genome project 2 


tuf (EF-G) 




Mycoplasma capricolum 


PG-31 


X16462 


tuf 




Mycoplasma genitalium 


G37 


U39732 


tuf 




Mycoplasma genitalium 


G37 


U39689 


tuf (EF-G) 




Mycoplasma hominis 




X57136 


tuf 


55 


Mycoplasma hominis 


PG21 


M57675 


tuf 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 





Species 


Strain 


Accession number 


Coding gene* 




Mycoplasma pneumoniae 


M129 


AE000019 


tuf 




Mycoplasma pneumoniae 


M129 


AE000058 


tuf (EF-G) 




Neisseria gonorrhoeae 


MS11 


L36380 


tuf 




— Neisseria-gonorrhoeae 


- MS1-1 


- - L36380 


- tuf(EF-Q) 


5 


Neisseria meningitidis 


Z2491 


Genome projecr 


fu/(EF-G) 




Neisseria meningitidis 


Z2491 


Genome project 2 


tuf 




Pasteurella muttocida 


Pm70 


Genome project 2 


tuf 




Peptococcus niger 


DSM 20745 


X76869 


tuf 




Phormidium ectocarpi 


PCC 7375 


U09443 


tuf 


10 


Planobispora rosea 


ATCC 53773 


U67308 


tuf 




Planobispora rosea 


ATCC 53733 


X98830 


tuf 




Planobispora rosea 


ATCC 53733 


X98830 


tuf (EF-G) 




Plectonema boryanum 


PCC 73110 


U09444 


tuf 




Porphyromonas gingivalis 


W83 


Genome projecr 


tuf 


15 


Porphyromonas gingivalis 


W83 


Genome projecr 


tuf (EF-G) 




Porphyromonas gingivalis 


FDC 381 


AB035461 


tuf 




Porphyromonas gingivalis 


W83 


AB035462 


tuf 




Porphyromonas gingivalis 


SUNY 1021 


AB035463 


tuf 




Porphyromonas gingivalis 


A7A1-28 


AB035464 


tuf 


20 


Porphyromonas gingivalis 


ATCC 33277 


AB035465 


tuf 




Porphyromonas gingivalis 


ATCC 33277 


AB035471 


tuf(EF-G) 




Prochlorothrix hollandica 




U09445 


tuf 




Pseudomonas aeruginosa 


PAO-1 


Genome project 2 


tuf 




Pseudomonas putida 




Genome projecr 


tuf 


25 


Rickettsia prowazekii 


Madrid E 


AJ235272 


tuf 




Rickettsia prowazekii 


Madrid E 


AJ235270 


tuf (EF-G) 




Rickettsia prowazekii 


Madrid E 


Z54171 


fu/(EF-G) 




Salmonella choleraesuis subsp. 








choleraesuis serotype Typhimurium 




X64591 


firf(EF-G) 


30 


Salmonella choleraesuis subsp. 










choleraesuis serotype Typhimurium 


LT2 trpE91 


X55116 


tuf 




Salmonella choleraesuis subsp. 










choleraesuis serotype Typhimurium 


LT2trpE91 


X55117 


tuf 




Serpulina hyodysenteriae 


B204 


U51635 


tuf 


35 


Serratia marcescens 




AF058451 


tuf 




Shewanella putrefaciens 


DSM 50426 




tuf 




Shewanella putrefaciens 


MR-1 


Genome projecr 


tuf 




Spirochaeta aurantia 


DSM 1902 


X76874 


tuf 




Staphylococcus aureus 




AJ237696 


tuf (EF-G) 


40 


Staphylococcus aureus 


EMRSA-16 


Genome project 2 


tuf 




Staphylococcus aureus 


NCTC 8325 


Genome project 2 


tuf 




Staphylococcus aureus 


COL 


Genome project 2 


tuf 




Staphylococcus aureus 


EMRSA-16 


Genome project 2 


tuf (EF-G) 




Stigmatella aurantiaca 


DW4 


X82820 


tuf 


45 


Stigmatella aurantiaca 


Sga1 


X76870 


tuf 




Streptococcus mutans 


GS-5 Kuramitsu 


U75481 


tuf 




Streptococcus mutans 


1 |AD4 eft 

UAB159 


Genome project 2 


A. .£ 

tuf 




Streptococcus oralis 


NTCC11427 


P331701 


tuf 




Streptococcus pyogenes 




Genome project 2 


tuf (EF-G) 


50 


Streptococcus pyogenes 


M1-GAS 


Genome project 2 


tuf 




Streptomyces aureofaciens 


ATCC 10762 


AF007125 


tuf 




Streptomyces cinnamoneus 


Tue89 


X98831 


tuf 




Streptomyces coelicolor 


A3(2) 


AL031013 


tuf (EF-G) 




Streptomyces coelicolor 


A3(2) 


X77039 


tuf (EF-G) 


55 


Streptomyces coelicolor 


M145 


X77039 


tuf 
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Table 11. Microbial species for which tuf and/ r atpD and/or recA sequences are available In 
public databases (continued). 





Species 


Strain 


Accession number 


Coding gene* 




Streptomyces collmus 


BSM 40733 


o7940o 


A .X 

tuf 




Streptomyces netropsis 


IU1063 


Ar153618 


tuf 




- -Streptomyces jamocissimus 




VC7ACT 

X67057 


tuf 




Streptomyces ramocissfmus 


— 


— X67058_ 


_ tuf 


5 


Streptomyces ramocissimus 




X67057 


tuf (EF : G)~ 




Synechococcus sp. 


PCC 6301 


V/ -4 "T A A f\ 

X17442 


fu/(EF-G) 




Synechococcus sp. 


t^S. A**% A A A ^ 

PCC 6301 


X17442 


tuf 




Synechocystis sp. 


PCC 6803 


D90913 


tuf (EF-G) 




Synechocystis sp. 


PCC 6803 


D90913 


tuf 


10 


Synechocystis sp. 


PCC 6803 


X65159 


tuf (EF-G) 




Taxeobacter occealus 


Myx2105 


X77036 


tuf 




Thermotoga maritima 


Genome project 2 


tuf (EF-G) 




Thermotoga maritima 




M27479 


/u/ 




Thermus aquaticus 


EP 00276 


X66322 


/a/ 


15 


Thermus thermophilus 


HB8 


X 16278 


fitf(EF-G) 




Thermus thermophilus 


HB8 


X05977 


tuf 




Thermus thermophilus 


HB8 


X06657 


tuf 




Thtomonas cuprtna 


DSM 5495 


U78300 


tuf 




Thiomonas cupnna 


DSM 5495 


U78300 


tuf {EF-G) 


20 


Thiomonas cuprina 


Hoe5 


X76871 


tuf 




Treponema denticola 




Genome projecr 


tuf 




Treponema denticola 




Genome projecr 


tuf (EF-G) 




Treponema pallidum 




AE001202 


to/ 




Treponema pallidum 




AE001222 


to/ (EF-G) 


25 


Treponema pallidum 




AE001248 


to/ (EF-G) 




Ureaplasma urealyticum 


ATCC 33697 


234275 


tuf 




Ureaplasma urealyticum 


serovar 3 biovar 1 


AE002151 


tuf 




Ureaplasma urealyticum 


serovar 3 biovar 1 


AE002151 


tuf (EF-G) 




Vibrio cholerae 


N16961 


Genome project 2 


to/ 


30 


Wolinella succinogenes 


DSM 1740 


X76872 


tuf 




Yersinia pestis 


CO-92 


Genome projecr 


tuf 




Yersinia pestis 


CO-92 


Genome project 2 


to/ (EF-G) 


35 


Archaebacteria 










Archaeoglobus fulgidus 




Genome project 2 


to/ (EF-G) 




Halobacterium marismortui 




X16677 


to/ 




Methanobacterium thermoautrophicum 


delta H 


AE000877 


tuf 


40 


Methanococcus jannaschii 


ATCC 43067 


U67486 


tuf 




Methanococcus vannielii 




X05698 


tuf 




Pyrococcus abyssi 


Orsay 


AJ248285 


tuf 




Thermoplasma acidophilum 


DSM 1728 


X53866 


tuf 



45 



50 



55 



Fungi 

Absidia glauca 
Arxula adeninivorans 
Aspergillus oryzae 
Aureobasidium pullulans 
Candida albicans 
Candida albicans 
Candida albicans 
Cryptococcus neoformans 



CBS 101.48 
Ls3 

KBN616 

R106 

SC5314 

SC5314 

SC5314 

B3501 



X54730 
Z47379 
AB007770 
U19723 

Genome project 2 
M29934 
M29935 
U81803 



'uf(EF-l) 
tuf (EF-1) 
to/ (EF-1) 
<lrf(EF-1) 
tuf{M) 
fi/f(EF-1) 
fuf(EF-1) 
tuf (EF-1) 
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Table 11. 



Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



Cryptococcus neoformans 


M1-106 


U81804 


Eremothecium gossypii 


ATCC 10895 


X73978 


Eremothecium gossypii 




A29820 


Fusarium oxysporum 


NRRL -26037- - 


- -AF008498 


5 Histoplasma capsulatum 


186AS 


U14100 


Podospora anserina 




X74799 


Podospora curvicolla 


VLV 


X96614 


Prototheca wickerhamii 


263-11 


AJ245645 


Puccinia graminis 


race 32 


X73529 


10 Reclinomonas americana 


ATCC 50394 ' 


AF007261 


Rhizomucor racemosus 


ATCC 1216B 


X17475 


Rhizomucor racemosus 


ATCC 1216B 


J02605 


Rhizomucor racemosus 


ATCC 1216B 


X17476 


Rhodotorula mucilaginosa 




AF016239 


15 Saccharomyces cerevisiae 




K00428 


Saccharomyces cerevisiae 




M59369 


Saccharomyces cerevisiae 




X00779 


Saccharomyces cerevisiae 




X01638 


Saccharomyces cerevisiae 




M10992 


20 Saccharomyces cerevisiae 


Alpha S288 


X78993 


Saccharomyces cerevisiae 




M15666 


Saccharomyces cerevisiae 




Z35987 


Saccharomyces cerevisiae 


S288C (AB972) 


U51033 


Schizophyllum commune 


1-40 


X94913 


25 Schizosaccharomyces pombe 


972h- 


AL021816 


Schizosaccharomyces pombe 


972h- 


AL021813 


Schizosaccharomyces pombe 


972h- 


D82571 


Schizosaccharomyces pombe 




U42189 


Schizosaccharomyces pombe 


PR745 


D89112 


30 Sordaria macrospora 


OOO 


X96615 


Trichoderma reesei 


QM9414 


223012 


Yarrowia lipolytics 




AF054510 



fuf(EF-1) 
fi/^EF-1) 
firf(EF-l) 
_ftrf(E_F-1)_ 
ftrf(EF-l) 
fl/f(EF-1) 
ru/(EF-1) 
tuf (EF-1) 
M(EF-1) 

ft// (EF-1) 
tuf (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/(M) 
to/(EF-G) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 
to/ (EF-1) 

rwf (EF-1) 

^ (EF-1) 
to/ (EF-1) 
tuf (EF-1) 
to/ (EF-1) 
tuf (EF-1) 



35 Parasites 



Blastocysts hominis 
Cryptosporidium parvum 
Eimeria tenella 

40 Entamoeba histolytica 
Entamoeba histolytica 
Giardia lamblia 
Kentrophoros sp. 
Leishmania amazonensis 

45 Leishmania braziliensis 
Onchocerca volvulus 
Porphyra purpurea 
Plasmodium berghei 
Plasmodium falcipamm 

50 Plasmodium knowlesi 
Toxoplasma gondii 
Trichomonas tenax 
Trypanosoma brucei 

55 Trypanosoma cnjzi 



HE87-1 


064080 


tuf (EF-1) 




U69697 


tuf (EF-1) 


LS18 


AI755521 


to/(EF-1) 


HM1:IMSS 


X83565 


fur" (EF-1) 


NIH 200 


M92073 


fur" (EF-1) 




D14342 


to/ (EF-1) 




AF056101 


to/(EF-1) 


IFLA/BR/67/PH8 


M92653 


tuf (EF-1) 




U72244 


to/ (EF-1) 




M64333 


to/ (EF-1) 


Avonport 


U08844 


to/ (EF-1) 


ANKA 


AJ224150 


to/ (EF-1) 


K1 


X60488 


to/ (EF-1) 


lineH 


AJ224153 


to/ (EF-1) 


RH 


Y11431 


to/ (EF-1) 


ATCC 30207 


D78479 


to/ (EF-1) 


LVH/75/ 


U 10562 


to/ (EF-1) 


USAMRU-K/18 






Y 


L76077 


to/(EF-1) 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequ nces are available In 
public databases (continued). 



Species Strain Accession number Coding gene* 



Human and plants 



10 



Arabiddpsislfialiana 
Glycine max 
Glycine max 
Glycine max 
Glycine max 
Homo sapiens 
Pyramimonas disomata 



Columbia — 
Ceresia 
Ceresia 
Ceresia 
Maple Arrow 



-X89227^ _ 
X89058 
Y15107 
Y15108 
X66062 
X03558 
AB008010 



__ tuf (EM) 

toffEM)" 

luf (EM) 

/uf(EM) 

fu/(EM) 

fu/(EM) 

tuf 



atpD sequences 

15 Bacteria 





nbcluUutiCi'U/ii rrWUI 


i»/sJIVl 1 Uuw 




aifJLs 




Artinnharilli /c fiMinnmurof/^mmmitftn^ 

raisin t \JUG\*it \UO aLrlli lUl i %j\0\3\%5lt iwWI / tliaiio 


HK1651 




alfJL/ 




Rati Mug snthraric 


Amfic 








DaUIHUo lllFJIUo 




MR01 1 7 


ntnD 




DaLIIIUo lll&yalol lUlll 






aifJLs 




Rsif^HIi ic ct&amfh&rmnnhilnc 
DaUiltuo oiuaiUll lt>! (1 lUfJI illUo 




L/OOUJO 


aijJLs 




RArilh jc cfpfl rf)ih&rmnnhifi /c 

UaUHUC OlCaf Uli /t?» f IIKJUt IIIUD 


irw I 




aifJLs 




Barillas ciihf/Jfe 


168 


228592 


stoD 


25 


Bacteroides fraoilis 


DSM 2151 


M22247 


atpD 




Bordetella bronchiseptica 


RB50 


Genome project 2 


atpD 




Bordetella oertussis 


Tohama 1 


Genome oroiecr 


atpD 




Borrelia burgdorferi 


B31 


AE001 122 


atpD(Y) 




Burkholderia cepacia 


DSM50181 


X76877 


atpD 


30 


Burkholderia pseudomallei 


K96243 


Genome project 2 


atpD 




Campylobacter jejuni 


NCTC 11168 


CJ1 1168X1 


atpD 




Chlamydia pneumoniae 




Genome project 2 


atpD(V) 




Chlamydia trachomatis 


MoPn 


Genome project 2 


atpD(V) 




Chlorobium vibrioforme 


DSM 263 


X76873 


atpD 


35 


Citrobacter freundii 


JEO503 


AF037156 


atpD 




Clostridium acetobutylicum 


ATCC 824 


Genome project 2 


atpD 




Clostridium acetobutylicum 


DSM 792 


AF101055 


atpD 




Clostridium difficile 


630 


Genome project 2 


atpD 




Corynebacterium diphtheriae 


NCTC13129 


Genome project 2 


atpD 


40 


Corynebacterium glutamicum 


AS019 


X76875 


atpD 




Corynebacterium glutamicum 


MJ-233 


E09634 


atpD 




Cytophaga lytica 


DSM 2039 


M22535 


atpD 




Enterobacter aerogenes 


DSM 30053 


3 


atpD 




Enterococcus faecalis 


V583 


Genome project 2 


atpD(V) 


45 


Enterococcus hirae 




M90060 


atpD 




Enterococcus hirae 


ATCC 9790 


D17462 


atpD(V) 




Escherichia coli 




J01594 


atpD 




Escherichia coli 




M25464 


atpD 




Escherichia coli 




V00267 


atpD 


50 


Escherichia coli 




V00311 


atpD 




Escherichia coli 


K12MG1655 


L10328 


atpD 




Flavobacterium ferrugineum 


DSM 13524 


3 


atpD 




Haemophilus actinomycetemcomitans 




Genome project 2 


atpD 




Haemophilus influenzae 


Rd 


U32730 


atpD 


55 


Helicobacter pylori 


NCTC 11638 


AF004014 


atpD 
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Table 11. Microbial species for which tut and/or atpD and/or recA sequences are available In 
public databases (continued). 



Species Strain Accession number Coding gene* 





Helicobacter pylon 


26695 


Genome project 2 


atpD 




Helicobacter pylori 


J99 


Genome project 2 


atpD 




Klebsiella \ pneumoniae 


M6H 78578 


Genome project 2 


atpD 




Lactobacillus casei 


DSM 20021- - 


X64542 


atpD 


5 


Legionella pneumophila 


Philadelphia-1 


Genome project 2 


atpD 




Moorella thermoacetica 


ATCC 39073 


U64318 


atpD 




Mycobacterium avium 


104 


Genome project 2 


atpD 




Mycobacterium bovis 


AF21 22/97 


Genome project 2 


atpD 




Mycobacterium leprae 




U15186 


atpD 


10 


Mycobacterium leprae 




Genome project 2 


atoD 




Mycobacterium tuberculosis 


H37Rv 


273419 


atpD 




Mycobacterium tuberculosis 


CSU#93 


Genome project 2 


atpD 




Mycoplasma gatlisepticum 




X64256 


atpD 




Mycoplasma genitalium 


G37 


U39725 


atpD 


15 


Mycoplasma pneumoniae 


M129 


U43738 


atpD 




Neisseria gonorrhoeae 


FA 1090 


Genome project 2 


atpD 




Neisseria meningitidis 


Z2491 


Genome project 2 


atoD 




Pasteurella multocida 


Pm70 


Genome project 2 


atpD 




Pectinatus frisingensis 


DSM 20465 


X64543 


atpD 


20 


Peptococcus niger 


DSM 20475 


X76878 


atpD 




Pirellula marina 


IFAM 1313 


X57204 


atpD 




Porphyromonas gingivalis 


W83 


Genome project 2 


atpD (V) 




Propionigenium modestum 


DSM 2376 


X58461 


atpD 




Pseudomonas aeruginosa 


PA01 


Genome project 2 


atpD 


25 


Pseudomonas putida 




Genome project 2 


atpD 




Rhodobacter capsulatus 


B100 


X99599 


atpD 




Rhodospirillum rubrum 




X02499 


atpD 




Rickettsia prowazekii 


F-12 


AF036246 


atpD 




Rickettsia prowazekii 


Madrid 


Genome project 2 


atpD 


30 


Ruminococcus albus 


7ATCC 


AB006151 


atpD 




Salmonella bongori 


JE04162 


AF037155 


atoD 




Salmonella bongori 


BR1859 


AF037154 


atpD 




Salmonella choleraesuis 


S83769 


AF037146 


atoD 

a iff 




subsp. arizonae 






35 


Salmonella choleraesuis 


u24 


AF037147 


atpD 




subsp. arizonae 








Salmonella choleraesuis subsp. 


K228 


AF037140 


atpD 




choleraesuis serotype Dublin 








Salmonella choleraesuis subsp. 


K771 


AF037139 


atpD 


40 


choleraesuis serotype Dublin 








Salmonella choleraesuis subsp. 


Div36-86 


AF037142 


atpD 




choleraesuis serotype Infantis 








Salmonella choleraesuis subsp. 


Div95-86 


AF037143 


atpD 




choleraesuis serotype Tennessee 






45 


Salmonella choleraesuis subsp. 


LT2 


AF037141 


atpD 




choleraesuis serotype Typhimurium 








Salmonella choleraesuis 


DS210/89 


AF037149 


atpD 




subsp. diarizonae 








Salmonella choleraesuis 


JEO307 


AF037148 


atpD 


50 


subsp. diarizonae 








Salmonella choleraesuis 


S1 09671 


AF037150 


atpD 




subsp. diarizonae 










Salmonella choleraesuis 


S84366 


AF037151 


atpD 




subsp. houtenae 






55 


Salmonella choleraesuis 


S84098 


AF037152 


atpD 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



subsp. houtenae 
Salmonella choleraesuis 
subsp. indica 

Sal^nellaj:holeraesuis_ _ 

5 subsp. salamae 

Salmonella choleraesuis 
subsp. salamae 
Shewanella putrefaciens 
Staphylococcus aureus 

10 Stigmatella aurantiaca 
Streptococcus bovis 
Streptococcus mutans 
Streptococcus mutans 
Streptococcus pneumoniae 

15 Streptococcus pneumoniae 
Streptococcus pyogenes 
Streptococcus pyogenes 
Streptococcus sanguinis 
Streptomyces lividans 

20 Thermus thermophilus 
Thiobacillus ferrooxidans 
Treponema pallidum 
Vibrio alginolyticus 
Vibrio cholerae 

25 Wolinella succinogenes 
Yersinia enterocolitica 
Yersinia pestis 



BR2047 

NSC72- 

S1 14655 

MR-1 
COL 
Sga1 
JB-1 
, GS-5 
UAB159 
Type 4 
Type 4 
M1-GAS 
M1-GAS 
10904 
1326 
HB8 

ATCC 33020 
Nichols 

N16961 
DSM 1470 
NCTC 10460 
CO-92 



AF037153 afpO 
-AF037-144 atpD 



AF037145 

Genome project 2 
Genome project 2 
X76879 
AB009314 
U31170 

Genome project 2 

Genome project 2 

Genome project 2 

Genome project 2 

Genome project 2 

AF001955 

Z22606 

D63799 

M81087 

AE001228 

X16050 

Genome project 2 
X76880 
AF037157 
Genome project 2 



atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

afpD(V) 

atpD 

atpD(V) 

atpD 

atpD 

atpD 

atpD (V) 

atpD 

atpD(V) 

atpD 

atpD 

atpD 

atpD 

atpD 



30 Archaebacteria 

Archaeoglobus fulgidus 
Halobacterium salinarum 
Haloferax volcanii 
35 Methanococcus jannaschii 
Methanosarcina barken 



DSM 4304 

WR 340 
DSM 2661 
DSM 800 



AE001023 

S56356 

X79516 

U67477 

J04836 



afpD(V) 

atpD{V) 

atpD 

atpD(V) 

atpD(V) 



40 



45 



50 



55 



Fungi 

Candida albicans 
Candida tropicalis 
Kluyveromyces lactis 
Neurospora crassa 
Saccharomyces cerevisiae 
Saccharomyces cerevisiae 
Schizosaccharomyces pombe 
Schizosaccharomyces pombe 



Parasites 

Giardia lamblia 
Plasmodium falciparum 
Trypanosoma congolense 



SC5314 
2359/152 



X2180-1A 
972 h- 
972 h- 



WB 
3D7 
IL3000 



Genome project 2 

M64984 

U37764 

X53720 

M12082 

J05409 

S47814 

M57956 



U18938 
L08200 
Z25814 



atpD 
atpD (V) 
atpD 
atpD 
atpD 
atpD (V) 
afpD(V) 
atpD 



atpD 
atpD (V) 
atpD(V) 



222 



WO 01/23604 



PCT/CAOO/01150 



Table 11. Microbial sped s for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 



Species Strain Accession number Coding gene* 



Human and plants 



5 Homo sapiens L09234 afpD(V) 

Homo sapiens M27132 atpD 



recA sequences 

10 

Bacteria 





Acetobacter aceti 


no. 1023 


S60630 


recA 




Acetobacter altoacetigenes 


MH-24 


E05290 


recA 


15 


Acetobacter polyoxogenes 


NBI1028 


D13183 


recA 




Acholeplasma laidlawii 


8195 


M81465 


recA 




Acidiphilium facilis 


ATCC 35904 


D16538 


recA 




Acidothermus celiulolyticus 


ATCC 43068 


AJ006705 


recA 




Acinetobacter calcoaceticus 


BD413/ADP1 


L26100 


recA 


20 


Actinobacillus actinomycetemcomitans 


HK1651 


Genome project 2 


recA 




Aeromonas salmonicida 


A449 


U83688 


recA 




Agrobacterium tumefaciens 


C58 


L07902 


recA 




Allochromatium vinosum 




AJ000677 


recA 




Aquifex aeolicus 


VF5 


AE000775 


recA 


25 


Aquifex pyrophilus 


Kol5a 


L23135 


recA 




Azotobacter vinelandii 




S96898 


recA 




Bacillus stearothermophilus 


10 


Genome project 2 


recA 




Bacillus subtilis 


PB1831 


U87792 


recA 




Bacillus subtilis 


168 


Z99112 


recA 


30 


Bacteroides fragilis 




M63029 


recA 




Bifidobacterium breve 


NCFB 2258 


AF094756 


recA 




Blastochloris viridis 


DSM 133 


AF022175 


recA 




Bordetella pertussis 


165 


X53457 


recA 




Bordetelia pertussis 


Tohama 1 


Genome project 2 


recA 


35 


Borrelia burgdorferi 


Sh-2-82 


U23457 


recA 




Borrelia burgdorferi 


B31 


AE001124 


recA 




Brevibacterium flavum 


MJ-233 


E10390 


recA 




Brucella abortus 


2308 


L00679 


recA 




Burkholderia cepacia 


ATCC 17616 


U70431 


recA 


40 


Burkholderia cepacia 




D90120 


recA 




Burkholderia pseudomallei 


K96243 


Genome project 2 


recA 




Campylobacter fetus subsp. fetus 


23D 


AF020677 


recA 




Campylobacter jejuni 


8M76 


U03121 


recA 




Campylobacter jejuni 


NCTC 11168 


AL1 39079 


recA 


45 


Chlamydia trachomatis 


L2 


U16739 


recA 




Chlamydia trachomatis 


D/UW-3/CX 


AE001335 


recA 




Chlamydophila pneumoniae 


CWL029 


AE001658 


recA 




Chloroflexus aurantiacus 


J-10-fl 


AF037259 


recA 




Clostridium acetobutylicum 




M94057 


recA 


50 


Clostridium perfringens 


13 


U61497 


recA 




Corynebacterium diphtheriae 


NCTC13129 


Genome project 2 


recA 




Corynebacterium glutamicum 


AS019 


U14965 


recA 




Corynebacterium pseudotuberculosis 


C231 


U30387 


recA 




Deinococcus radiodurans 


KD8301 


AB005471 


recA 


55 


Deinococcus radiodurans 


R1 


U01876 


recA 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 





opecies 


Strain 


Accession numDer 


Coding gene* 




Enterobacter agglomerans 


339 


L03291 


recA 




Enterococcus faecalis 


OGIX 


M81466 


recA 




Erwinia carotovora 




X55554 


recA 





—Escherichia coli 





J01672 


— recA — 


5 


Escherichia coli 




X55552 


recA 




Escherichia coli 


K-12 


AE000354 


recA 




Frankia alni 


Arl3 


AJ006707 


recA 




Gluconobacter oxydans 




U21001 


recA 




Haemophilus influenzae 


Rd 


U32687 


recA 


10 


Haemophilus influenzae 


Rd 


U32741 


recA 




Haemophilus influenzae 


Rd 


L07529 


recA 




Helicobacter pylori 


69A 


235478 


recA 




Helicobacter pylori 


26695 


AE000536 


recA 




Helicobacter pylori 


J99 


AE001453 


recA 


15 


Klebsiella pneumoniae 


M6H 78578 


Genome project 2 


recA 




Lactococcus lactis 


ML3 


M88106 


recA 




Legionella pneumophila 




X55453 


recA 




Leptospira biflexa 


serovar patoc 


U32625 


recA 




Leptospira interrogans 


serovar pomona 


U29169 


recA 


20 


Magnetospirillum magnetotacticum 


MS-1 


X17371 


recA 




Methylobacillus flagelfatus 


MFK1 


M35325 


recA 




Methylomonas clara 


ATCC 31226 


X59514 


recA 




Mycobacterium avium 


104 


Genome project 2 


recA 




Mycobacterium bovis 


AF 122/97 


Genome project 2 


recA 


25 


Mycobacterium leprae 




X73822 


recA 




Mycobacterium tuberculosis 


H37Rv 


X58485 


recA 




Mycobacterium tuberculosis 


CSU#93 


Genome project 2 


recA 




Mycoplasma genitalium 


G37 


U39717 


recA 




Mycoplasma mycoides 


GM9 


L22073 


recA 


30 


Mycoplasma pneumoniae 


ATCC 29342 


MPAE000033 


recA 




Mycoplasma pulmonis 


KD735 


L22074 


recA 




Myxococcus xanthus 




L40368 


recA 




Myxococcus xanthus 




L40367 


recA 




Neisseria animalis 


NCTC 10212 


U57910 


recA 


35 


Neisseria cinerea 


LCDC 81-176 


AJ223869 


recA 




Neisseria cinerea 


LNP 1646 


U57906 


recA 




Neisseria cinerea 


NCTC 10294 


AJ223871 


recA 




Neisseria cinerea 


Vedros M601 


AJ223870 


recA 




Neisseria elongata 


CCUG 2131 


AJ223882 


recA 


40 


Neisseria elongata 


CCUG 4165A 


AJ223880 


recA 




Neisseria elongata 


NCTC 10660 


AJ223881 


recA 




Neisseria elongata 


NCTC 11050 


AJ223878 


recA 




Neisseria elongata 


NHITCC2376 


AJ223877 


recA 




Neisseria elongata 


CCUG 4557 


AJ223879 


recA 


45 


subsp. intermedia 










Neisseria flava 


Bangor 9 


AJ223873 


recA 




Neisseria flavescens 


LNP 444 


U57907 


recA 




Neisseria gonorrhoeae 


CH95 


U57902 


recA 




Neisseria gonorrhoeae 


FA19 


X64842 


recA 


50 


Neisseria gonorrhoeae 


MS11 


X17374 


recA 




Neisseria gonorrhoeae 




Genome project 2 


recA 




Neisseria lactamica 


CCUC 7757 


AJ223866 


recA 




Neisseria lactamica 


CCUG 7852 


Y11819 


recA 




Neisseria lactamica 


LCDC 77-143 


Y11818 


recA 


55 


Neisseria lactamica 


LCDC 80-111 


AJ223864 


recA 
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Table 11. Microbial species for which tut and/or atpD and/or recA sequences are available In 
public databases (continued). 





Species 


Strain 


Accession number 


Coding 




Neisseria lactamica 


LCDC 845 


AJ223865 


recA 




Neisseria lactamica 


NCTC 10617 


U57905 


recA 




Neisseria lactamica 


NCTC 10618 


AJ 223863 


recA 




Neisseria meninoitidis 


44/46 


X64849 


^recA 


s 


Ato / e c a /fa men in nitidis 

IMCIOOCfl tO 1 I Id III iyilHJI& 


Ranaor 13 


AJ223868 


rerA 




Neisseria meninoitidis 

MCiJdbifa I f I wi 1 /i lyillLMIO 


HF116 


X64848 


recA 




Neisseria meninnitiriis 


HF130 


X64844 


recA 




Neisseria meninnitiriis 

/iwJ^Oci (a fffwf ill iMIffwfw 


HF46 


X64847 


ffiCA 

iGusy 




Neisseria meninnitiriis 

IVGtOOUl ra If fvf ill /Uiliwfw 


M470 


X64850 


ran A 
f C?w>n 


in 


Neisseria meninnitiriis 


N94II 


X 64846 


ran A 
fe?w/^ 




Neisseria tneninnitiriiQ 

j foJoaoi id 1 1 ici m lyiiiuio 




A 1223867 


ran A 




Neisseria meninnitiriis 

livfwwcllQ 1 1 ICI III lyi ll\JIO 


P63 


X64845 


rerA 




Neisseria meninnitiriis 


' S3446 


U57903 


recA 




Neisseria meninnitiriis 

1 V Vl Jwu' IB 1 1 ICI HI lyillSJIO 


FAM18 


Genome nroiecr 


recA 


i< 

i j 


Neisseria mucosa 


LNP405 


U57908 


recA 




Neisseria mucosa 

IVvlwWwJ/fl 1 1 ILl\m> ww CI 


Vedros M1801 

VUUIU3 IVI 1 ww 1 


AJ223875 






Neisseria nerflai/a 


UUUU i / w 1 wl 


AJ223876 


1 BW/l 




Neisseria narflai/a 
tvciooGi la fJOl I lava 


L CDC flS402 


AJ223862 


rerA 




A/pjccpn'fl nhan/nnis x/ar fiai/a 
iVGiootzi la fji lat yi lyio veil, iiava 


NCTC 4^90 


U57909 

WW/ www 


recA 




Neisseria nnlx/carrharea 
<Vt/ooDi fa fJUlyoctUUi lal Va 


CCLJG 180^1 

wvUU IOUOI 


Y11815 

1 1 1 w 1 w 


Ttkf*A 
1 PW1 




Neisseria nolvsarcharea 


CCUG 24845 


Y11816 






tWl90GI /a pUty$cxL>L,l ial &d 


CCI IR P4ft4fi 


Y11814 

1 1 IO If 






hlGiGcaria nnli/sarrharea 


IN^ MA ^fiOft 


Y11817 






IVGI&oufia fJUiyoaL+0'i lai Oa 


NCTC 1 1ft5ft 


U 57904 

w J / WW*T 


rerA 




Neisseria sirra 


NRL 30016 


AJ223872 






Neisseria suhfiax/a 

/ic/joc/ /a ouuiiava 


NRL 30017 


AJ 223874 


tecA 




Pa ra mm ic Wo n itrifina n c 


DSM 413 


U59631 

WWWWW 1 


recA 




0a eta i 1 ratio m 1 1 \if\f*iria 
r d&lGUrtflia illUllUUiUa 




XQ9394 


rerA 




Pornhvromonas oinoivalis 

i \si }ji iy i \jn iksi iao yuiyivaiio 


W83 


U70054 

W 1 www~ 


recA 


^n 


Preuntetla rt iminimla 
i i CfVUidia i ui i hi //ut/'ci 


JCM 8958 


U61227 


recA 




Proteus mirabilis 

1 1 UlpUJ 1 1 iff ClWlflw 


DG1300 


X 14870 


recA 




Drrsfdl ic i/i ifria no 

rfviGUS Vuiyans 




yccccc 

AJJJJJ 


rerA 




Pseurinmnnas eeruninnsa 




X05691 


rgcA 




Psei irinmnnas aeruninnsa 


PAM 7 


X52261 


recA 


15 
jj 


Pseudomonas aeruninosa 


PA012 


D13090 


recA 




Pseurinmnnas flunresrens 

r jojuxsi i list iao i iuwi coi/C/ /o 


OE 28 3 


M96558 

IV 1 wwwww 


recA 




Pseudomonas Dutida 

1 vCUUvl * «WI luw piiKUD 




L 12684 


recA 




Pseurinmnnas nutiria 


PdS145 


U70864 

w 7 # ww w» 


recA 




DHiy/^hii im lam iminncan im 
ni iizuuiuin itfyuiiiifiuoafuiii 


vro9 


X59956 

yvwwwww 


rerA 




h invar i//V s /ao 










Rhi7nhium nhasenli 

ill II4.UUIUI II yl /fljpyH 


CNPAF512 


X62479 


recA 




Rhnrinharter canst tlati is 

III IVWVUBvlvl L/apOUiaiwO 


J50 


X82183 

/ v W w 1 WW 


recA 




Rhodobacter sohaeroides 


2.4.1 


X72705 


recA 




RhodoosGudomonas oalustris 

1 II ' WW vk/v WW W W» ■ « W» *fc«W l/BIWJKIw 


N7 


D84467 


recA 


4S 


Rickettsia nrowa2ekii 


Madrid E 


AJ235273 


recA 




Rickettsia nrowazekii 


Madrid E 


U01959 

w 1 Www 


recA 




Serratia marcescens 




M22935 


recA 




Shigella flexneri 




X55553 


recA 




Shigella sonnet 


KNIH104S 


AF101227 


recA 


50 


Sinorhizobium meliloti 


2011 


X59957 


recA 




Staphylococcus aureus 




L25893 


recA 




Streptococcus gordonii 


Challis V288 


L20574 


recA 




Streptococcus mutans 


UA96 


M81468 


recA 




Streptococcus mutans 


GS-5 


M61897 


recA 


55 


Streptococcus pneumoniae 




Z17307 


recA 
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Table 11. Microbial species for which tuf and/or atpD and/ r recA sequences are available In 
public databases (continued). 





Species 


Strain 


Accession number 


Coding gene* 




Streptococcus pneumoniae 


R800 


Z34303 


recA 




Streptococcus pyogenes 


NZ131 


U21934 


recA 




Streptococcus pyogenes 


D471 


M81469 


recA 





-Streptococcus salivarius 




M94062 


recA 


5 


subsp. thermophilus 










Streptomyces ambofaciens 


DSM 40697 


230324 


recA 




Streptomyces coelicolor 


A3(2) 


AL020958 


recA 




Streptomyces lividans 


TK24 


X76076 


recA 




Streptomyces rimosus 


R6 


X94233 


recA 


10 


Streptomyces venezuelae 


ATCC10712 


U04837 


recA 




Synechococcus sp. 


PR6 


M29495 


recA 




Synechocystis sp. 


PCC6803 


D90917 


recA 




Thermotoga maritima 




L23425 


recA 




Thermotoga maritima 




AE001823 


recA 


15 


Thermus aquaticus 




L20095 


recA 




Thermus thermophilus 


HB8 


D17392 


recA 




Thiobacillus ferrooxidans 




M26933 


recA 




Treponema denticoia 




Genome projecr 


recA 




Treponema paiiidum 


Nichols 


AE001243 


recA 


20 


Vibrio anguiiiarum 




M80525 


recA 




Vibrio cholerae 


017 


X71969 


recA 




Vibrio cholerae 


2740-80 


U10162 


recA 




Vibrio cholerae 


569B 


L42384 


recA 




Vibrio cholerae 


M549 


AF1 17881 


recA 


25 


Vibrio cholerae 


M553 


AF1 17882 


recA 




Vibrio cholerae 


M645 


AF1 17883 


recA 




Vibrio cholerae 


M793 


AF1 17878 


recA 




Vibrio cholerae 


M794 


AF1 17880 


recA 




Vibrio cholerae 


M967 


AF1 17879 


recA 


30 


Xanthomonas citri 


XW47 


AF006590 


recA 




Xanthomonas oryzae 




AF013600 


recA 




Xenorhabdus bovienii 


T228/1 


U87924 


recA 




Xenorhabdus nematophilus 


AN6 


AF1 27333 


recA 




Yersinia pestis 


231 


X75336 


recA 


35 


Yersinia pestis 


CO-92 


Genome projecr 


recA 




Fungi, parasites, human and plants 








40 


Anabaena variabilis 


ATCC 29413 


M29680 


recA 




Arabidopsis thaliana 




U43652 


rec>4(Rad51) 




Candida albicans 




U39808 


rec>4(Dmc1) 




Coprinus cinereus 


Okayama-7 


U21905 


recA (Rad51) 




Emericella nidulans 




Z80341 


recA (Rad51) 


45 


Gallus gallus 




L09655 


recA (Rad51) 




Homo sapiens 




A\ f\ f\f\ A 

D 13804 


recA (Rad51 ) 




Homo sapiens 






rec/\ (umci ) 




Leishmania major 


Friedlin 


AF062379 


rec>4(Rad51) 




Leishmania major 


Friedlin 


AF062380 


recA(Dmd) 


50 


Mus musculus 




D58419 


recA (Dmc1) 




Neurospora crassa 


74-OR23-1A 


D29638 


recA (Rad51) 




Saccharomyces cerevisiae 




D10023 


rec/4(Rad51) 




Schizosaccharomyces pombe 




Z22691 


rea4(Rad51) 




Schizosaccharomyces pombe 


972h- 


AL021817 


recA (Drnd) 


55 


Tetrahymena thermophila 


PB9R 


AF064516 


AfiCi4(Rad51) 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



Trypanosoma brucei 
Ustilago maydis 

Xenopus laevis 

Xenopus laevis 



stock 427 



Y13144 
U62484 
--D38488- 
D38489 



recA (Rad51) 
recA (Rad51) 
recA -(RadS-1 ) 
reo4(Rad51) 



* fu/indicates tuf sequences, including tuf genes, fusA genes and fusAfu/intergenic spacers. 

tuf (C) indicates fuf sequences divergent from main (usually A and B) copies of the elongation factor-Tu 
10 tuf (EF-1 ) indicates tuf sequences of the eukaryotic type (elongation factor 1a) 

tuf(M) indicates tuf sequences from organellar (mostly mitochondrial) origin 

atpD indicates atpD sequences ot the F-type 

atpD (V) indicates atpD sequences of the V-Type 

recA indicates recA sequences 
15 recA (Rad51 ) indicates rad51 sequences or homologs 

recA (Dmd,) indicates dmd sequences or homologs 

1 Nucleotides sequences published in Arch. Microbiol. 1990 153:241-247 

2 These sequences are from theTIGR database (http://www.tigr.org/tdb/tdb.html) 

3 Nucleotides sequences published in FEMS Microbiology Letters 1988 50:101-106 
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Table 12. Bacterial species used to test the specificity of the Staphylococcus-spec if ic amplification 
primers derived from tut sequences, 



Strain 



Reference number 



Strain 



Reference number 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Staphylococcal species (n=27) 

Staphylococcus arfettae ATCC 43957 

-Staphylococcus aureus ATCC-35844 . 

subsp. anaerobius 

Staphylococcus aureus ATCC 43300 
subsp. aureus 

Staphylococcus auricularis ATCC 33753 

Staphylococcus capitis ATCC 27840 
subsp. capitis 

Staphylococcus caprae ATCC 35538 

Staphylococcus carnosus ATCC 5 1 365 

Staphylococcus chromogenes ATCC 43764 

Staphylococcus cohnii DSM 20260 
subsp. urealyticum 

Staphylococcus delphini ATCC 491 71 

Staphylococcus epidermidis ATCC 1 4990 

Staphylococcus equorum ATCC 43958 

Staphylococcus felis ATCC 49 1 68 

Staphylococcus gallinarum ATCC 35539 

Staphylococcus haemolyticus ATCC 29970 

Staphylococcus hominis ATCC 27844 

Staphylococcus hyicus ATCC 1 1 249 

Staphylococcus intermedius ATCC 29663 

Staphylococcus kloosis ATCC 43959 

Staphylococcus lentus ATCC 29070 

Staphylococcus lugdunensis ATCC 43809 

Staphylococcus saprophytics ATCC 1 5305 

Staphylococcus schteiferi ATCC 49545 
subsp. coagulans 

Staphylococcus sciuri ATCC 29060 
subsp. sciuri 

Staphylococcus simulans ATCC 27848 

Staphylococcus warned ATCC 27836 

Staphylococcus xylosus ATCC 29971 

Gram-negative bacteria (n=33) 

Acinetobacter baumannii ATCC 1 9606 

Bacteroides distasonis ATCC 8503 

Bacteroides fragilis ATCC 25285 

Buikholderia cepacia ATCC 254 1 6 

Bordetella pertussis ATCC 9797 

Citrobacter freundii ATCC 8090 

Enterobacter aerogenes ATCC 1 3048 

Enterobacter cloacae ATCC 1 3047 

Escherichia coli ATCC 25922 

Haemophilus influenzae ATCC 8907 

Haemophilus parahaemolyticus ATCC 1 00 1 4 

Haemophilus parainfluenzae ATCC 7901 

Hafnia alvei ATCC 1 3337 

Kingella indologenes ATCC 25869 

Klebsiella oxytoca ATCC 1 31 82 

Klebsiella pneumoniae ATCC 1 3883 

Moraxella catarrhalis ATCC 25240 



Other Gram-positive bacteria (n=20) 

Bacillus subtilis ATCC 27370 
ATCC -1 4025- 



Enterococcusavium 

Enterococcus durans 

Enterococcus faecalis 
Enterococcus faecium 

Enterococcus fiavescens 
Enterococcus gallinarum 
Lactobacillus acidophilus 
Lactococcus lactis 

Listeria innocua 
Listeria ivanovii 
Listeria monocytogenes 
Macrococcus caseotyticus 
Streptococcus agalactiae 
Streptococcus anginosus 
Streptococcus bovis 
Streptococcus mutans 
Streptococcus pneumoniae 
Streptococcus pyogenes 
Streptococcus salivarius 



Morganella morganii 
Neisseria gonorrhoeae 
Neisseria meningitidis 
Proteus mirabilis 
Proteus vulgaris 
Providencia rettgeri 
Providencia stuartii 
Pseudomonas aeruginosa 
Pseudomonas fluorencens 
Salmonella choleraesuis 
Salmonella typhimurium 
Serratia marcescens 
Shigella flexneri 
Shigella sonnei 
Stenotrophomonas maltophilia 
Yersinia enterocolitica 



ATCC 19432 

ATCC 19433 
ATCC 19434 

ATCC 49996 
ATCC 49573 
ATCC 4356 
ATCC 11454 

ATCC 33090 
ATCC 19119 
ATCC 15313 
ATCC 13548 
ATCC 13813 
ATCC 33397 
ATCC 33317 
ATCC 25175 
ATCC 6303 
ATCC 19615 
ATCC 7073 



ATCC 25830 
ATCC 35201 
ATCC 13077 
ATCC 25933 
ATCC 13315 
ATCC 9250 
ATCC 29914 
ATCC 27853 
ATCC 13525 
ATCC 7001 
ATCC 14028 
ATCC 8100 
ATCC 12022 
ATCC 29930 
ATCC 13843 
ATCC 9610 
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Table 13. Bacterial species used to test the specificity f the penicillin-resistant Streptococcus 
pneumoniae assay. 





Strain 


Reference number 


Strain Reference number 


5 


uram-posiuv© species (n=oij 










Abiotrophia adiacens 


ATCC 49175 


Staphylococcus hominis 


ATCC 27844 




Abiotrophia defectiva 


ATCC 49176 


Staphylococcus lugdunensis 


ATCC 43809 





Actinomyces pyogenes 


ATCG-1 941-1 


- - Staphylococcus saprophyticus~PtTGC \5305 


10 


Bacillus anthracis 


ATCC 4229 


Staphylococcus simulans 


ATCC 27848 




Bacillus cereus 


ATCC 14579 


Staphylococcus, warneri 


ATCC 27836 




Bifidobacterium breve 


ATCC 15700 


Streptococcus acidominimus 


ATCC 51726 




Clostridium difficile 


ATCC 9689 


Streptococcus agalactiae 


ATCC 12403 




Enterococcus avium 


ATCC 14025 


Streptococcus anginosus 


ATCC 33397 


15 


Enterococcus casseliflavus 


ATCC 25788 


Streptococcus bovis 


ATCC 33317 




Enterococcus dispar 


ATCC 51266 


Streptococcus constellatus 


ATCC 27823 




Enterococcus durans 


ATCC 19432 


Streptococcus cricetus 


ATCC 19624 




Enterococcus faecalis 


ATCC 29212 


Streptococcus crlstatus 


ATCC 51100 




Enterococcus faecium 


ATCC 19434 


Streptococcus downei 


ATCC 33748 


20 


Enterococcus flavescens 


ATCC 49996 


Streptococcus dysgalactiae 


ATCC 43078 




Enterococcus gallinarum 


ATCC 49573 


Streptococcus equi 


ATCC 9528 




Enterococcus hirae 


ATCC 8043 


Streptococcus ferns 


ATCC 33477 




Enterococcus mundtii 


ATCC 43186 


Streptococcus gordonii 


ATCC 10558 




Enterococcus raffinosus 


ATCC 49427 


Streptococcus intermedius 


ATCC 27335 


25 


Lactobacillus lactis 


ATCC 19435 


Streptococcus mitis 


ATCC 903 




Lactobacillus monocytogenes 


ATCC 15313 


StreDtococcus mitis 


LSPQ 2583 




Mobiluncus curtisii 


ATCC 35242 


Streptococcus mitis 


ATCC 49456 




Peptococcus niger 


ATCC 27731 


Streptococcus mutans 


ATCC 27175 




Peptostreptococcus acones 


ATCC 6919 


Streptococcus oralis 


ATCC 10557 


30 


Peptostreptococcus anaerobius 


ATCC 27337 


Streptococcus oralis 


ATCC 9811 




Peptostreptococcus 


ATCC 2639 


Streptococcus oralis 


ATCC 35037 




asaccharolyticus 




Streptococcus parasanguinis 


ATCC 15912 




Peptostreptococcus lactolyticus 


ATCC 51172 


Streptococcus parauberis 


ATCC 6631 




Peptostreptococcus magnus 


ATCC 15794 


Streptococcus rattus 


ATCC 15912 


35 


Peptostreptococcus prevotii 


ATCC 9321 


Streptococcus salivarius 


ATCC 7073 




Peptostreptococcus tetradius 


ATCC 35098 


Streptococcus sanguinis 


ATCC10556 




Staphylococcus aureus 


ATCC 25923 


Streptococcus suis 


ATCC 43765 




Staphylococcus capitis 


ATCC 27840 


Streptococcus uteris 


ATCC 19436 




Staphylococcus epidermidis 


ATCC 14990 


Streptococcus vestibularis 


ATCC 49124 


40 


Staphylococcus haemolyticus 


ATCC 29970 








Gram-negative species (n=33) 










Actinetobacter baumannii 


ATCC 19606 


Moraxella morgan// 


ATCC 13077 




Bordetella pertussis 


ATCC 9797 


Neisseria gonorrhoeae 


ATCC 35201 




Citrobacter diversus 


ATCC 27028 


Neisseria meningitidis 


ATCC 13077 


45 


Citrobacter freundii 


ATCC 8090 


Proteus mirabilis 


ATCC 25933 




Enterobacter aerogenes 


ATCC 13048 


Proteus vulgaris 


ATCC 13315 




Enterobacter agglomerans 


ATCC 27155 


Providencia alcalifaciens 


ATCC 9886 




Enterobacter cloacae 


ATCC 13047 


Providencia rettgeri 


ATCC 9250 




Escherichia coli 


ATCC 25922 


Providencia rustigianii 


ATCC 33673 


50 


Haemophilus ducreyi 


ATCC 33940 


Providencia stuartii 


ATCC 33672 




Haemophilus haemolyticus 


ATCC 33390 


Pseudomonas aeruginosa 


ATCC 35554 




Haemophilus influenzae 


ATCC 9007 


Pseudomonas fluorescens 


ATCC 13525 




Haemophilus parainfluenzae 


ATCC 7901 


Pseudomonas stutzeri 


ATCC 17588 




Hafnla alvei 


ATCC 13337 


Salmonella typhimurium 


ATCC 14028 


55 


Klebsiella oxytoca 


ATCC 13182 


Serratia marcescens 


ATCC 13880 




Klebsiella pneumoniae 


ATCC 13883 


Shigella flexneri 


ATCC 12022 




Moraxella atlantae 


ATCC 29525 


Yersina enterocolitica 


ATCC 9610 




Moraxella catarrhalis 


ATCC 43628 







60 
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Table 14. Bacterial species (n=104) detected by the platelet contaminants assay. Bold characters 
indicate the major bacterial contaminants found in platelet concentrates. 



5 Abiotrophia adiacens 
Ablotrophia defectiva 
Acinetobacter baumannii 
Acinetobacter Iwoffi 
Aerococcus viridans 45 

J0_ Bacillus anthracis 

Bacillus cereus 
Bacillus subtllls 
Brucella abortus 

Burkholderia cepacia 50 
1 5 Citrobacter divers us 

Citrobacter freundii 

Enterobacter aerogenes 

Enterobacter agglomerans 

Enterobacter cloacae 55 
20 Enterococcus avium 

Enterococcus casselifJavus 

Enterococcus dispar 

Enterococcus durans 

Enterococcus faecalis 60 
25 Enterococcus faecium 

Enterococcus flavescens 

Enterococcus gallinarum 

Enterococcus mundtii 

Enterococcus raffinosus 65 
30 Enterococcus solitarius 

Escherichia coll 

Gemella morbillorum 

Haemophilus ducreyi 

Haemophilus haemolyticus 70 
35 Haemophilus influenzae 

Haemophilus 

parahaemolyticus 

Haemophilus parainfluenzae 

Hafnia atvei 75 
40 Kingella kingae 



230 



Klebsiella oxytoca 
Klebsiella pneumoniae 

Legionella pneumophila 
Megamonas hypermegale 
Moraxeila atlantae 
Moraxel[a catajrhalis_ 
Morganella morganii 
Neisseria gonorrheae 
Neisseria meningitidis 
Pasteurella aerogenes 
Pasteurella multocida 
Peptostreptococcus magnus 
Proteus mirabllis 
Providencia alcalifaciens 
Providencia rettgeri 
Providencia rustigianii 
Providencia stuartii 
Pseudomonas aeruginosa 
Pseudomonas fluorescens 
Pseudomonas stutzeri 
Salmonella bongori 
Salmonella choleraesuls 
Salmonella enteritidis 
Salmonella gallinarum 
Salmonella typhimurium 
Serratia liquefaciens 
Serratla marcescens 
Shigella flexneri 
Shigella sonnel 
Staphylococcus aureus 
Staphylococcus capitis 
Staphylococcus epldermidis 
Staphylococcus haemolyticus 
Staphylococcus hominis 
Staphylococcus lugdunensis 
Staphylococcus saprophytics 



Staphylococcus simulans 
Staphylococcus warned 
Stenotrophomonas maltophilia 

80 Streptococcus acidominimus 
Streptococcus agalactiae 

Streptwo<xus_anginosus_ 

Streptococcus bovis 
Streptococcus constellatus 

85 Streptococcus cricetus 
Streptococcus cnstatus 
Streptococcus dysgalactiae 
Streptococcus equi 
Streptococcus ferus 

90 Streptococcus gordonii 
Streptococcus intermedius 
Streptococcus macacae 
Streptococcus mitis 
Streptococcus mutans 

95 Streptococcus oralis 

Streptococcus parasanguinis 
Streptococcus parauben's 
Streptococcus pneumoniae 
Streptococcus pyogenes 
1 00 Streptococcus ratti 

Streptococcus salivarius 
Streptococcus sanguinis 
Streptococcus sobrinus 
Streptococcus uberis 
105 Streptococcus vestibularis 
Vibrio cholerae 
Yersinia enterocolitica 
Yersinia pestis 

Yersinia pseudotuberculosis 
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Abiotrophia adiacens (Streptococcus 
adjacens) 

Abiotrophia defectiva (Streptococcus 
defectivus) 
5 Achromobecter species 
Acidaminococcus fermentans 
Acinetobacter alcaligenes 
Acinetobacier anitratus 

'A dnetobacter baumannii 

1 0 Acinetobacter catcoaceticus 

Acinetobacter calcoaceticus biovar 
anitratus 

Acinetobacter catcoaceticus biovar 
twoffi 

15 4c/nefotecfergenomo$pecies 

Acinetobacter haemoiyticus 

Acinetobacter Johnsoml 

Acinetobacter junii 

Acinetobacter tworfti 
20 Acinetobacter radioresistens 

Acinetobacter species 

Adinobaciffus actinomycetemcomitans 

Adinobacitius capsulatus 

Adinobacitius equufl 
25 ActinobaciHus homlnis 

Actinobacitius iignieresii 

Actinobacitius pleuropneumonia 

Actinobacitius species 

Actinobacitius suis 
30 ActinobaciHus ureae 

Actinomyces ooWs 

Actinomyces israelii 

Actinomyces meyeri 

Actinomyces naeslundti 
35 Actinomyces neuii subsp. anitratus 

Actinomyces neuii subsp. neuii 

Actinomyces odontoJyticus 

Actinomyces pyogenes 

Actinomyces radingae 
40 Actinomyces species 

Actinomyces turicensis 

Actinomyces viscosus 

Aerococcus species 

Aerococcus viridans 
45 Aeromonas caviae 

Aeromonas hydrophila 

Aeromonas hydrophila group 

Aeromonas jandaei 

Aeromonas salmonidda 
50 Aeromonas salmonidda subsp. 

achromogenes 

Aeromonas salmonidda subsp. 
masoucida 

Aeromonas salmonidda subsp. 
55 salmonidda 

Aeromonas schubertii 

Aeromonas sobria 

Aeromonas species 

Aeromonas trota 
60 Aeromonas veronii 

Aeromonas veronii biovar sobria 

Aeromonas veronii biovar veronii 

Agrobacterium radiobacter 

Agrobacterium species 
65 Agrobacterium tumefadens 

Atcaligenes denitrlficans 

Atcaligenes taecalis 

Atcaligenes odorans 

Atcaligenes odorans (Atcaligenes 
70 taecalis) 

Atcaligenes species 

Alcaligenes xylosoxidans 

Atcaligenes xylosoxidans subsp. 

denitrlficans 



75 Alcaligenes xylosoxidans subsp. 
xylosoxidans 
Atloiococcus otitis 

Anaerobiospirillum sucdnlciproducens 
Anaerovibrio lipolytics 
80 Arachnia propionics 

Arcanobaderium (Actinomyces) 
bemardiae 

Arcanobscterium {Actinomyces) 

pyogenes 
85 Arcanobacterium haemotyticum 

Arcobacter cryaerophiius 

(Campylobacter cryaerophlla) 

Arthrobacter globlformls 

Arthrobacter species 
90 Arxhzyma teliun's (Tonjlopsis 

pintotopesii) 

Atopobium minutum (Lactobacillus 

minutus) 
. Aureobacterium species 
95 Bacillus amyloliquefaciens 

Bacillus anthrads 

Bacillus badius 

Bacillus cereus 

Bacillus drculans 
100 Bacillus coagulans 

Bacillus firmus 

Bacillus lentus 

Bacillus lichenitormfs 

Bacillus megaterium 
1 05 Bacillus mycoides 

Bacillus pantothenticus 

Bacillus pumiius 

Bacillus species 

Bacillus sphaericus 
110 Bacillus stearothermophilus 

Bacillus subtilis 

Bacillus thuringiensis 

Bacteroides caeca e 

Bacteroides capillosus 
115 Bacteroides distasonis 

Bacteroides eggerthii 

Bacteroides fragilis 

Bacteroides merdae 

Bacteroides ovatus 
120 Bacteroides species 

Bacteroides splanchnicus 

Bacteroides stercons 

Bacteroides thetaiotaomicron 

Bacteroides uniformis 
1 25 Bacteroides ureolytlcus (B. corrodens) 

Bacteroides vulgatus 

Sergey eifa (Weeksefla) zoohelcum 

Bifidobacterium actoiescentfs 

Bifidobacterium blfidum 
1 30 Bifidobacterium breve 

Bifidobacterium dentium 

Bifidobacterium infanHs 

Bifidobacterium species 

Btastoschizomyces (Dipodascus) 
135 capitatus 

Bordetella avium 

Bordetella bronchiseptica 

Bordetella parapertussis 

Bordetella pertussis 
140 Bordetella species 

Borreiia species 

Branhametla {Moraxella) catarmatis 
Branhamefla species 
Brevibadllus brevis 
145 Brevibadllus laterosporus 
Brevibacterium casei 
Brevibaderium epidarmidis 
Brevibacterium linens 



Brevibacterium species 
150 Brevundimonas (Pseudomonas) 
dimfnuta 

Brevundimonas (Pseudomonas) 
vesicutaris 

Brevundimonas species 
1 55 Brochothrix thermos phada 

Brucella abortus 
Brucella canls 

Brucella meiitensis 

Brucella ovis 
160 Brucella species 

Brucella suis 

Budvicla aquatics 

Burkholderia (Pseudomonas) cepada 

Burkholderia (Pseudomonas) gladioli 
165 Burkholderia (Pseudomonas) mallei 

Burkholderia (Pseudomonas) 

pseudomelia! 

Burkholderia species 

Buttlauxella agrestis 
170 Campylobacter coU 

Campylobacter concisus 

Campylobacter fetus 

Campylobacter fetus subsp. fetus 

Campylobacter fetus subsp. 
175 venerealis 

Campylobacter hyointestinatis 

Campylobacter Jejuni subsp. doylel 

Campylobacter jejuni subsp. jejuni 

Campylobacter lari 
1 80 Campylobacter ten subsp. UPTC 

Campylobacter mucosafis 

Campylobacter species 

Campylobacter spuiorum 

Campylobacter sputorum subsp. 
185 bubulus 

Campylobacter sputorum subsp. 

fecatis 

Campylobacter sputorum subsp. 

sputorum 
190 Campylobacter upsafiensis 

Candida (Clavispora) lushanlae 

Candida (Pichia) guiltiermondii 

Candida (Torulopsis) giabrata 

Candida albicans 
195 Candida boidinii 

Candida catenulata 

Candida ciferrii 

Candida cotliculosa 

Candida conglobate 
200 Candida curvata [Cryptococcus 

curvatus) 

Candida dattita 

Candida dubliniensis 

Candida famata 
205 Candida globose 

Candida hellenha 

Candida holmii 

Candida humJcda 

Candida inconspicua 
2 1 0 Candida intermedia 

Candida kefyr 

Candida krusel 

Candida Iambics 

Candida magnoliae 
215 Candida maris 

Candida mellbioslca 

Candida membranaefadens 

Candida norvegensis 

Candida norvegice 
220 Candida parapsilosfs 

Candida paratropicalls 

Candida petliculosa 
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Candida pseudotropicaiis 

Candida pulchemma 

Candida ravautii 

Candida rvgosa 
5 Candida sake 

Candida sitvlcoJa 

Candida species 

Candida sphaerica 

Candida steltatoidea 
— 10— Candida tenuis 

Candida tropicalis 

Candida utilis 

Candida valida 

Candida vini 
15 Candida viswanathU 

Candida zeylanoides 

Capnocytophaga gingivalis 

Capnocytophaga ochracea 

Capnocytophaga species 
20 Capnocytophaga sputigena 

Cardiobacterium hominis 

Camobacterium divergens 

Camobacterium piscicoJa 

CDC group ED-2 
25 CDC group EF4 (Pasteuretla sp.) 

CDC group EF-4A 

CDC group EF-4B 

CDC group EQ-2 

CDC group HB-5 
30 CDC group II K-2 

CDC group IV C-2 (Sorcfofe/te-like) 

CDC group M5 

CDC group M6 

Cedecea davisae 
35 Cedecea lapagei 

Cedecea neteri 

Cedecea species 

Cellulomonas [Oerskovia) turbata 

Cellufomonas species 
40 Chlamydia species 

Chrome-bacterium vfotaceum 

Chryseobacterium (Flavobacterium) 

indofogenes 

Chryseobacterium {Flavobacterium) 
45 meningosepticum 

Chryseobacterium gleum 

Chryseobacterium species 

Chryseomonas fndologenes 

CHeromyces matritensis 
50 Citrobacter amaionaticus 

Citrobacter braakii 

Citrobacter diversus 

Citrobacter farmeri 

Citrobacter freundii 
55 Citrobacter freundii complex 

Citrobacter koseri 

Citrobacter sedlakii 

Citrobacter species 

Citrobacter werkmanii 
60 Citrobacter youngae 

Clostridium aceiobutyiicum 

Clostridium barati 

Clostridium beijerinckii 

Clostridium blfennentans 
65 Clostridium botulinum . 

Clostridium botulinum (NP) B&F 

Clostridium botulinum (NP) E 

Clostridium botulinum (?) A&H 

Clostridium botulinum (P) F 
70 Clostridium botulinum G1 

Clostridium botulinum G2 

Clostridium butyricum 

Clostridium cadaveria 

Clostridium chauvoel 
75 Clostridium clostridiiforme 

Clostridium difficile 

Clostridium fallax 

Clostridium glycolicum 



Clostridium hastiforme 
80 Clostridium histolyticum 

Clostridium Innocuum 

Clostridium Hmosum 

Clostridium novyi 

Clostridium novyi A 
85 Clostridium paraputrificum 

Clostridium perfringens 

Clostridium putrificum 
Ctostridiurnjamosum 

Clostridium septicum - — 

90 Clostridium sordeltii 

Clostridium species 

Clostridium sphenoides 

Clostridium sporogenes 

Clostridium subterminale 
95 Clostridium tertium 

Clostridium tetani ' 

Clostridium tyrobutyricum 

Comamonas {Pseudomonas) 

acidovorans 
1 00 Comamonas {Pseudomonas) 

testosteron! 

Comamonas species 

Corynebacterium accofens 

Corynebacterium afermentans 
105 Corynebacterium amycolatum 

Corynebacterium aquaticum 

Corynebacterium argentoratense 

Corynebacterium auris 

Corynebacterium bovis 
110 Corynebacterium coyleae 

Corynebacterium cystHidis 

Corynebacterium diphtheriae 

Corynebacterium diphtheriae biotype 

belfanti 

1 1 5 Corynebacterium diphtheriae biotype 
gravis 

Corynebacterium diphtheriae biotype 
intermedius 

Corynebacterium diphtheriae biotype 
120 mltis 

Corynebacterium flavescens 

Corynebacterium glucuronolyticum 

Corynebacterium glucuronoiyticum- 

seminale 
1 25 Corynebacterium group A 

Corynebacterium group A-4 

Corynebacterium group A-5 

Corynebacterium group ANF 

Corynebacterium group B 
1 30 Corynebacterium g roup B-3 

Corynebacterium group F 

Corynebacterium group F-1 

Corynebacterium group F-2 

Corynebacterium group G 
1 35 Corynebacterium group G*1 

Corynebacterium group G-2 

Corynebacterium group I 

Corynebacterium group I-2 

Corynebacterium jeikeium (group JK) 
1 40 Corynebacterium kutscheri (C. 

murium) 

Corynebacterium macginleyi 

Corynebacterium mlnutisslmum 

Corynebacterium pilosum 
145 Corynebacterium propinquum 

Corynebacterium 

pseudod'tphtheriticum 

Corynebacterium pseudotuberculosis 

Corynebacterium pyogenes 
150 Corynebacterium renale 

Corynebacterium renale group 

Corynebacterium seminale 

Corynebacterium species 

Corynebacterium striatum (C. 
155 flavidum) 

Corynebacterium ufcerans 



Corynebacterium urealyiicum (group 
D2) 

Corynebacterium xerosis 
160 Cryptococcus aibktus 

Cryptococcusater 

Cryptococcus cereanus 

Cryptococcus gastricus 

Cryptococcus hum/co/us 
1 65 Cryptococcus lactativorus 

Cryptococcus taurerttli 
Cryptococcus luteoius 

Cryptococcus me li bios urn 

Cryptococcus neoformans 
170 Cryptococcus species 

Cryptococcus terreus 

Cryptococcus uniguttulatus 

Debaryomyces hansenil 

Debaryomyces mamma 
175 Debaryomyces potymorphus 

Debaryomyces species 

Dermabacter hominis 

Denrtacoccus (Micrococcus) 

nishinomiyaensis 
1 80 Dietzja species 

Edwardsiella hoshinae 

Edwardsiella ictaluri 

Edwardsiella species 

Edwardsiella tarda 
185 Eikenella corrodens 

Empedobacterbrevis (Flavobacterium 

breve) 

Enterobacter aerogenes 

Enterobacter agglomerans 
1 90 Enterobacter amnigenus 

Enterobacter amnigenus asburiae 

(CDC enteric group 17) 

Enterobacter amnigenus biogroup 1 

Enterobacter amnigenus biogroup 2 
195 Enterobacter asburiae 

Enterobacter cancerogenus 

Enterobacter cloacae 

Enterobacter gergovlae 

Enterobacter norma echel 
200 Enterobacter intermedius 

Enterobacter sakazakii 

Enterobacter species 

Enterobacter taylorae 

Enterobacter taylorae (CDC enteric 
205 group 19) 

Enterococcus (Streptococcus) 

cecorum 

Enterococcus (Streptococcus) faecal'ts 

(Group D) 
210 Enterococcus (Streptococcus) 

feec/um(Group D) 

Enterococcus (Streptococcus) 

saccharolyticus 

Enterococcus avium (Group D) 
215 Enterococcus casselirTavus 

(Steptococcus faecium subsp. 

cassefifiavus) 

Enterococcus durans (Streptococcus 

faecium subsp. durans) (Group D) 
220 Enterococcus gall'mawm 

Enterococcus hirae 

Enterococcus malodoratus 

Enterococcus mundtil 

Enterococcus rafflnosus 
225 Enterococcus species 

Erwinia amytovora 

Erwinia carotovora 

Erwinia carolovore subsp. atroseptica 
Erwinia carotovore subsp. 
230 betavasculorum 

Erwinia carotovora subsp. carotovora 
Erwinia chrysanthemi 
Erwinia cypripedii 
Erwinia mallotivora 
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Erwinia ntgriftuens 

Erwinia quercina 

Erwinia rhapontid 

Erwinia rubrifaciens 
5 Erwinia salicis 

Erwinia species 

Erysipelothrix rhusiopathiae 

Erysipelothrix species 

Escherichia blame _ 
10- Escherichia coli ~ 

Escherichia coll A-D 

Escherichia coll 0157:H7 

Escherichia fergusonii 

Escherichia hermannil 
15 Escherichia species 

Escherichia vulneris 

Eubactarium aerofaciens 

Eubactarium alactotyticum 

Eubactarium lantum 
20 Eubactarium timosum 

Eubactarium species 

Ewingelta americana 

Fllobasldieila neotormans 

Filobasidium floriforme 
25 Fiiobasidium uniguttulatum 

Flavimonas oryzihabitans 

Flavobaderium gleum 

Flavobaderium indologenea 

Fiavobacterium odoratum 
30 Fiavobacterium species 

Francisella novidda 

Francisefla philomiragia 

Francisella species 

Francisella tularensis 
35 Fusobacterium morttferum 

Fusobacterium necrogenes 

Fusobacterium necrophonjm 

Fusobacterium nucleatum 

Fusobacterium species 
40 Fusobacterium varium 

Gaffkya species 

Gardnerella vaginalis 

Gemetla haemofysans 

Gemetla morbiliorum 
45 Gemella species 

Geotrichum candidum 

Geotrichum termentans 

Geotrichum penldllarum 

Geotrichum penidfiaium 
50 Geotrichum species 

Gordons species 

Haemophilus aegyptius 

Haemophilus aphrophilus 

Haemophilus ducreyi 
55 Haemophilus haemoglobinopNlus 

Haemophilus haemdybcus 

Haemophilus influenzae 

Haemophilus influenzae biotype I 

Haemophilus influenzae biotype II 
60 Haemophilus influenzae biotype III 

Haemophilus Influenzae biotype IV 

Haemophilus influenzae biotype V 

Haemophilus influenzae biotype VI 

Haemophilus influenzae biotype VII 
65 Haemophilus influenzae biotype VIII 

Haemophilus paragallinarum 

Haemophilus parahemolyticus 

Haemophilus parainfluenzae 

Haemophilus parainfluenzae biotype I 
70 Haemophilus parainfluenzae biotype II 

Haemophilus parainfluenzae biotype 

III 

Haemophilus parainfluenzae biotype 
IV 

75 Haemophilus parainfluenzae biotype V 
Haemophilus parainfluenzae biotype 
VI 

Haemophilus parainfluenzae biotype 



VII 

80 Haemophilus parainfluenzae biotype 
VIII 

Haemophilus paraphrohaemofyticus 
Haemophilus paraphrophilus 
Haemophilus segnis 
85 Haemophilus somnus 
Haemophilus species 
Hafnia ah/el 

Hanseniaspora guilliermondi} 

Hanseniaspora uvarum 
90 Hanseniaspora valbyensis 

Hansenula anomala 

Hansenuta holstii 

Hansenula polymorphs 

Helicobacter {Campylobacter) cinaedi 
95 Helicobacter (Campylobacter) 

fennelliae 

Helicobacter (Campylobacter) pylori 

Issatchenkia orientals 
, Kingella denitrificans 
1 00 Kingella indologenes 

Kingella kingae 

Kingella species 

Klebsiella omithinofytica 

Klebsiella oxytoca 
1 05 Klebsiella planticota 

Klebsiella pneumoniae subsp. 

ozaenae 

Klebsiella pneumoniae subsp. 

pneumoniae 
1 1 0 Klebsiella pneumoniae subsp. 

rhinoscieromatis 

Klebsiella species 

Klebsiella terrigena 

Kloeckera apicutata 
115 Kloeckera apis 

Kloeckera japonica 

Kloeckera species 

Kiuyvera ascorbata 

Kluyvere cryocrescens 
120 Kiuyvera species 

Kluyveromyces lactis 

Kluyveromyces maixianus 

Kluyveromyces thermotolerans 

Kocuria (Micrococcus) kristinae 
1 25 Kocuria (Micrococcus) rosea 

Kocuria(Micrococcus) vartans 

Koserella trabulsil 

Kytococcus (Micrococcus) sedentarius 

Lactobacillus (Weisselia) viridescens 
130 Lactobacillus A 

Lactobacillus acidophilus 

Lactobacillus B 

Lactobacillus brevis 

Lactobacillus buchneri 
1 35 Lactobacillus casei 

Lactobacillus casei subsp. casei 

Lactobadllus casei subsp. lactosus 

Lactobacillus casei subsp. rhamnosus 

Lactobadllus catena fcrmls 
140 Lactobadllus cellobiosus 

Lactobadllus cdlinoides 

Lactobadllus coprophilus 

Lactobadllus crispatus 

Lactobadllus curvatus 
145 Lactobacillus delbrveckit subsp. 

bulgaricus 

Lactobadllus defbrueckii subsp. 
defbrueckii 

Lactobacillus defbrueckii subsp. lactis 
150 Lactobadllus fermentum 

Lactobadllus fructivorans 

Lactobadllus helveticus 

Lactobadllus helveticus subsp. jugurti 

Lactobadllus jensenii 
155 Lactobadllus lindneri 

Lactobadllus minutus 



Ladobadllus paracasel subsp. 
paracasei 

Lactobacillus pentosus 
1 60 Lactobadllus ptantarum 

Lactobadllus salfvarius 

Ladobacillus salivarius var. salicinius 

Lactobacillus species 

Lectococcus diadtllactis 
1 65 Lactccoccus garvieae 
Udocpccusjactis subsp^ cremoris 

Lactccoccus lactis subsp. diacitilactis 

Lactccoccus lactis subsp. hordniae 

Lactccoccus lactis subsp. lactis 
1 70 Ladococcus plantarum 

Ladococcus raffinoladis 

Ledercia adecarboxytata 

Legionella species 

Leminorella species 
1 75 Leptospira species 

Leptotrichia buccaHs 

Leuconostoc (Weisselia) 

paramesenterddes 

Leuconostoc camosum 
1 80 Leuconostoc citreum 

Leuconostoc gelidum 

Leuconostoc tadis 

Leuconostoc mesenterddes 

Leuconostoc mesenterddes subsp. 
185 cremoris 

Leuconostoc mesenterddes subsp. 

dextranicum 

Leuconostoc mesenterddes subsp. 

mesenterddes 
190 L euconostoc species 

Listeria gray! 

Listeria innocua 

Listeria ivanovu 

Listeria monocytogenes 
195 Listeria murrayi 

Listeria seeligeri 

Listeria species 

Listeria welshrmeri 

Megasphaera elsdenii 
200 Methylobacterium mesophilicum 

Metschnikow'ia pulcherrima 

Microbacierium species 

Micrococcus tuteus 

Micrococcus tytae 
205 Micrococcus species 

Mobfluncus curtisii 

Mobituncus mulieris 

Mobiluncus species 

Moeflerefla wisconsensis 
2 1 0 Moraxelia (Branhamella) catarrhalis 

Moraxella attantae 

Moraxelia bovis 

Moraxella lacunata 

Moraxelia nontiquefadens 
2 1 5 Moraxella osloensis 

Moraxella phenytpyruvica 

Moraxelia species 

Morganella morganii 

Morganella morganii subsp. morganii 
220 Morganella morganii subsp. sibonii 

Mycobaderium efricanum 

Mycobaderium asiaticum 

Mycobacterium avium 

Mycobaderium bovis 
225 Mycobaderium chelonae 

Mycobaderium fortuitum 

Mycobacterium gordonae 

Mycobaderium kartsasu 

Mycobaderium maJmoense 
230 Mycobacterium marinum 

Mycobaderium phtel 

Mycobacterium scrofulaceum 

Mycobacterium smegmatis 

Mycobaderium species 
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Mycobacterium tuberculosis 

Mycobacterium ulcerans 

Mycobacterium xenopi 

Mycoplasma fermentans 
5 Mycoplasma hominis 

Mycoplasma orate 

Mycoplasma pneumoniae 

Mycoplasma species 

Myroides species 
10 Neisseria dnerea 
~ Neisseria elongate sub$prelongata~ 

Neisseria flava 

Neisseria ftevescens 

Neisseria gonorrhoeae 
15 Neisseria lactamica 

Neisseria meningitidis 

Neisseria mucosa 

Neisseria perflava 

Neisseria polysaccharea 
20 Neisseria saprophytes 

Neisseria sicca 

Neisseria subflava 

Neisseria weaveri 

Neisseria weaveri (CDC group M5) 
25 Nocardia species 

Ochrobectrvm anthropi 

Oerskovia species 

Oerskovia xanthineclytica 

Oligella {Moraxella) urethrals 
30 Oligella species 

Oligella uredyt'ca 

Paenibaciifus alvei 

Paenibacillus macerans 

Paenibaciltus pcSymyxa 
35 Panxoea aggiomerans 

Pantoea ananas (Erwinfa uredovora) 

Pantoea dispersa 

Pantoea species 

Pantoea stewartii 
40 Pasteurella (Haemophilus) avium 

Pasteurella aerogenes 

Pasteurella gaflinarum 

Pasteurella haemotytica 

Pasteurella haemoiyticus 
45 Pasteurella muttocida 

Pasteurella muttocida SF 

Pasteurella muttocida subsp. 

multocida 

Pasteurella muttocida subsp. septlca 

50 Pasteurella pneumotropica 
Pasteurella species 
Pasteurella ureae 
Pediococcus acidilactici 
Pediococcus damnosus 

55 Pediococcus pentosaceus 
Pediococcus species 
Pepiococcus niger 
Peptococcus species 
Peptostreptccoccus anaerobius 

60 Peptostreptccoccus asaccharotyticus 
Peptostreptccoccus tndoticus 
Peptostreptccoccus magnus 
Peptostreptococcus micros 
Peptostreptccoccus parvutus 

65 Peptostreptccoccus prevotii 
Peptostreptccoccus productus 
Peptostreptccoccus species 
Peptostreptccoccus tetradius 
Phaecoccomyces exophialiae 

70 Photobacterium damselae 
Pichia (Hansenula) anomata 
Pichia [Hansenula) jadinii 
Pichla (Hansenula) petersonli 
Pichia angusta (Hansenula 

75 polymorphs) 

Pichia carsonli (P. vlnt) 
Pichia etchetlsii 
Pichia farinose 



Pichia fermentans 
80 Pichia membranaefaciens 

Pichia norvegensis 

Pichia ohrneri 

Pichia spam'nae 

Pichia species 
85 Piesfomonas shigeltoides 

Porphyromonas asaccharotytica 

Porphyromonas endodontatis 

Porphyromonas gingivals 

Pb^hyromorias'tevii 
90 Prevotetla (Bacteroides) buccae 

Prevotelta (Bacteroides) buccalis 

Prevotella (Bacteroides) corporis 

Prevotetla (Bacteroides) dentcola 

Prevotetla (Bacteroides) loescheii 
95 Prevotella (Bacteroides) oralis 

Prevotetla (Bacteroides) disiens 

Prevotetla (Bacteroides) oris 

Prevotella bivia (Bacteroides bMu$) 

Prevotella intermedia (Bacteroides 
100 intermedlus) 

Prevotella melanhogenlca 

(Bacteroides melaninogenicus) 

Prevotella ruminicola 

Propionibacterium acnes 
105 Propionibacterium avidum 

Propionibacterium granulosum 

Propionibacterium propionicum 

Propionibacterium species 

Proteus mirabilis 
110 Proteus penneri 

Proteus species 

Proteus vulgaris 

Prototheca species 

Prototheca wickerhamil 
115 Prototheca zopfii 

Providencia alcalifaciens 

Providencia helmbachae 

Providencia rettgeri 

Providencia rustigianii 
1 20 Providencia species 

Providencia stuartii 

Providencia stuartii urea 4 

Pseudomonas (Chryseomonas) 

tuteola 

125 Pseudomonas addovorans 

Pseudomonas aeruginosa 

Pseudomonas alcaligenes 

Pseudomonas cepacia 

Pseudomonas chtororaphis (P. 
1 30 aureofaciens) 

Pseudomonas fluorescens 

Pseudomonas fluorescens group 

Pseudomonas mendocina 

Pseudomonas pseudoalcaligenes 
1 35 Pseudomonas putida 

Pseudomonas species 

Pseudomonas stutzeri 

Pseudomonas testosteroni 

Pseudomonas vesicutaris 
1 40 Pseudoramibacter [Eubacterium) 

alactolyticus 

Psychrobacter (Moraxella) 

phenylpyruvicus 

Rahnella aquavits 
1 45 Ralstonia (Pseudomonas, 

Burkholderia) pickettii 

Rhodococcus (Corynebacterium) equi 

Rhodococcus species 

Rhodosporidium toruloides 
150 Rhodotorula gtutinls 

Rhodotorula minute 

Rhodotorula mucitaginosa (ft rubra) 

Rhodotorula species 

Rickettsia species 
1 55 Rothia dentocariosa 

Saccharomyces cerevisiae 



Saccharomyces extgvus 

Saccharomyces kluyverii 

Saccharomyces species 
160 Sakaguchia dacryoides 

(Rhodosporidium dacryoidum) 

Salmonella anzonae 

Salmonella choieraesuis 

Salmonella enteritidis 
165 Salmonella gattinarum 

Salmonella paratyphi A 

Salmonella paratyphi B 

Salmonella pulforum 

Salmonella species 
170 Salmonella typhi 

Salmonella typhimurium 

Salmonella typhisuis 

Salmonella/Arizona 

Serratia ficaria 
175 Serratia fcnticcla 

Serratia grimesii 

Serratia liquefaciens 

Serratia marcescens 

Serratia odorifera 
180 Serratia odorifera type 1 

Serratia odorifera type 2 

Serratia pfymuthica 

Serratia proteamaculans 

Serratia proteamaculans subsp. 
185 proteamaculans 

Serratia proteamaculans subsp. 

quinovora 

Serratia njbidaea 

Serratia species 
1 90 Shewanelta (Pseudomonas, 

Atteromonas) putrefaciens 

Shigella boydii 

Shigella dysenteriae 

Shigella flexneri 
195 Shigella sonnei 

Shigella species 

Sphingobacterium multivorum 

Sphingobecterium species 

Sphingobacterium spiritivorum 
200 Sphingobacterium thalpophHum 

Sphingomonas [Pseudomonas) 

paucimobiJis 

Sporidiobolus salmonicolor 

Sporoboiomyces roseus 
205 Sporoboiomyces salmonicolor 

Sporoboiomyces species 

Staphylococcus (Peptococcus) 

saccharotyticus 

Staphylococcus ariettae 
2 1 0 Staphylococcus aureus 

Staphylococcus aureus (Coagulase- 

negative) 

Staphylococcus auriculans 
Staphylococcus capitis 
2 1 5 Staphylococcus capitis subsp. capitis 
Staphylococcus capitis subsp. 
ureotytlcus 

Staphylococcus caprae 

Staphylococcus camosus 
220 Staphylococcus caseolyticus 

Staphylococcus chromogenes 

Staphylococcus cohnil 

Staphylococcus cohnil subsp. cohnil 

Staphylococcus cohnii subsp. 
225 urealyticum 

Staphylococcus epidermidis 

Staphylococcus equowm 

Staphylococcus gallinarum 

Staphylococcus haemoiyticus 
230 Staphylococcus hominfs 

Staphylococcus hominis subsp. 

hominfs 

Staphylococcus hominis subsp. 
novobiosepticus 
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Staphylococcus hylcus 
Staphylococcus intermedius 
Staphylococcus kloosii 
5 Staphylococcus tentus 
Staphylococcus lugdunensis 

Staphylococcus saprophytics 

Staphylococcus schleiferi 
Staphylococcus sciuri 

1 0 Staphylococcus simutans 
Staphylococcus species 
Staphylococcus wameri 
Staphylococcus xylosus 
Stenotrophomonas [Xanthomonas) 

15 mattophllia 

Stephanoascus cifem'i 
Stomatococcus mucilaginosus 
Streptococcus acidominimus 
Streptococcus agalactiae 

20 Streptococcus agalactiae (Group B) 
Streptococcus agalactiae hemolytic 
Streptococcus agalactiae non- 
hemolytic 

Streptococcus alactotyticus 
25 Streptococcus anginosus 

Streptococcus anginosus (Group D, 
nonenterococci) 

Streptococcus beta-hemolytic group A 
Streptococcus beta -hemolytic non- 
30 group A or B 

Streptococcus beta-hemolytic non- 
group A 

Streptococcus beta-hemolytic 

Streptococcus bovis (Group D, 
35 nonenterococci) 

Streptococcus bovis I 

Streptococcus bovis II 

Streptococcus canis 

Streptococcus consteilatus 
40 Streptococcus consteilatus 

(Streptococcus mitleh I) 

Streptococcus consteilatus (viridans 

Streptococcus) 

Streptococcus downei 
45 Streptococcus dysgalactiae subsp. 

dysgalactiae 

Streptococcus dysgalactiae subsp. 

equisimilis 

Streptococcus equi (Group C/Group G 
50 Streptococcus) 

Streptococcus equi subsp. equi 

Streptococcus equi subsp. 

zooepidemicus 

Streptococcus equinus 
55 Streptococcus equinus (Group D, 

nonenterococci) 

Streptococcus equisimilis 

Streptococcus equisimuiis (Group 

C/Group G Streptococcus) 



60 Streptococcus Gamma (non- 
hemolytic 

Streptococcus gordonii 

Streptococcus Group B 

Streptococcus Group C 
65 Streptococcus Group D 
Streptococcus. Group E 

Streptococcus Group F 

Streptococcus Group G 

Streptococcus Group L 
70 Streptococcus Group P 

Streptococcus Group U 

Streptococcus intermedius 

Streptococcus intermedius 

[Streptococcus milleri II) 
75 Streptococcus intermedius (viridans 

Streptococcus) 

Streptococcus milleri group 

Streptococcus mitis 

Streptococcus mitis (viridans 
80 Streptococcus) 

Streptococcus mitis group 

Streptococcus mutans 

Streptococcus mutans (viridans 

Streptococcus) 
85 Streptococcus oralis 

Streptococcus parasanguis 

Streptococcus pneumoniae 

Streptococcus porcinus 

Streptococcus pyogenes 
90 Streptococcus pyogenes (Group A) 

Streptococcus salivarius 

Streptococcus salivarius (viridans 

Streptococcus) 

Streptococcus salivarius subsp. 
95 salivarius 

Streptococcus salivarius subsp. 

thermophilus 

Streptococcus sanguis 

Streptococcus sanguis I (viridans 
100 Streptococcus) 

Streptococcus sanguis II 

Streptococcus sanguis II (viridans 

Streptococcus) 

Streptococcus sobrinus 
1 05 Streptococcus species 

Streptococcus suis I 

Streptococcus suis II 

Streptococcus uberis 

Streptococcus uberis (viridans 
110 Streptococcus) 

Streptococcus vestibularis 

Streptococcus zooepidemicus 

Streptococcus zooepidemicus (Group 

C) 

115 Streptomyces somaliensis 
Streptomyces species 
Suttonella (KingeUa) indologenes 
Tatumella ptyseos 



Tetragenococcus (Pediococcus) 
120 halophilus 

Torulaspora delbnjeckii 

(Saccharomyces rose/) 

Torutopsis Candida 

Torufopsis haemulonil 
125 J'cwjops^islnconspicua 

Treponema species 

Trichosporon asahii 

Trichosporon asteroides 

Trichosporon beigelii 
130 Trichosporon cutaneum 

Trichosporon inkin 

Trichosporon mucoides 

Trichosporon ovoides 

Trichosporon pufiulans 
1 35 Trichosporon species 

Turicella otitidis 

Ureaplasma species 

Ureaplasma ureatyticum 

Veillonella parvula (V. alcalescens) 
140 Veillonella species 

Vibrio alginolyticus 

Vibrio cholerae 

Vibrio damsela 

Vibrio Huvialis 
145 Vibrio fumissii 

Vibrio harveyl 

Vibrio holiisae 

Vibrio metschnikovii 

Vibrio mimicus 
1 50 Vibrio parahaemolyticus 

Vibrio species 

Vibrio species SF 

V7brio vulnificus 

Weekselia (Bergeylla) virosa 
155 Weekselia species 

Weekselia virosa 

Williopsis (Hansenula) satumus 

Xanthomonas campestris 

Xanthomonas species 
160 Yarrowia (Candida) lipolytics 

Yersinia aldovae 

Yersinia enterocolitis 

Yersinia enterocolitis group 

Yersinia frederiksenii 
165 Yersinia intermedia 

Yersinia intennedius 

Yersinia kristensenii 

Yersinia pestis 

Yersinia pseudotuberculosis 
1 70 Yersinia pseudotuberculosis SF 

Yersinia rvckeri 

Yersinia species 

Yokenella regensburgei 

Yokenella regensburgei (Koserelia 
175 trabuisil) 

Zygoascus hellenicus 

Zygosaccharomyces species 



1 The list includes microorganisms that may be identified by API identification test systems and VITEK® 
automated identification system from biolvterieux Inc., or by the MicroScan - WalkAway® automated 
systems from Dade Behring. Identification relies on classical identification methods using batteries of 
biochemical and other phenotypical tests. 
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Table 16. tuf gene sequences obtained in our laboratory (Example 42). 



Species 


Strain no. 


Gene 


GenBank Accession no/ 


~Abibtrophia~adiacens 


- -ATCC49175- - 


tuf 


AF124224 


Enterococcus avium 


ATCC14025 


tufA 


AF1 24220 






turn 


AF274715 


Enterococcus casseliflavus 


ATCC25788 


tufA 


AF274716 






turn 


AF274717 


Enterococcus cecorum 


ATCC43198 


tuf 


AF274718 


Enterococcus columbae 


ATCC51263 


tuf 


AF274719 


Enterococcus dispar 


ATCC51266 


tufA 


AF274720 






turn 


AF274721 


Enterococcus durans 


ATCC19432 


tufA 


AF274722 






turn 


AF274723 


Enterococcus faecalis^ 


ATCC29212 


tuf 


AF1 24221 


Enterococcus faecium 


ATCC 19434 


tufA 


AF1 24222 






turn 


AF274724 


Enterococcus gailinarum 


ATCC49573 


tufA 


AF1 24223 






turn 


AF274725 


Enterococcus hirae 


ATCC8043 


tufA 


AF274726 






turn 


AF274727 


Enterococcus malodoratus 


ATCC43197 


tufA 


AF274728 






turn 


AF274729 


Enterococcus mundtii 


ATCC43186 


tufA 


AF274730 






turn 


AF274731 


Enterococcus pseudoavium 


ATCC49372 


tufA 


AF274732 






turn 


AF274733 


Enterococcus raffinosus 


ATCC49427 


tufA 


AF274734 






turn 


AF274735 


Enterococcus saccharotyticus 


ATCC43076 


tuf 


AF274736 


Enterococcus solitarius 


ATCC49428 


tuf 


AF274737 


Enterococcus suffureus 


ATCC49903 


tuf 


AF274738 


Lactococcus lactis 


ATCC11154 


tuf 


AF274745 


Listeria monocytogenes 


ATCC15313 


tuf 


AF274746 


Listeria seeligeri 


ATCC35967 


tuf 


AF274747 


Staphylococcus aureus 


ATCC25923 


tuf 


AF274739 


Staphylococcus epidermidis 


ATCC14990 


tuf 


AF274740 


Streptococcus mutans 


ATCC25175 


tuf 


AF274741 


Streptococcus pneumoniae 


ATCC6303 


tuf 


AF274742 


Streptococcus pyogenes 


ATCC19615 


tuf 


AF274743 


Streptococcus suis 


ATCC43765 


tuf 


AF274744 



'Corresponding sequence ID NO. for the above ATCC strains are given in table 7. 
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Table 17. tuf gene sequences selected from databases for Example 42. 



Species 


Gene 


Accession no.* 


Agrobacterium tumefaciens 


tufA 


X99673 


turn 


X99674 


Anacystis nidulans 


tuf 


X17442 


Aquifex aeolicus 


tufA 


AE000657 


tufB 


AE000657 


Bacillus stearothermophiius 


tuf 


AJ000260 


Bacillus subtitis 


tuf " 


AE009T26 


Bacteroides fragilis 


tuf 


P33165 


Borrelia burgdorferi 


tuf 


AE000783 


Brevibacterium linens 


tuf 


X76863 


Bulkholderia cepacia 


tuf 


P33167 


Campylobacter jejuni 


tufB 


Y17167 


Chlamydia pneumoniae 


tuf 


AE001363 


Chlamydia trachomatis 


tuf 


M74221 


Corynebacterium glutamicum 


tuf 


X77034 


Cytophaga lytica 


tuf 


X77035 


Deinococcus radiodurans 


tuf 


•AE000513 


Escherichia coll 


tufA 


J01690 




tufB 


J01717 


Fervidobacterium islandicum 


tuf 


Y15788 


Haemophilus influenzae 


tufA 


L42023 




tufB 


L42023 


Helicobacter pylori 


tuf 


AE000511 


Homo sapiens (Human) 


EF-1a 


X03558 


Methanococcus jannaschii 


EF-1a 


U67486 


Mvcobacterium leprae 


tuf 


D13869 


Mycobacterium tuberculosis 


tuf 


X63539 


Mycoplasma genitalium 


tuf 


L43967 


Mycoplasma pneumoniae 


tuf 


U00089 


Neisseria gonorrhoeae 


tufA 


L36380 


Nicotiana tabacum (Tobacco ) 


EF-1a 


U04632 


Peptococcus niger 


tuf 


X76869 


Planobispora rosea 


. tuff 


U67308 


Saccharomyces cerevisiae (Yeast) 


EF-1a 


X00779 


Salmonella typhimurium 


tufA 


X55116 




tufB 


X55117 


Shewanella putrefaciens 


tuf 


P33169 


Spirochaeta aurantia 


tuf 


X76874 


Spirulina platensis 


tufA 


X15646 


Streptomyces aureofaciens 


tuf1 


AF007125 


Streptomyces cinnamoneus 


tuft 


X98831 


Streptomyces coeticolor 


tuft 


X77039 




tuf 3 


X77040 


Streptomyces collinus 


tuff 


S79408 


Streptomyces ramocissimus 


tuf1 


X67057 




tuf2 


X67058 




tuf3 


X67059 


Synechocystis sp. 


tuf 


AB001339 


Taxeobacter oceflatus 


tuf 


X77036 


Thermotoga mahtima 


tuf 


AE000512 


Thermus aquaticus 


tuf 


X66322 


Thermus thermophilus 


tuf 


X06657 


Thiobacillus cuprinus 


tuf 


U78300 


Treponema pallidum 


tuf 


AE000520 


Wolinella succinogenes 


tuf 


X76872 



* Sequence data were obtained from GenBank, EMBL, and SWISSPROT databases. Genes were designated 
appeared in the references. 
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Table 18. Nucleotide and amino acid sequence Identities of EF-Tu between different enterococcl 
and other low G+C gram-positive bacteria. 

The upper right triangle represents the deduced amino acid sequence identities of gram-positive bacterial 
EF-Tu, while the lower left triangle represents the DNA sequence identities of the corresponding tuf genes. 
The sequence identities between different enterococcal tufA genes are boxed while those between 
enterococcal tufB genes are shaded. 
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Taxon Strain* ' Strainf ^ 16SrDNAs quence 

accession number 

Cedecea ATCC 33431 T 

Cedecea lapagei ATCC 33432 T 



Cedecea neteh 


ATCC 33855 T 






Citrobacter amalonaticus 


ATCC 25405 T 


CDC 9020-77 1 


AF025370 


Citrobacter braakii 


ATCC 43162 


CDC 080-58 T 


AF025368 


Citrobacter farmeri 


ATCC 51112 T 


CDC 2991-81 1 


AF025371 


Citrobacter freundii 


ATCC 8090 T 


DSM 30039 T 


AJ233408 


Citrobacter koseri 


ATCC 271 56 T 






Citrobacter sedlakii 


ATCC 51115 T 


CDC 4696-86 1 


AF025364 


Citrobacter werkmanii 


ATCC 51114 T 


CDC 0876-58 1 


AF025373 


Citrobacter youngae 


ATCC 29935 1 " 






Edwardsiella hoshinae 


ATCC 33379 1 " 






Edwardsiella tarda 


ATCC 15947 T 


CDC4411-6B 


AF015259 


Enterobacter aerogenes 


ATCC 13048 T 


JCM 1235 T 


AB004750 


Enterobacter agglomerans 


ATCC 27989 


JCM 1237 T 




Enterobacter amnigenus 


ATCC 33072 T 


AB004749 


Enterobacter asburiae 


ATCC 35953 1 " 


JCM 6051 T 


AB004744 


Enterobacter cancerogenus 


ATCC 3531 7 T 






Enterobacter cloacae 


ATCC 13047 T 






Enterobacter gergoviae 


ATCC 33028 T 


JCM1234 1 


AB004748 


Enterobacter hormaechei 


ATCC 491 62 T 






Enterobacter sakazakii 


ATCC 29544 T 


JCM 1233 T 


AB004746 


Escherichia coli 


ATCC11775 1 


ATCC 11775 T 


X80725 


Escherichia coli 


ATCC 25922 


ATCC 25922 


X80724 


Escherichia coli (ETEC) 


ATCC 35401 






Escherichia coli (01 57;H7) 


ATCC 43895 


ATCC 43895 


Z83205 


Escherichia fergusonii 


ATCC 35469 T 






Escherichia hermanii 


ATCC 33650 T 






Escherichia vulneris 


ATCC 33821 T 


ATCC 33821 T 


X80734 


Ewingella americana 


ATCC 33852 T 


NCPPB 3905 


X88846 


Hafnia alvei 


ATCC13337 1 


ATCC 13337 T 


M59155 


Klebsiella omithinolytica 


ATCC 31898 


CIP 103.364 


U78182 


Klebsiella oxytoca 


ATCC 33496 








ATCC 13182 T 


U78183 


Klebsiella planticola 


ATCC 33531 T 


JCM 7251 T 


AB004755 


Klebsiella pneumoniae 








subsp. pneumoniae 


ATCC 13883 T 


DSM30104 1 


AJ233420 


subsp. ozaenae 


ATCC 11296 T 


ATCC 11296 1 " 


Y17654 


subsp. rhinoscleromatis 


ATCC13884 1 
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Table 19. Strains analyzed in Example 43 (continued). 



Tax on 


Strain* 


Stralnf 


16S rDNA sequence 
accession number 


Kluyvera ascorbata 


ATCC 33433 T 


ATCC 14236 


Y07650 


Ktuyvera cryocrescens 


ATCC 33435 1 " 






Kluyvera georgiana 


ATCC 51603 T 






Leclercia adecarboxylata 


ATCC 23216 T 








Leminorella grimontii 


ATCC 33999 T 


DSM5078 1 


AJ233421 


Moellerella wisconsensis 


ATCC 3501 7 T 






Morganella morganii 


ATCC 25830 T 






Pantoea agglomerans 


ATCC 271 55 T 


DSM3493 1 


AJ233423 


Pantoea dispersa 


ATCC 14589 T 




Plesiomonas shigelloses 


ATCC 14029 T 






Pragia fontsum 


ATCC 491 00 T 


DSM5563 1 


AJ233424 


Proteus mirabilis 


ATCC 25933 






Proteus penneri 


ATCC 3351 9 T 






Proteus vulgaris 


ATCC 1 331 5 T 


DSM30118 1 


AJ233425 . 


Providencia alcalifaciens 


ATCC 9886 T 






Providencia rettgeri 


ATCC 9250 






Providencia rustigianii 


ATCC 33673 T 






Providencia stuartii 


ATCC 33672 






Rahnella aquatilis 


ATCC 33071 T 


DSM4594 1 


AJ233426 


Salmonella choleraesuis 








subs p. arizonae 


ATCC 1 331 4 T 






subsp. choleraesuis 








serotype Choleraesuis 


ATCC 7001 






serotype EnteritidisJ 


ATCC13076 1 


SE22 


SE22 


serotype Gallinarum 


ATCC 9184 






serotype Heidelberg 


ATCC 8326 






serotype Paratyphi A 


ATCC 9150 






serotype Paratyphi B 


ATCC 8759 






serotype Typhi* 


ATCC 10749 


Still 


U88545 


serotype Typhimuriumt 


ATCC 14028 






serotype Virchow 


ATCC 51955 






subsp. diarizonae 


ATCC 43973 T 






subsp. houtenae 


ATCC 43974 T 






subsp. indica 


ATCC 43976 T 






subsp. salamae 


ATCC 43972 T 






Serratia fonticola 


DSM4576 1 


DSM4576 1 


AJ233429 


Serratia grimesii 


ATCC14460 T 


DSM 30063 1 " 


AJ233430 


Serratia liquefaciens 


ATCC 27592 T 






Serratia marcescens 


ATCC13880 1 


DSM 301 21 T 


AJ233431 


Serratia odorifera 


ATCC 33077 T 


DSM 45B2 T 


AJ233432 


Serratia plymuthica 


DSM 4540 T 


DSM 4540 T 


AJ233433 


Serratia rubidaea 


DSM4480 1 


DSM4480 1 " 


AJ233436 


Shigella boydii 


ATCC 9207 


ATCC 9207 


X96965 


Shigella dysenteriae 


ATCC 11835 


ATCC 13313 T 
ATCC 25931 


X96966 
X96964 



240 



WO 01/23604 



PCT/CAOO/01150 



Table 19. Strains analyzed in Example 43 (continued). 



Taxon 


btrain 


Strawy 


16S rDN A sequence 
accession number 

••wwvwviwii Hull 1 l/v 1 


Shigella flexneri 


ATCC 12022 


ATCC 12022 


X96963 


Shigella sonnet 


ATCC 29930 T 






Tatumella ptyseos 


ATCC 33301 T 


DSM 5000 T 


AJ233437 


Trabulsiella guamensis 


ATCC 49490 T 






- -Yersinia enterocolitis — 


-ATCC 961 ol 


ATCC 961 0 T 


M59292 


Yersinia frederiksenii 


ATCC 33641 T 






Yersinia intermedia 


ATCC 29909 T 






Yersinia pestis 


RRB KIMD27 


ATCC19428 1 


X75274 


Yersinia pseudotuberculosis 


ATCC 29833 T 






Yersinia rohdei 


ATCC 43380 T 


ER-2935 1 " 


X75276 


Shewanella putrefaciens 


ATCC 8071 T 






Vibrio cholerae 


ATCC 25870 


ATCC14035 1 


X74695 



T Type strain 

5 'Strains used in this study for sequencing of partial tuf and atpD genes. SEQ ID NOs. for tuf and atpD sequences 
corresponding to the above reference strains are given in table 7. 

tStrains used in other studies for sequencing of 16S rDNA gene. When both strain numbers are on the same row, both 
strains are considered to be the same although strain numbers may be different. 
JPhylogenetic serotypes considered species by the Bacteriological Code (1990 Revision). 
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Table 20. PCR primer pairs used in this study 



Primer Sequence Nucleotide Amplicon 

SEQ ID NO. positions* length (bp) 



tuf 
664 

697 


5'-AAYATGATIACIGGIGCIGCICARATGGA- 
5'-CCIACIGTICKICCRCCYTCRCG-3' 


271-299 
1132-1156 


884 


atpD 

568 

567 


5'-RTIATIGGIGCIGTIRTIGAYGT-3' 
5'-TCRTCIGClGGIACRTAIAYIGCYTG-3' 


25-47 
883-908 


884 


700 
567 


5'-TIRTIGAYGTCGARTTCCCTCARG-3' 
5'-TCRTCIGCIGGIACRTAIAYIGCYTG-3* 


38-61 
883-908 


871 



*The nucleotide positions given are for E. coli tuf and atpD sequences (GenBank accession no. AE000410 
and V00267, respectively). Numbering starts from the first base of the initiation codon. 
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Annex It 


Specific and ubiquitous primers 
amplification (tuf s guences). 


for nucleic acid 


5 








Originating DNA fragm nt 




SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 
NO. 


Nucleotide 
position 














10 


Bacterial 




Acinetobacter baumannii 


. 




1692 


5' -GGT GAG 


AAC TGT GGT ATC TTA CTT 


1 


478-501 




1693 a 


D -Uil I IX. 


AAC GCC TTC TTT CAA CTG 


1 


691-714 


15 


Bacterial 


soecies: 


Chlamydia pneumoniae 








630 


5 ' -CGG AGC 


TAT CCT AGT CGT TTC A 


20 


2-23 




629 a 


D -AAb 1 lx, 


CAT CTC AAC AAG GTC AAT A 


20 


146-170 


20 


2085 


5 ' -CAA ACT 


AAA GAA CAT ATC TTG CTA 


20 


AC CO 

40-po 




2086 a 


5 '-ATA TAA 


TTT GCA TCA CCT TCA AG 


20 


237-259 




2087 


5' -TCA GCT 


CGT GGG ATT AGG AGA G 


20 


431-452 


25 


2088 a 


5 ' -AGG CTT 


CAC GCT GTT AGG CTG A 


20 


584-605 


Bacterial 


3pecie&: 


Chlamydia trachomatis 

WalatlUi^JUafi l»* I* V«* Villi" V 








554 


5' -GTT CCT 


TAC ATC GTT GTT TTT CTC 


22 


82-105 


30 


555 a 


5' -TCT CGA 


ACT TTC TCT ATG TAT GCA 


22 


249-272 


Parasitical species: 


Cryptosporidium parvum 








798 


5'-TGG TTG 


TCC CAG CCG ATC GTT T 


865 


158-179 


35 


804 a 


5' -CCT GGG 


ACG GCC TCT GGC AT 


865 


664-683 


799 


5'-ACC TGT 


GAA TAC AAG CAA TCT 


865 


280-300 




805 a 


5' -CTC TTG 


TCC ATC TTA GCA GT 


865 


895-914 




800 


5' -GAT GAA 


ATC TTC AAC GAA GTT GAT 


865 


307-330 


40 


806 a 


5' -AGC ATC 


ALL AbA L I I bAT AAVj 


ODD 


946-966 




801 


5 ' -ACA ACA 


CCG AGA AGA TCC CA 


865 


J JJ — J /z 




803 a 


5 '-ACT TCA 


GTG GTA ACA CCA GC 


865 


616-635 


45 


802 


5 ' -TTG CCA 


TTT CTG GTT TCG TT 


865 


377-396 




807 a 


5 ' -AAA GTG 


GCT TCA AAG GTT GC 


865 


981-1000 




pacterial 


Specie?: 


Enterococcus faecium 






50 


1696 


5'-ATG TTC 


CTG TAG TTG CTG GA 


64 


189-208 




1697 a 


5' -TTT CTT 


CAG CAA TAC CAA CAA C 


64 


422-443 




Bacterial 


species : 


Klebsiella pneumoniae 






55 


1329 


5 ' -TGT AGA 


GCG CGG TAT CAT CAA AGT A 


103 


352-377 




1330 a 


5 ' -AG A TTC 


GAA CTT GGT GTG CGG G 


103 


. 559-571 



a These sequences are from the complementary DNA strand of the s quence of the 
60 originating fragment given in the Sequence Listing. 
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Ann x 1 1 Specific and ubiquitous primers for nucleic acid 
amplification (tuf s quences) (continued)* 



5 






Originating DNA fragment 




SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 

. NO. 


Nucleotide 
position 


_10__ 


Bacterial 


specif: 


Mycoplasma pneumoniae 








2093 
2094 b 


5 ' -TGT TGG 
5'-TTC AAT 


CAA TCG AAG ACA CC 

TTC TTG ACC TAC TTT CAA 


2097 a 
2097 a 


635-654 
709-732 


15 


Bacterial 


species: 


iv es j>00C»a la yuxiyxxiiucatJ 








551 
552 b 


5 ' -GAA GAA 
5' -TAC ACG 


AAA ATC TTC GAA CTG GCT A 
GCC GGT GAC TAC G 


126 
126 


256-280 
378-396 


20 


2173 
2174 b 


5' -AAG AAA 
5' -TCT ACA 


AAA TCT TCG AAC TGG CTA 
CGG CCG GTG 


126 
126 


257-280 
384-398 


25 


2175 
2176 b 


5'-CCG CCA 
5' -CGG CAT 


TAC CCC GTT T 

TAC CAT TTC CAC ACC TTT 


126 
126 


654-669 
736-759 


Bacterial species: 


Pseudomonas aeruginosa 






30 


1694 
1695 b 


5 ' -AAG GCA 
5' -ACG ATT 


AGG ATG ACA ACG GC 
TCC ACT TCT TCC TGG 


153 
153 


231-250 
418-438 


Bacterial species: 


Streptococcus agalactiae 






35 


549 . 
550 b 


5 ' -GAA CGT 
5 ' -GAA GAA 


GAT ACT GAC AAA CCT TTA 
GAA CAC CAA CGT TG 


207-210° 
207-210 c 


308-331 d 
520-539 d 


Bacterial 


species: 


Streptococcus pyogenes 








999 
1000 b 


5' -TTG ACC 
5' -TTA GTG 


TTG TTC ATC ACG AAG AG 
TGT GGG TTG ATT GAA CT 


1002 
1002 


143-165 
622-644 


1001 
1000 b 


5 ' -AAG AGT 
5' -TTA GTG 


TGC TTG AAT TAG TTG AG 
TGT GGG TTG ATT GAA CT 


1002 
1002 


161-183 
622-644 


45 


Parasitical species: 


Trypanosoma brucei 






820 
821 b 


5' -GAA GGA 
5'-GGC GCA 


GGT GTC TGC TTA CAC 
AAC GTC ACC ACA TCA 


864 
864 


513-533 
789-809 


50 


820 
822 b 


5' -GAA GGA 
5'-CGG CGG 


GGT GTC TGC TTA CAC 
ATG TCC TTA ACA GAA 


864 
864 


513-533 
909-929 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
55 originating fragment given in the Sequence Listing. 

c These sequences were aligned to derive the corresponding primer. 

d The nucleotide positions refer to the S. agalactiae tuf sequ nee fragment 
<SEQ ID NO. 209) . 
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Annex I: Specific and ubiquitous primers for nucleic acid 
amplification (tuf s quences) (continu d) . 

~™ ^ "~" Originating DNA fragment 

5 SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

_ 10 Parasitical spgcies : _ Trypanosoma cruzi 

794 5'-GAC GAC AAG TCG GTG AAC TT 840-842 a 281-300° 

795 b 5 '-ACT TGC ACG CGA TGT GGC AG 840-842 a 874-893 c 

15 gacterial genus: Clostridium sp. 

796 5 ' -GGT CCA ATG CCW CAA ACW AGA 32,719- 32-52 d 

724,736 a 

797 b 5 ' -CAT TAA GAA TGG YTT ATC TGT SKC TCT 32,719- 320-346 d 

20 724,736 a 

808 5'-GCI TTA IWR GCA TTA GAA RAY CCA 32,719- 224-247 d 

724,736 a 

809 b 5' -TCT TCC TGT WGC AAC TGT TCC TCT 32,719- 337-360 d 



25 



30 



35 



40 



724,736 a 

810 5 ' -AGA GMW ACA GAT AAR SCA TTC TTA 32,719- 320-343 d 

724,736 a 

811 b 5'-TRA ART AGA ATT GTG GTC TRT ATC C 32,719- 686-710 d 



724,736 a 



Bacterial genus : CoryneJbacterium sp. 



45 



545 


5' 


-TAC 


ATC 


CTB 


GTY GCI 


CTI 


AAC AAG TG 


34-44, 662 a 


89-114 e 


546 b 


5' 


-CCR CGI 


CCG 


GTR ATG 


GTG 


AAG AT 


34-44, 662 a 


350-372 e 


P^Cterial 


qenus 






Enterococcus 


sp. 






656 


5' 


-AAT 


TAA 


TGG 


CTG CAG 


TTG 


AYG A 


58-72 a 


273-294 f 


657 b 


5' 


-TTG 


TCC 


ACG 


TTC GAT 


RTC 


TTC A 


58-72 a 


556-577 f 


656 


5' 


-AAT 


TAA 


TGG 


CTG CAG 


TTG 


AYG A 


58-72 a 


273-294 f 


271 b 


5' 


-TTG 


TCC 


ACG 


TTG GAT 


RTC 


TTC A 


58-72 a 


556-577 f 


1137 


5' 


-AAT 


TAA 


TGG 


CTG CWG 


TTG 


AYG AA 


58-72 a 


273-295 f 


1136 b 


5' 


-ACT 


TGT 


CCA 


CGT TSG 


ATR 


TCT 


58-72 a 


559-579 f 



a These sequences were aligned to derive the corresponding primer. 

b These sequences are from the complementary DNA strand of the sequence of the 
50 originating fragment given in the Sequence Listing. 

c The nucleotide positions refer to the T. cruzi tuf sequence fragment (SEQ ID 
NO. 842) . 

d The nucleotide positions refer to the C. p rfringens tuf sequence fragment 
(SEQ ID NO. 32) . 

55 e The nucleotide positions refer to the C. diphtheriae tuf sequence fragment 
(SEQ ID NO. 662) . 

f The nucleotide positions refer to the E. durans tuf sequence fragment {SEQ ID 
NO. 61). 
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Annex I: Specific and ubiquitous primers for nucl ic acid 
amplification (tuf eequ nces) (continued). 











Originating DNA fragment 


5 


SEQ ID NO. 


Nucleotide sequence 


SEQ ID 


Nucleotide 










NO. 


nnsi t ion 


-10- 


— Bacteria 1-oenu a.: 


Legionella's^. 








2081 


5'-GRA 


TYR TYA AAG TTG GTG AGG AAG 


lll-112 a 


411-434 b 




2082 c 


5'-CMA 


CTT CAT CYC GCT TCG TAC C 


lll-112 a 


548-569 b 


15 


Bacterial 


qenus : 


Staphylococci sp. 








553 


5'-GGC 


CGT GTT GAA CGT GGT CAA ATC A 


176-203 a 


313-337 d 




575 c 


5 1 -TIA 


CCA TTT CAG TAC CTT CTG GTA A 


176-203 a 


653-677 d 


20 


553 


5'-GGC 


CGT GTT GAA CGT GGT CAA ATC A 


176-203 a 


313-337d 




707 c 


5 ' -TWA 


CCA TTT CAG TAC CTT CTG GTA A 


176-203 a 


653-677 d 




Pacterial 


qenujs : 


OCX Lycuut-uo t>}f m 


• 




25 


547 


5 ' -GTA 


CAG TTG CTT CAG GAC GTA TC 


206-231 a 


372-394© 




548 c 


5 ' -ACG 


TTC GAT TTC ATC ACG TTG 


206-231 a 


548-568 e 




Fwrcal qenus: 


Candida sp. 






30 


576 


5' -AAC 


TTC RTC AAG AAG GTY GGT TAC AA 


407-426, 


332-357 f 










428-432 a 






632 c 


5 ' -CCC 


TTT GGT GGR TCS TKC TTG GA 


407-426, 


791-813 f 










428-432 a 




35 


631 


5 ' -CAG 


ACC AAC YGA IAA RCC ATT RAG AT 


407-426, 


523-548 f 










428-432 a 






632 c 


5' -CCC 


TTT GGT GGR TCS TKC TTG GA 


407-426, 


791-813 f 










428-432 a 




40 


633 


5' -CAG 


ACC AAC YGA IAA RCC ITT RAG AT 


407-426, 


523-548 f 










428-432 a 






632 c 


5 ' -CCC 


TTT GGT GGR TCS TKC TTG GA 


407-426, 


791-813 f 










428-432 a 





45 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the L. pneumophila tuf sequence fragment 
(SEQ ID NO. 112) . 

c These sequences are from the complementary DNA strand of the sequence of the 
50 originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the S. aureus tuf sequence fragment (SEQ ID 
NO. 179) . 

e The nucleotide positions refer to the S. agalactiae tuf sequence fragment 
(SEQ ID NO. 209) . 

55 f The nucleotide positions refer to th C. albicans tuf(EF-l) sequence fragment 
{SEQ ID NO. 408) . 
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Annex It Specific and ubiquitous primers for nucl ic acid 
amplification (tuf sequences) (continued). 



Originating DNA fragment 



3 


SEQ ID NO. 


Nucleotide sequence 


SEQ ID 
NO. 


Nucleotide 
no si t i on 


10 


Funqal qenu$: 


Cryptococcus sp. 








1971 


5 ' -CYG 


ACT GYG CCA TCC TYA TCA 


434, 623, 1281, 
1985 1986 a 


150-170 b 




1973 c 


5'-RAC 


ACC RGI YTT GGW ITC CTT 


434,623,1281, 


464-484 b 


15 








1985,1986 a 






1972 


5 ' -MGI 


CAG CTC ATY ITT GCW KSC 


434 623.1281. 
19B5,1986 a 


260-280 b 

4b V V 4b V V 




197 3 V 


D -RAC 


ACC RGI YTT GGW ITC CTT 


434,623,1281, 


464-484 b 


20 


Parasitical 


genus: 


Entamoeba sp. 


1985,1986 a 






703 


5 ' -TAT 


GGA AAT TCG AAA CAT CT 


512 


38-57 


25 


704 c 


5 ' -AGT 


GCT CCA ATT AAT GTT GG 


512 


442-461 




703 


5 ' -TAT 


GGA AAT TCG AAA CAT CT 


512 


38-57 




705 c 


5 ' -GTA 


CAG TTC CAA TAC CTG AA 


512 


534-553 


30 


703 


5 ' -TAT 


GGA AAT TCG AAA CAT CT 


512 


38-57 




706 c 


5 ' -TGA 


AAT CTT CAC ATC CAA CA 


512 


768-7B7 




793 


5 ' -TTA 


TTG TTG CTG CTG GTA CT 


512 


149-168 


35 


704 c 


5 ' -AGT 


GCT CCA ATT AAT GTT GG 


512 


442-461 


Parasitica?. 


oenus : 


Giardia sp. 








816 


5 ' -GCT 


ACG ACG AGA TCA AGG GC 


513 


305-324 


40 


819° 
817 


5 ' -TCG 
5 ' -TGG 


AGC TTC TGG AGG AAG AG 
AAG AAG GCC GAG GAG TT 


513 
513 


895-914 
355-374 




818 c 


5 ' -AGC 


CGG GCT GGA TCT TCT TC 


513 


825-844 


45 


Parasitical 


oenus : 


Leishmania sp. 








701 


5'-GTG 


TTC ACG ATC ATC GAT GCG 


514-526 a 


94-114 d 




702 c 


5 ' -CTC 


TCG ATA TCC GCG AAG CG 


514-526 a 


913-932 d 



50 



55 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the C. neoformans tuf (EF-1) sequence 
fragment (SEQ ID NO. 623). 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the L. tropica tuf (EF-1) sequ nee fragment 
(SEQ ID NO. 526) . 
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Annex I: Specific and ubiquitous primers for nucl ic acid 
amplification (tuf sequences) (continued)* 

______ __ Originating DNA fragment 

5 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

■ NO. position 

10 Parasitical genus : Trypanosoma sp. 

823 5 ' -GAG CGG TAT GAY GAG ATT GT 529,840- 493-512 b 

842,864 a 

824 c 5' -GGC TTC TGC GGC ACC ATG CG 529,840- 1171-1190 b 

15 842,864 a 

Bacterial family : Enterobacteriaceae 

933 5 '-CAT CAT CGT ITT CMT GAA CAA RTG 78,103,146, 390-413 d 

20 168,238, 698 a 

934 c 5'-TCA CGY TTR RTA CCA CGC AGI AGA 78,103,146, 831-854 d 



25 



30 



35 



168,238,698 a 



Bacterial family : Mycobacteriaceae 



539 5'-CCI TAC ATC CTB GTY GCI CTI AAC AAG 122 85-111 

540 c 5'-GGD GCI TCY TCR TCG WAX TCC TG 122 181-203 

Bacterial group : Escherichia coli and Shigella 

1661 5'-TGG GAA GCG AAA ATC CTG 1668 e 283-300 

1665 c 5'-CAG TAC AGG TAG ACT TCT G 1668 e 484-502 

Bacterial group : Pseudomonads group 

541 5'-GTK GAA ATG TTC CGC AAG CTG CT 153-155 3 476-498 f 

542 c 5 '-CGG AAR TAG AAC TGS GGA CGG TAG 153-155 a 679-702 f 

541 5'-GTK GAA ATG TTC CGC AAG CTG CT 153-155 a 476-498 f 

40 544 c 5'-AYG TTG TCG CCM GGC ATT MCC AT 153-155 a 749-771 f 



a These sequences were aligned to derive the corresponding primer. 

45 b The nucleotide positions refer to the T. Jbrucei tuf (EF-1) sequence fragment 
{SEQ ID NO. 864) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the E. coli tuf sequence fragment (SEQ ID 
50 NO. 698). 

e Sequence from databases. 

f The nucleotide positions refer to the P. aeruginosa tuf sequence fragment 
(SEQ ID NO. 153) . 
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Annex I: Specific and ubiquitous primers for nucl ic acid 
amplification (tuf s gu nc s) (continued). 

Originating DNA fragment 

5 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 Parasitical group : Trypanosoma tidae family 

923 5'-GAC GCI GCC ATC CTG ATG ATC 511,514-526, 166-188 b 

529,840-842, 
864 a 

15 924 c 5'-ACC TCA GTC GTC ACG TTG GCG 511,514-526, 648-668 b 

529,840-842, 

864 a 

925 5'-AAG CAG ATG GTT GTG TGC TG 511,514-526, 274-293 b 

20 529,840-842, 

864 a 

926 c 5'-CAG CTG CTC GTG GTG CAT CTC GAT 511,514-526, 676-699 b 

529,840-842, 

25 864 a 

927 5 ' -ACG CGG AGA AGG TGC GCT T 511,514-526, 389-407 b 

529,840-842, 

864 a 

928 c 5'-GGT CGT TCT TCG AGT CAC CGC A 511,514-526, 778-799 b 

30 529,840-842, 

864 a 

Universal primers (bacteria) 

35 636 5 '-ACT GGY GTT GAI ATG TTC CGY AA 7,54,78, 470-492 d 

. . 100,103,159, 

209,224,227 b 

637° 5' -ACG TCA GTI GTA CGG AAR TAG AA 7,54,78, 692-714 d 

100,103,159, 

40 209,224,227 b 

638 5'-CCA ATG CCA CAA ACI CGT GAR CAC AT 7,54,78, 35-60 e 

100,103,159, 

209,224,227 b 

45 639 c 5'-TTT ACG GAA CAT TTC WAC ACC WGT IAC A 7,54,78, 469-496 e 

100,103,159, 

209,224,227 b 



50 a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the L. tropica tuf (EF-1) sequence fragment 
(SEQ ID NO. 526) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
55 d The nucleotide positions refer to the E. coli tuf sequence fragment (SEQ ID 
NO. 78) . 

e The nucleotide positions refer to the B. cereus tuf sequence fragment (SEQ ID 
NO. 7) . 
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5 



25 



50 



Annex It Specific and ubiquitous primers for nucl ic acid 
amplification (tuf segu nces) (continu d) . 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucl otide 

NO. position 



10 Universal primers (bacteria) (continued) 



643 5'-ACT GGI GTI GAR ATG TTC CGY AA 1,3,4/7,12, 470-492 b 

13,16,49,54, 
72,78,85,88, 

15 91,94,98,103, 

108,112,115, 
116,120,121, 
126,128,134, 
136,146,154, 

20 159,179,186, 

205,209,212, 

224,238 a 

644 c 5'-ACG TCI GTI GTI CKG AAR TAG AA same as SEQ 692-714 b 



ID NO. 643 



643 5 ' -ACT GGI GTI GAR ATG TTC CGY AA 1,3,4,7,12, 470-492 b 

13,16,49,54, 
72,78,85,88, 
91,94,98,103, 

30 108,112,115, 

116,120,121, 
126,128,134, 
136,146,154, 
159,179,186, 

35 205,209,212, 

224,238 a 

645° 5'-ACG TCI GTI GTI CKG AAR TAR AA same as SEQ 692-714 b 

ID NO. 643 

40 646 5'-ATC GAC AAG CCI TTC YTI ATG SC 2,13,82 317-339 d 

122,145 a 

647 c 5'-ACG TCC GTS GTR CGG AAG TAG AAC TG 2,13,82 686-711 d 

122,145 a 

45 646 5'-ATC GAC AAG CCI TTC YTI ATG SC 2,13,82 317-339 d 

122,.145 a 

648 c 5'-ACG TCS GTS GTR CGG AAG TAG AAC TG 2,13,82 686-711 d 



122,145 a 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the E. coii tuf sequence fragment (SEQ ID 
NO. 78) . 

c These sequences are from the complementary DNA strand of the sequence of the 
55 originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the A. meyeri tuf sequence fragment (SEQ ID 
NO. 2) 
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Annex I: Specific and ubiquitous primers for nucleic acid 
amplification (tuf sequences) (continued). 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Universal primers (bacteria) (continued) 



649 
650 c 

636 
651 c 



1974 
1975 c 

1976 

1978 c 

1977 
1979 c 

1981 



1980 c 



5'-GTC CTA TGC CTC ARA CWC GIG AGC 
5 ' -TTA CGG AAC ATY TCA ACA CCI GT 

5 '-ACT GGY GTT GAI ATG TTC CGY AA 
5'-TGA CGA CCA CCI TCY TCY TTY TTC 

Universal primers (fungi) 

5 ' - ACA AGG GIT GGR MSA AGG AGA C 
5'-TGR CCR GGG TGG TTR AGG ACG 

5' -GAT GGA YTC YGT YAA ITG GGA 

5 '-CAT CIT GYA ATG GYA ATC TYA AT 

5 ' -GAT GGA YTC YGT YAA RTG GGA 
5 ' -CAT CYT GYA ATG GYA ASC TYA AT 

5 ' -TGG ACA CCI SCA AG I GGK CYG 



AC 8,86,141,143 a 33-58 b 
8,86,141, 143 a 473-495 b 

8,86,141,143a 473-495 b 
8,86,141,143 a 639-663 b 



404,405,433, 443-464 d 
445,898,1268, 

1276,1986 a 

404,405,433, 846-866 d 
445,898,1268, 

1276,1986 a 

407-412, 286-306 e 
414-426,428- 
431,439,443,447, 
448,622,624,665, 

1685,1987-1990 a 

same as SEQ 553-575 e 
ID NO. 1976 



same as SEQ 
ID NO. 1976 

same as SEQ 
ID NO. 1976 



286-306 e 
553-575 e 

281-301 d 



5'-TCR ATG GCI TCI AIR AGR GTY T 



401-405, 
433,435,436, 
438,444,445,449, 
453,455,457,779, 
781-783,785,786, 
788-790,897-903, 
1267-1272,1274-1280, 
1282-1287, 1991-1998 a 

same as SEQ 488-509 d 
ID NO. 1981 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the B. distasonis tuf sequence fragment 
(SEQ ID NO. 8) . 

55 c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
d The nucleotide positions refer to the A. fumigatus tuf (EF-1) sequence 

fragment (SEQ ID NO. 404). 
e The nucleotide positions refer to the C. albicans tuf (EF-1) sequence fragment 
60 (SEQ ID NO. 407) . 
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Annex I; Specific and ubiquitous primers f r nucleic acid 
amplification (tuf sequences) (continued). 



Originating DNA fragment 





SEQ ID NO. 


Nucleotide 


sequence 








SEQ ID 
NO. 


Nucleotide 
position 


10 — 




Universal- primers. { fungi h 


(continued) 






1982 


5' -TGG ACA CYI SCA 


AGI GGK 


CYG 




same as SEQ 
ID NO. 1981 


281-301 a 


J J 


1980 b 


5' -TCR 


ATG 


GCI TCI 


AIR AGR 


GTY 


T 


same as SEQ 
ID NO. 1981 


488-509 a 




1983 


5' -CYG AYT GCG CYA 


TIC TCA 


TCA 




same as SEQ 
ID NO. 1981 


143-163 a 


20 


1980 b 


5 ' -TCR 


ATG 


GCI TCI 


AIR AGR 


GTY 


T 


same as SEQ 
ID NO. 1981 


488-509 a 




1984 


5 ' -CYG 


AYT 


GYG CYA 


TYC TSA 


<TCA 




same as SEQ 
ID NO. 1981 


143-163 a 


25 


1980 b 


5 ' -TCR ATG 


GCI TCI 


AIR AGR 




T 
I 


same as SEQ 
ID NO. 1981 


488-509 a 






Sequencing primers 














556 


5 ' -CGG 


CGC 


NAT CYT 


SGT TGT 


TGC 






668 c 


306-326 


30 


557 b 


5 ' -CCM 


AGG 


CAT RAC 


CAT CTC 


GGT 


G 




668 c 


1047-1068 




694 


5 ' -CGG 


CGC 


IAT CYT 


SGT TGT 


TGC 






668° 


306-326 




557b 


5 ' -CCM 


AGG 


CAT RAC 


CAT CTC 


GGT 


G 




668 c 


1047-1068 


35 


664 


5'-AAY 


ATG 


ATI ACI 


GGI GCI 


GCI 


CAR ATG 


GA 


619 c 


604-632 




652 b 


5 4 -CCW 


AYA 


GTI YKI 


CCI CCY 


TCY 


CTI ATA 




619 c 


1482-1508 




664 


5'-AAY 


ATG 


ATI ACI 


GGI GCI 


GCI 


CAR ATG 


GA 


619 c 


604-632 


40 


561 b 


5 1 -ACI 


GTI 


CGG CCR 


CCC TCA 


CGG 


AT 




619 c 


1483-1505 


543 


5'-ATC 


TTA 


GTA GTT 


TCT GCT 


GCT 


GA 




607 


8-30 




660 b 


5 ' -GTA 


GAA 


TTG AGG 


ACG GTA 


GTT 


AG 




607 


678-700 




658 


5 ' -GAT 


YTA 


GTC GAT 


GAT GAA 


GAA 


TT 




621 


116-138 


45 


659 b 


5 ' -GCT 


TTT 


TGI GTT 


TCW GGT 


TTR AT 




621 


443-465 




658 


5 ' -GAT 


YTA 


GTC GAT 


GAT GAA 


GAA TT 




621 


116-138 




661 b 


5 ' -GTA 


GAA 


YTG TGG 


WCG ATA 


RTT 


RT 




621 


678-700 


50 


558 


5 ' -TCI 


TTY 


AAR TAY 


GCI TGG 


GT 






665 c 


157-176 




559 b 


5 ' -CCG 


ACR 


GCR AYI 


GTY TGI 


CKC 


AT 




665 c 


1279-1301 




813 


5 ' -AAT 


CYG 


TYG AAA 


TGC AYC 


ACG 


A 




665 c 


687-708 


55 


559 b 


5 ' -CCG 


ACR 


GCR AYI 


GTY TGI 


CKC 


AT 




665 c 


1279-1301 



a The nucleotide positions refer to the A. fumigratus tuf (EF-1) sequence 
fragment (SEQ ID NO. 404). 

b These sequences are from the complementary DNA strand of the sequence of the 
60 originating fragment given in the Sequence Listing. 

c S quences from databases. 
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Annex I: Specific and ubiquitous primers for nucl ic acid 
amplification (tuf 8 quences) (continu d) • 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 

NO. 



Nucleotide 
position 



_1<L 



15 



20 



Sequencing primers (continued) 



25 



30 



35 



40 



558 


5 ' 


-TCI 


TTY 


AAR 


TAY 


GCI 


TGG 


GT 




665 a 


157-176 


815 b 


5' 


-TGG 


TGC 


ATY 


TCK 


ACR 


GAC 


TT 




665 a 


686-705 


560 


5' 


-GAY 


TTC 


ATY 


AAR 


AAY 


ATG 


ATY AC 




665 a 


289-311 


559 b 


5' 


-CCG 


ACR 


GCR 


AY I 


GTY 


TGI 


CKC AT 




665 a 


1279-1301 


653 


5' 


-GAY 


TTC 


ATI 


AAR 


AAY 


ATG 


AT 




665 a 


289-308 


559 b 


5' 


-CCG 


ACR 


GCR 


AY I 


GTY 


TGI 


CKC AT 




665 a 


1279-1301 


558 


5' 


-TCI 


TTY 


AAR 


TAY 


GCI 


TGG 


GT 




665 a 


157-176 


655 b 


5' 


-CCR 


ATA 


CCI 


CMR 


ATY 


TTG 


TA 




665 a 


754-773 


654 


5' 


-TAC 


AAR 


ATY 


KG I 


GGT 


ATY 


GG 




665 a 


754-773 


559 b 


5' 


-CCG 


ACR 


GCR 


AY I 


GTY 


TGI 


CKC AT 




665 a 


1279-1301 




•J 


-ATT 


an j 


PAY 


Ail 


RAY 


PAY 


aai AAR 


AP 


U J O 


52-77 

■J L. ft 


697 b 


5' 


-CCI 


ACI 


GTI 


CKI 


CCR 


CCY 


TCR CG 




698 a 


1132-1154 


911 


5' 


-GAC 


GGM 


KKC 


ATG 


CCG 


CAR 


AC 




853 


22-41 


914 b 


5' 


-GAA 


RAG 


CTG 


CGG 


RCG 


RTA 


GTG 




853 


700-720 


912 


5' 


-GAC 


GGC 


GKC 


ATG 


CCG 


CAR 


AC 




846 


20-39 


914 b 


5' 


-GAA 


RAG 


CTG 


CGG 


RCG 


RTA 


GTG 




846 


692-712 


913 


5' 


-GAC 


GGY 


SYC 


ATG 


CCK 


CAG 


AC 




843 


251-270 


915 b 


5' 


-AAA 


CGC 


CTG 


AGG 


RCG 


GTA 


GTT 




843 


905-925 


916 


5' 


-GCC 


GAG 


CTG 


GCC 


GGC 


TTC 


AG 




846 


422-441 


561 b 


5' 


-AC I 


GTI 


CGG 


CCR 


CCC 


TCA 


CGG AT 




619 a 


1483-1505 


664 


5' 


-AAY 


ATG 


ATI 


ACI 


GGI 


GCI 


GCI CAR 


ATG GA 


619 a 


604-632 


917 b 


5' 


-TCG 


TGC 


TAC 


CCG 


TYG 


CCG 


CCA T 




846 


593-614 



45 



a Sequences from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex I: Specific and ubiquitous primers for nucl ic acid 
amplification (tuf e qu nces) (continu d) • 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



^Sequenc ing primers ( cont inued ) 



1221 


5' 


-GAY 


AC I 


CCI 


GGI 


CAY 


GTI GAY 


TT 






1230 a 


292 


-314 


1226 b 


5' 


-GTI 


RMR 


TAI 


CCR 


AAC 


ATY TC 








1230 a 


2014 


-2033 


1222 


5' 


-ATY 


GAY 


AC I 


CCI 


GGI 


CAY GTI 


GAY 


TT 




1230 a 


289 


-314 


1223 b 


5' 


-AYI 


TCI 


ARR 


TGI 


ARY 


TCR' CCC 


ATI 


CC 




1230 a 


1408 


-1433 


1224 


5' 


-CCI 


GYI 


HTI 


YTI 


GAR 


CCI ATI 


ATG 






1230 a 


1858 


-1881 


1225 b 


5' 


-TAI 


CCR 


AAC 


ATY 


TCI 


SMI ARI 


GGI 


AC 




1230 a 


2002 


-2027 


1227 


5' 


-GTI 


CCI 


YTI 


KCI 


GAR 


ATG TTY 


GGI 


TA 




1230 a 


2002 


-2027 


1 9 9QO 


C t 

J 




ATY 
nl 1 




npT 






ATP 


AT 




698 a 


4- 




1228 


5' 


-GTI 


CCI 


YTI 


KCI 


GAR 


ATG TTY 


GGI 


TAY 


GC 


1230 a 


2002 


-2030 


1229 b 


5' 


-TCC 


ATY 


TGI 


GCI 


GCI 


CCI GTI 


ATC 


AT 




698 a 


4- 


29 


1999 


5' 


-CAT 


GTC 


AAY 


ATT 


GGT 


ACT ATT 


GGT 


CAT 


GT 498-500, 


25- 


53<* 






















502,505,506, 
























508 f 619,2004, 2005 c 




2000 b 


5' 


-CCA 


CCY 


TCI 


CTC 


AMG 


TTG AAR 


CGT 


T 


same as SEQ 


1133- 


1157 d 






















ID 


NO. 1999 






2001 


5' 


-ACY 


AC I 


TTR 


ACI 


GCY 


GCY ATY 


AC 




same as SEQ 


67- 


89 d 






















ID 


NO. 1999 






2003 b 


5' 


-CAT 


YTC 


RAI 


RTT 


GTC 


ACC TGG 






same as SEQ 


1072- 


1092 d 






















ID 


NO. 1999 






2002 


5' 


-CCI 


GAR 


GAR 


AGA 


GCI 


MGW GGT 






same as SEQ 


151- 


171 d 






















ID 


NO. 1999 






2003 b 


5' 


-CAT 


YTC 


RAI 


RTT 


GTC 


ACC TGG 






same as SEQ 

Tn MA 1QQQ 


1072- 


1092 d 



a Sequences from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
c These sequences were aligned to derive the corresponding primer. 
d The nucleotide positions refer to the C. albicans tuf sequence fragment (SEQ 

ID NO. 2004) . 
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Ann x II) Specific and ubiquitous primers for nucleic acid 
amplification (atpD sequences). 

~ Originating DNA fragment 





SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 
NO. 


Nucleotide 
position 


~io 


Bacterial specie^:- 




- Acinetobacter-baumannii 








1690 
1691 b 


5 ' -CAG 
5 ' -CAC 


GTC 
AGA 


CTG TTG CGA CTG AAG AA 
TAA ACC TGA GTG TGC TTT C 


243 
243 


186-208 
394-418 


15 


Bacterial 


species: 




Bacteroidea fragilis 








2134 
2135 b 


5 ' -CGC 
5 ' -TCT 


GTG 
CGC 


AAG CTT CTG TG 
CGT TAT TCA GTT TC 


929 
929 


184-200 
395-414 


20 


Pact^rial 


species: 




Bordetella pertussis 








2180 
2181 b 


5'-TTC 
5 ' -AGC 


GCC 
GCC 


GGC GTG GGC 
ACG CGC AGG 

"LVJ T\\J\J 


1672 c 


544-558 

U O v OOU 


25 


Bacterial 


Specie?: 




Enterococcus faecium 








1698 
1699 b 


5'-GGA 
5 ' -GCA 


ATC 
TCT 


AAC AGA TGG TTT ACA AA 
TCT GGG AAA GGT GT 


292 
292 


131-153 
258-277 


30 


1700 
1701 b 


5' -AAG 
5' -ATT 


ATG 
ATG 


CGG AAA GAA GCG AA 
GAT CAG TTC TTG GAT CA 


292 
292 


271-290 
439-461 




Bacterial species: 




Klebsiella pneumoniae 






35 


1331 
1332 b 


5 ' -GCC 
5 ' -GAC 


CTT 
CGC 


GAG GTA CAG AAT GGT AAT GAA GTT 
GGC GCA GAC CAT CA 


317 
317 


88-118 
183-203 



a These sequences were aligned to derive the corresponding primer. 

40 b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
c Sequence from databases. 
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Annex II: Specific and ubiquitous prim rs for nucleic acid 
amplif ication (atpD sequences ) • 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



To - 


Bacterial specieg: 




Streptococcus agalactiae 








627 




-ATT 


GTC 


TAT AAA AAT GGC GAT AAG TC 


379-383 a 


42 


-67 b 




625 c 


5' 


-CGT 




AGA CAC GAC CCA AAG TAT CC 


379-383 a 


206 


-231 b 


15 


628 


5' 


-AAA 


ATG 


GCG ATA AGT CAC AAA AAG TA 


379-383 a 


52- 


-77 b 




£0 cC 


C i 

J 


-CGT 


TGA 


AGA CAC GAC CCA AAG TAT CC 


379-383 a 


206- 


-231 b 




627 




-ATT 


GTC 


TAT AAA AAT GGC GAT AAG TC 


379-383 a 


42' 


-67b 


20 


626 c 


5' 


-TAC 


CAC 


CTT TTA AGT AAG GTG CTA AT 


379-383 a 


371- 


-396 b 




628 


5' 


-AAA 


ATG 


GCG ATA AGT CAC AAA AAG TA 


379-383 a 


52- 


-77 b 




626 c 


5' 


-TAC 


CAC 


CTT TTA AGT AAG GTG CTA AT 


379-383 a 


371- 


-396 b 


25 


Bagterlal 


aroup : 






Campylobacter jejuni and C. coli 








2131 




-AAG 


CMA 


TTG TTG TAA ATT TTG AAA G 


1576,1600, 


7- 


31 e 














1849,1863,2139°-* 






2132 c 


5' 


-TCA 


TAT 


CCA TAG CAA TAG TTC TA 


1576,1600, 


92- 


114 e 














1849, 1863, 2139 d < 


,a 




30 




















Bacterial 


genus : 






Bordetella sp. 










825 


5' 


-ATG 


AGC 


ARC GSA ACC ATC GTT CAG TG 


1672 d 


1- 


-26 




826 c 


5' 


-TCG 


ATC 


GTG CCG ACC ATG TAG AAC GC 


1672 d 


1342 


-1367 


35 


Funaal aenus: 






Candida sp. 










634 


5' 


-AAC 


ACY 


GTC AGR RCI ATT GCY ATG GA 


460-472, 


101- 


-126 f 














474-478 a 






40 


635 c 


. 5' 


-AAA 


CCR 


GTI ARR GCR ACT CTI GCT CT 


460-472, 


617- 


-642 f 














474-478 a 







a These sequences were aligned to derive the corresponding primer. 
45 b The nucleotide positions refer to the S. agalactiae atpD sequence fragment 
(SEQ ID NO. 380) . 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d Sequence from databases. 
50 e The nucleotide positions refer to the C. jejuni atpD sequence fragment (SEQ 

ID NO. 1576) . 

f The nucleotide positions refer to the C. albicans atpD sequence fragment (SEQ 
ID NO. 460) . 
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Annex II; Specific and ubiquitous primers for nucleic acid 
amplification (atpD segu nces) (continued). 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide s quence 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Universal- primers 



562 5 ' -CAR ATG RAY GAR CCI CCI GGI GYI MGI ATG 



563 c 5'-GGY TGR TAI CCI ACI GCI GAI GGC AT 



564 5'-TAY GGI CAR ATG AAY GAR CCI CCI GGI AA 



565 c 5'-GGY TGR TAI CCI ACI GCI GAI GGD AT 



243,244,262, 
264,280,284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 

379,393 a 

243,244,262, 
264,280,284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336;339, 
342,343,351, 
356,357,364- 
366,370,375, 

379,393 a 

243,244,262, 
264,280,284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 

379,393 a 

243,244,262, 
264,280,284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 

379,393 a 



528-557 b 



687-712 b 



522-550 b 



687-712 b 



55 a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the K. pneumoniae atpD sequence fragment 
(SEQ ID NO. 317) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex II* Sp cific and ubiquitous primers for nucl ic acid 
amplification (atpD sequ nces) (continued) • 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



10- 



15 



20 



25 



30 



35 



40 



45 



640 



641 c 



642 



641 c 



Universal primers „X continued) 

5'-TCC ATG GTI TOY GGI CAR ATG AA 
5'-TGA TAA CCW ACI GCI GAI GGC ATA CG 

5'-GGC GTI GGI GAR CGI ACI CGT GA 

5 ' -TGA TAA CCW ACI GCI GAI GGC 1 ATA CG 

Sequencing primers 



248,284.315, 513-535 b 
317,343,357, 

366,370, 379, 393 a 

248,284,315, 684-709 b 
317,343,357, 

366,370,379,393 a 

248,284,315, 438-460 b 
317,343,357, 

366,370,379,393 a 

248,284,315, 684-709 b 
317,343,357, 

366,370,379,393 a 



566 


5' 


-TTY 


GGI 


GGI 


GCI 


GGI 


GTI 


GGI 


AAR 


AC 


669 d 


445-470 


567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AY I 


GCY 


TG 


669 d 


883-908 


566 


5' 


-TTY 


GGI 


GGI 


GCI 


GGI 


GTI 


GGI 


AAR AC 


669 d 


445-470 


814 


5' 


-GCI 


GGC 


ACG 


TAC 


ACI 


GCC 


TG 






666 d 


901-920 


568 


5' 


-RTI 


ATI 


GGI 


GCI 


GTI 


RTI 


GAY 


GT 




669 d 


25-47 


567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AY I 


GCY 


TG 


669 d 


883-908 


570 


5' 


-RTI 


RYI 


GGI 


CCI 


GTI 


RTI 


GAY 


GT 




672 d 


31-53 


567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AY I 


GCY 


TG 


669 d 


883-908 


572 


5' 


-RTI 


RTI 


GGI 


SCI 


GTI 


RTI 


GA 






669 d 


25-44 


567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AY I 


GCY 


TG 


669 d 


883-908 


569 


5' 


-RTI 


RTI 


GGI 


SCI 


GTI 


RTI 


GAT 


AT 




671 d 


31-53 


567^ 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AY I 


GCY 


TG 


669 d 


883-908 


571 


5' 


-RTI 


RTI 


GGI 


CCI 


GTI 


RTI 


GAT 


GT 




670 d 


31-53 


567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 



50 a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the K. pneumoniae atpD sequence fragment 
(SEQ ID NO. 317) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

55 d Sequences from databases. 
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Annex II: Specific and ubiquitous primers for nucleic acid 
amplification (atpD segu nces) (continued) • 

Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Sequencing primers (continued) 



700 


5' 


-TIR 


TIG 


AYG 


TCG 


ART 


TCC 


CTC ARG 




669 a 


38-61 


567 b 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 a 


883-908 


DDO 


C I 

D 


-KT1 


Ail 


PPT 


PP T 
bLl 




K 1 1 


GAY 


GT 








573 b 


5 ' 


-CCI 


CCI 


ACC 


ATR 


TAR 


AAI 


GC 






666 a 


1465-1484 

j. "3 \j «y ± i o y 


574 


5' 


-ATI 


GCI 


ATG 


GAY 


GGI 


ACI 


GAR 


GG 




666 a 


283-305 


D / J 


C t 

O 


ppt 


PPT 
LL1 


IV PP 

ALL 


Al K 


I An 


BAT 
AAI 


GC 






&££a 

OO D 


1 A (TC "] A OA 


574 


5' 


-ATI 


GCI 


ATG 


GAY 


GGI 


ACI 


GAR 


GG 




666 a 


283-305 


/ (Jo 


C t 

D 




ILL 


Al 1 


PPT 
LL1 


AKJ. 


Al 1 


GCI 


ATI 


Al 


£ Aca 

0 O D 


1 *3 1 OP*5 
Xz Do-lzo J 


681 


5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR 


AC 




685 


694-716 


682 b 


5' 


-GTI 


ACI 


GGY 


TCY 


TCR 


AAR 


TTI 


CCI 


cc 


686 


1177-1202 


DO X 


c * 
D 






TTV 
111 


ppt 


TCT 
±0 -L 




AAR 


AC 




DO J 


D -7 *& 1 J. D 


683 b 


5' 


-GTI 


ACI 


GGI 


TCI 


SWI 


AWR 


TCI 


CCI 


CC 


685 


1180-1205 


681 


5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR 


AC 




685 


694-716 


699 


5' 


-GTI 


ACI 


GGY 


TCY 


TYR 


ARR 


TTI 


CCI 


cc 


686 


1177-1202 


681 


5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR AC„ 




685 


694-716 


812 b 


5' 


-GTI 


ACI 


GGI 


TCY 


TYR 


ARR 


TTI 


CCI 


cc 


685 


1180-1205 


1213 


5' 


-AAR 


GGI 


GGI 


ACI 


GCI 


GCI 


ATH 


CCI 


GG 


714 a 


697-722 


1212 b 


5' 


-CCI 


CCI 


RGI 


GGI 


GAI 


ACI 


GCW 


cc 




714 a 


1189-1211 


1203 


5' 


-GGI 


GAR 


MGI 


GGI 


AAY 


GAR 


ATG 






709 a 


724-744 


1207 b 


5' 


-CCI 


TCI 


TCW 


CCI 


GGC 


ATY 


TC 






709 a 


985-1004 


1204 


5' 


-GCI 


AAY 


AAC 


ITC 


IWM 


YAT 


GCC 






709 a 


822-842 


1206 b 


5' 


-CKI 


SRI 


GTI 


GAR 


TCI 


GCC 


A 






709 a 


926-944 


1205 


5' 


-AAY 


ACI 


TCI 


AWY 


ATG 


CCI 


GT 






709 a 


826-845 


1207 b 


5' 


-CCI 


TCI 


TCW 


CCI 


GGC 


ATY 


TC 






709 a 


985-1004 


2282 


5' 


-AGR 


RGC 


IMA 


RAT 


GTA 


TGA 








714 a 


84-101 


2284 b 


5' 


-TCT 


GWG 


TRA 


CIG 


GYT 


CKG 


AGA 






714 a 


1217-1237 


2283 


5' 


-ATI 


TAT 


GAY 


GGK 


ITT 


CAG 


AGG 


c 




714 a 


271-292 


2285 b 


5' 


-CMC 


CIC 


CWG 


GTG 


GWG AWA 


C 






714 a 


1195-1213 



Sequences from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex IIIi Internal hybridization probes for specific det ction of 
tuf sequences. 



5 










Originating DNA fragment 




SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 
NO. 


Nucleotide 
position 


10 


Bacterial 


sDecies : 




AJbiotrophia adiacens 








2170~ 


~ ~ 5 ' -ACG 


TGA 


CGT TGA CAA ACC A 


171 5~ " 


~313031~ 


15 


Bacterial 


species: 




Chlamydia pneumoniae 






2089 

*. VI O J 

2090 


- 5'-ATG 
5 ' -CGT 


CTG 
TAC 


AAC TTA TTG ACC TT 
TGG AGT CGA AAT G 


20 
20 


136-155 
467-485 


20 


Bacterial 


specieg: 




Enterococcua faecalis 






580 

CAT 

1174 


5'-GCT 
5'-GGT 
5' -GAA 


AAA 
ATT 
CGT 


CCA GCT ACA ATC ACT CCA C 
AAA GAC GAA ACA TC 
GGT GAA GTT CGC 


62-63,607 a 
62-63, 607 a 
62-63, 607 a 


584-608 b 
440-459 b 
398-415 b 


25 


Bacterial 


species: 




Enterococcua faecium 








602 


5'-AAG 


TTG 


AAG TTG TTG GTA TT 


64,608 a 


426-445 c 


30 


Bacterial 


species : 




EnterococcuB gallinarum 




604 
Bacterial 


5' -GGT 
species: 


GAT 


GAA GTA GAA ATC GT 

Escherichia coli 


66,609 a 


419-438 d 


35 


579 


5' -GAA 


GGC 


CGT GCT GGT GAG AA 


78 


503-522 




2168 


5 ' -CAT 


CAA 


AGT TGG TGA AGA AGT TG 


78 


409-431 


• 

40 


Bacterial 


specie^ : 




Neisseria gonorrhoeae 






2166 


5 ' -GAC 


AAA 


CCA TTC CTG CTG 


126 


322-339 e 




Funeral species: 




Candida albicans 






45 


577 


5 ' -CAT 


GAT 


TGA ACC ATC CAC CA 


407-411 3 


406-425 f 




Funaal species: 




Candida dubliniensis 






t 

50 


578 


5 ' -CAT 


GAT 


TGA AGC TTC CAC CA 


412,414-415 a 


418-437^ 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the £. faecalis tuf sequence fragment (SEQ ID NO. 
607). 

c The nucleotide positions refer to the E. faecium tuf sequence fragment (SEQ ID NO. 
55 608) . 



° The nucleotide positions refer to the E. gallinarum tuf sequence fragment (SEQ ID NO. 
609). 

€ The nucleotide positions refer to the N. gonorrhoeae tuf sequence fragment (SEQ ID NO. 
I 126). 

50 £ The nucleotide positions refer to the C. albicans tuf(EF-l) sequence fragment (SEQ ID 
NO. 408). 

9 The nucleotide positions refer to the C. dubliniensis tuf(EF-l) sequence fragment (SEQ 
ID NO. 414) . 
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Annex III: Internal hybridization probes for specific detection of 
tuf sequences (continued) . 



Originating DNA fragment 



25 



40 



SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 
NO. 


Nucleotide 
position 


Bacterial 


species t 




Haemophilus influenzae 






581 


5'-ACA 


TCG 


GTG CAT TAT TAC GTG G 


610 a 


551-572 


&&Sr v^r la,!. 


specie?: 




M\rs+r%r\ 7 a j?niA nnAUfflnn ioa 






2095 






GGT TGA ACG TGG 


2097 a 


687-704 


Bacterial 


species : 




Staphylococcus aureus 






JDV 


5 ' -AC A 


TGA 


> 

CAC ATC TAA AAC AA 


176-180 b 


369-388° 


JOJ 


5 ' -ACC 


ACA 


TAC TGA ATT CAA AG 


176-180 b 


525-544 c 




5 ' -CAG 


AAG 


TAT ACG TAT TAT CA 


176-180 b 


545-564 c 


587 


5 ' -CGT 


ATT 


ATC AAA AGA CGA AG 


176-180 b 


555-574 c 


588 


D -TCI 


1L1 


CAA ACT ATC GTC CA 


176-180 b 


593-612° 


Bacterial 


Specif : 




Staphylococcus epidermidis 




589 


5 ' -GCA 


CGA 


AAC TTC TAA AAC AA 


185,611 b 


445-464 d 


590 


D - TAT 


AL<o 


TAT TAT CTA AAG AT 


185,611 b 


627-646 d 


591 


5 ' -TCC 


TGG 


TTC TAT TAC ACC AC 


185,611 b 


586-605 d 


592 


5' -CAA 


AGC 


TGA AGT ATA CGT AT 


185, 6U b 


616-635 d 


593 


5' -TTC 


ACT 


AAC TAT CGC CCA CA 


185,611 b 


671-690 d 


Bacterial 


species: 




Staphylococcus haemolyticus 




594 


5 ' -ATT 


GGT 


ATC CAT GAC ACT TC 


186,188-190 b 


437-456 e 


595 


5'-TTA 


AAG 


CAG ACG TAT ACG TT 


186,188-190 b 


615-634 e 


Bacterial 


species: 




Staphylococcus hominis 






596 


5'-GAA 


ATT 


ATT GGT ATC AAA GA 


191,193-196 b 


431-450 f 


597 


5 ' -ATT 


GGT 


ATC AAA GAA ACT TC 


191,193-196 b 


437-456 f 


598 


5'-AAT 


TAC 


ACC TCA CAC AAA AT 


191,193-196 b 


595-614 f 



45 

a Sequences from databases. 

b These sequences were aligned to derive the corresponding probe. 
° The nucleotide positions refer to the S. aureus tuf sequence fragment (SEQ ID 
NO. 179) . 

50 d The nucleotide positions refer to the S. epidermidis tuf sequence fragment 
(SEQ ID NO. 611) . 

€ The nucleotide positions refer to the S. haemolyticus tuf sequence fragment 
(SEQ ID NO. 186) . 

f The nucleotide positions refer to the S. hominis tuf sequence fragment (SEQ 
55 ID NO. 191) . 
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Annex III: Internal hybridization probes for specific detection of 
tuf sequenc s (continued) . 

Originating DNA fragment 

5 SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 Ba c t ~er ia l~speci"e s : Staphylococcus saprophyti cub ; ~ 

599 5'-CGG TGA AGA AAT CGA AAT CA 198-200 a 406-425 b 

600 5'-ATG CAA GAA GAA TCA AGC AA 198-200 a 431-450 b 

601 5'-GTT TCA CGT GAT GAT GTA CA 198-200 a 536-555 b 
15 695 5'-GTT TCA CGT GAT GAC GTA CA 198-200 a 563-582 b 

Bacterial species : Streptococcus agalactiae 

582 c 5'-TTT CAA CTT CGT CGT TGA CAC GAA CAG T 207-210 a 404-431 d 

20 583 c 5' -CAA CTG CTT TTT GGA TAT CTT CTT TAA TAC CAA CG 207-210 a 433-467 d 

1199 5' -GTA TTA AAG AAG ATA TCC AAA AAG C 207-210 a 438-462 d 

Bacterial species : Streptococcus pneumoniae 

25 1201 5' -TCA AAG AAG AAA CTA AAA AAG CTG T 971,977, 513-537 e 

979,986 a 

jBacterial gpecies: Streptococcus pyogenes 

30 1200 5' -TCA AAG AAG AAA CTA AAA AAG CTG T 1002 473-497 

Bacterial group : Enterococcus casseliflavus-flavescens- 

gallinarwn group 

35 620 5' -ATT GGT GCA TTG CTA CGT 58,65,66 a 527-544 f 

1122 5'-TGG TGC ATT GCT ACG TGG 58,65,66 a 529-546 f 

Bacterial group : Enterococcus sp., Gemella sp., A. adiacens 

40 2172 5'-GTG TTG AAA TGT TCC GTA AA 58-62,67-71, 477-4969 

87-88,607-609, 
727,871 
1715,1722 a 

45 a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the S. saprophyticvs tuf sequence fragment 
(SEQ ID NO. 198) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

50 d The nucleotide positions refer to the 5. agalactiae tuf sequence fragment 
(SEQ ID NO.' 209) . 

e The nucleotide positions refer to the S. pneumoniae tuf sequence fragment 
(SEQ ID NO. 986) . 

f The nucleotide positions refer to the E. flavescens tuf sequence fragment 
55 (SEQ ID NO. 65) . 

9 The nucleotide positions r fer to the E. faecium tuf sequenc fragment (SEQ 
ID NO. 608) . 
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Annex III: int rnal hybridization prob s for specific detection of 
tuf 8 quences (continued) . 

Originating DNA fragment 

5 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10" - Bacterial- genus-: Gemella. _ 

2171 5'-TCG TTG GAT TAA CTG AAG AA 87,88 a 430-449 b 

Bacterial genus: Staphylococcus sp. 



15 



20 



25 



605 5'-GAA ATG TTC CGT AAA TTA TT 176-203 a 403-422 c 

606 5 ' -ATT AGA CTA CGC TGA AGC TG 176-203 3 420-439° 

1175 5'-GTT ACT GGT GTA GAA ATG TTC 176-203 a 391-411 c 

1176 5'-TAC TGG TGT AGA AAT GTT C 176-203 a 393-411 c 

Bacterial genus : Streptococcus sp» 

1202 5'-GTG TTG AAA TGT TCC GTA AAC A 206-231,971, 466-487 d 

977,979, 982-986 a 



Fungal specie?: Candida albicans 

1156 5' -GTT GAA ATG CAT CAC GAA CAA TT 407-412, 624 a 680-702 e 
30 Funaal group : Candida albicans and C. tropicalis 

1160 5 ' -CGT TTC TGT TAA AGA AAT TAG AAG 407-412, 748-771 e 

429,624 a 

35 Fungal species : Candida dubliniensis 

1166 5'-ACG TTA AGA ATG TTT CTG TCA A 414-415 a 750-771 f 

1168 5' -GAA CAA TTG GTT GAA GGT GT 414-415 a 707-726 f 

40 Fungal species : Candida glabrata 

1158 5'-AAG AGG TAA TGT CTG TGG T 417 781-799 

1159 5 ' -TGA AGG TTT GCC AGG TGA 417 718-735 

45 Fungal species : Candida krusei 

1161 5' -TCC AGG TGA TAA CGT TGG 422 720-737 



50 



a These sequences were aligned to- derive the corresponding primer. 

b The nucleotide positions refer to the G. haemolysans tuf sequence fragment 
(SEQ ID NO. 87) . 

c The nucleotide positions refer to the S. aureus tuf sequence fragment {SEQ ID 
55 NO. 179) . 

d The nucleotide positions refer to the S. pneumoniae tuf sequence fragment (SEQ ID NO. 
986). 

e The nucleotide positions refer to the C. albicans tuf(EF-l) sequence fragment (SEQ ID 
NO. 408). 

60 * The nucleotide positions refer to the C. dubliniensis tuf (EF-1) sequence fragment (SEQ 
ID NO. 414) . 
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Annex III: Internal hybridization prob s for sp cific detection f 
tuf 8 gu nces (continued). 

Originating DNA fragment 

5 tj 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 Funeral group : ~ Candida lusi taaiae~and C.~ gui-llermondii 

1162 5'-CAA GTC CGT GGA AAT GCA 418,424 a 682-699 b 

Fungal species: Candida parapsiloaia 

1157 5'-AAG AAC GTT TCA GTT AAG GAA AT 426 749-771 

Funoal species : Candida zeylanoides 

20 1165 5'-GGT TTC AAC GTG AAG AAC 432 713-730 

Fungal genus ; Candida sp. 

1163 5 ' -GTT GGT TTC AAC GTT AAG AAC 407-412,414- 728-748 c 
25 415,417,418, 

422,429 a 

1164 5'-GGT TTC AAC GTC AAG AAC 413,416,420, 740-757 b 

421,424,425, 

426,428,431 a 

30 1167 5 ' -GTT GGT TTC AAC GT 406-426, 428- 728-741° 

432, 624 a 

a These sequences were aligned to derive the corresponding primer. 

35 b The nucleotide positions refer to the C. lusitaniae tuf (EF-1) sequence 
fragment (SEQ ID NO. 424). 

c The nucleotide positions refer to the C. albicans tuf (EF-1) sequence fragment 
(SEQ ID NO. 408) . 
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Annex XIII: Strategy for the selection of the 
Staphylococcus- specific hybridization probe 
from tuf s guences. 

5 







400 




A O C 

4zb 


SEQ ID NO. 


: Accession 


5. 


aureus 


G 


rr\rr\r* ft * 7k mf*mm 

i i GAAATG1 1 


m % * ft rr\m ft 

CCGTAAATTA 


mm t> 0 ft 

TTAGA 


179 




_S._ 


_aureus 


G 


mnrvr^ * % *m^mm 
1 IGAAATGTT 


^^^mft it ft mm 11 

CCGTAAATTA 


mm ft ^*7k 

TTAGA 


1/0 


1__ 


S. 


aureus 


G 


TTGAAATGTT 


^^^m» ft ftmmft 

CCGTAAATTA 


TTAGA 


177 . 




S. 


aureus 


G 


TIX5AAATGTT 


m ft ft ftmmft 

C CGTAAATTA 


mm 7k r* ft 
TTAGA 


178 




S. 


aureus aureus 


G 


TTGAAATGTT 


n/^/tmn ft ft mm ft 

CCGTAAATTA 


TTAGA 


1 Oft 

180 




s. 


auricularis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


181 




s. 


capitis capitis 


G 


TAGAAATGTT 


#Q^*^%#vito vi intuit 

CCGTAAATTA 


mm ft 0 ft 

TTAGA 


182 




M. 


caseolyticus 


G 


TAGAAATGTT 


MA/un ft ft ftmmft 

CCGTAAATTA 


mm 7k /"*• ft 

TTAGA 


1 0*3 

lb J 




s. 


cohnii 


G 


m * ft ft ft m^mm 

TAGAAATGTT 


MM^mm ft ftmmft 

CCGTAAATTA 


mm 7v 0 ft 

TTAGA 


184 




s. 


epidermidis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


185 




s. 


haemolyticus 


G 


TAGAAATGTT 


#«^H^%mft ft ft mm ft 

CCGTAAATTA 


mm ft i*^ ft 

TTAGA 


loo 




5. 


haemolyticus 


G 


TAGAAATGTT 


M/tMmt ft tmint 

CCGTAAATTA 


TTAGA 


189 




S. 


haemolyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


mm ft Oft 

TTAGA 


^ oft 
190 




£. 


haemolyticus 


G 


TAGAAATGTT 


M.MMMito to siiuiii 

CCGTAAATTA 


TTAGA 


188 




S. 


hominis 


G 


^nfe ^1 to to to mAHUR 

TAGAAATGTT 


AAAf*l% to to JVIfVt to 

CCGTAAATTA 


TTAGA 


196 




S. 


hominis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


194 




s. 


hominis hominis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


191 




s. 


hominis 


G 


TAGAAATGTT 


*m ^fti A Mi «fe to toMllMto 

CCGTAAATTA 


TTAGA 


193 




s. 


hominis 


G 


TAGAAATGTT 


MMMm ft ft ft mm ft 

CCGTAAATTA 


mm ft z""' ft 
TTAGA 


1 AC 

19b 




s. 


lugdunensis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


197 




s. 


saprophyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


198 




5. 


saprophyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


200 




s. 


saprophyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


199 




s. 


sciuri sciuri 


G 


TTGAAATGTT 


CCGTAAATTA 


TTAGA 


201 




s. 


warneri 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


187 




5. 


warner i 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


192 




5. 


warneri 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


202 




S. 


warneri 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


203 




B. 


subtilis 


G 


TTGAAATGTT 


CCGTAAgcTt 


CTTGA 




Z99104 


K. 


coli 


G 


TTGAAATGTT 


CCGcAAAcTg 


CTGGA 


78 




L. 


monocytogenes 


G 


TAGAAATGTT 


CCGTAAATTA 


CTAGA 


138 a 





40 

Selected sequence for 
genus-specific hybridi- 
zation probe GAAATGTT CCGTAAATTA TT 605 

45 

The sequence numbering refers to the Staphylococcus aureus tuf gene fragment 
(SEQ ID NO. 179) , Nucleotides in capitals are identical to the selected sequence 
or match that sequence. Mismatches are indicated by lower-case letters. 

50 e The SEQ ID NO. refers to previous patent publication WO98/20157. 
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Annex XIV: Strat gy for the s 1 ction of Staphylococcus 
saprophytics- specific and of Staphylococcus 
haemolyticus- specific hybridization probes 
from tuf sequ nc s. 

5 



SEQ ID 



1 A 
10 






339 






M/*% * 

wu. : 




S. 




aureus 


2£(Z~ 




AglCuAAAlx, AlCwn ClflU 


ATGACAC ft TC i AA 


1 /y 




r» 
b. 


aureus 


r\\j 


rp*. f^P»fpr»* »f*» 


Ay^i Cb/tAitil. AlCwItiaL 


Ai\iAwlLal^ 1AA 


I/O 




S. 


aureus 


Mb 


1 VWll bAAlvA 


AgTCvAAAXt AiCwitloL 


& Tr APKPa flip A A 

ATGACACaTC 1AA 


1 T7 
lit 




5. 


aureus 


Ab 




AQlLbAAAXw AlCbbltiaC 


ArGAUACBlC 1AA 


I/O 


1 c 


S. 


aureus aureus 


Ab 


rp*. /^r»m/2» ft/Ilk 


AgTCGAAAlU AlCbbiviaL 


ATGACACaTC TAA 


lou 




S. 


auricularis 


Ab 




AgTCGAAATC ATCwTATgo 


Aa b AC g g TTC AAA 


1 Q 1 
±0 1 




5. 


capitis capitis 


Ab 


1 1 bb i. b AAbA 


AgTtGAAATC ATCbAjTAIUL 


AC uA^wviv 1 AA 


1 0 0 




Jf . 


caseolyticus 


AG 


Tf PP*PPi » p» 


AgiCbAAAlt Aliwbiuloo 


ciwiagaftCL aaa 


lo j 




S. 


cohnii 


Ab 


1 L bbTbAAbA 


AgTtbAAAIl- AlCbblAigi. 


AabAdgai 1L LAn 


loft 


20 


S. 


epidermidis 


AG 


TtGGTGAAGA 


AgTtGAAATC ATCGGTATgC 


AC GAaACTTC 1 AA 


lOD 




5. 


haemolyticus 


AG 


T^fc. p»P«n"kP» > * r» * 

T t GGT GAAGA 


AgTtGAAATC ATTGGTATCC 


ATGACAC TTC TAA 


loo 




S. 


haemolyticus 


AG 


T t GGT G AAG A 


AgTtGAAATC ATTGGTATCC 


% mp » r*» ?s p/nmp rp > > 
ATGACACTTC 1 AA 


1 0 o 

lo y 




5. 


haemolyticus 


AG 


1 tGGTGAAGA 


AgTtGAAATC ATTGGTATCC 


ATGACAC TTC IAA 


ion 

iy u 




S. 


haemolyticus 


AG 




AQTCGAAATC ATTGGTATCa 


ABGAaACTTC 1 AA 


1 0 Q 

loo 


25 


S. 


hominis 


jp 
AG 


rp*. pnffip » jpl 

A tvjbKjAA^A 


AgTtGAAATC ATTGGTATCa 


Afl G AaACTTC I AA 


iy ft 




S. 


hominis hominis 


ap 
Ab 


X tuui bAAbil 


AgTCGAAATC ATTGGTATCa 


AabA&ACTTC 1 AA 


10 1 

iy i 




s. 


hominis 


AG 


TtGGTGAAGA 


Ayi LbAAAlb Ai. i.bbl<ni.Va 


AaWMAU ^ iv 1 






s. 


hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt ATTGGTATCa 


AaGAaACTTC TAA 


195 




s. 


hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt ATTGGTATCa 


AaGAtACTTC TAA 


196 


30 


S. 


lugdunensis 


AG 


TCGGTGAAGA 


AgTtGAAATt ATTGGTATCC 


AcGAtACTaC TAA 


197 




S. 


saprophyticus 


AG 


TCGGTGAAGA 


AATCGAAATC ATcGGTATgC 


AaGAagaaTC CAA 


198 




5. 


saprophyticus 


AG 


TCGGTGAAGA 


AATCGAAATC ATcGGTATgC 


AaGAagaaTC CAA 


200 




S. 


saprophyticus 


AG 


TCGGTGAAGA 


AATCGAAATC ATcGGTATgC 


AaGAagaaTC CAA 


199 




S. 


sciuri sciuri 


TG 


TtGGTGAAGA 


AgTtGAAATC ATcGGTtTaa 


cTGAagaaTC TAA 


201 


35 


S. 


warner i 


AG 


TtGGTGAAGA 


AgTtGAAATC ATcGGTtTaC 


ATGACACTTC TAA 


187 




S. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC ATcGGTtTaC 


ATGACACTTC TAA 


192 




S. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC ATcGGTtTaC 


ATGACACTTC TAA 


202 




S. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC ATcGGTtTaC 


ATGACACTTC TAA 


203 




B. 


subtilis 


AG 


TCGGTGAcGA 


AgTtGAAATC ATcGGTcTtC 


AaGAagagag AAA 




40 


r. 


coli 


AG 


TtGGTGAAGA 


AgTtGAAATC gTTGGTATCa 


AaGAgACTca GAA 


78 




L. 


monocytogenes 


AG 


TtGGTGAcGA 


AgTaGAAgTt ATcGGTATCg 


AaGAagaaag AAA 


13 8 b 



Selected sequences for 
species-specific 

45 hybridization probes CGGTGAAGA AATCGAAATC A (S. saprophyticus) 599 

{S. haemolyticus) ATTGGTATCC ATGACACTTC 594 

The sequence numbering refers to the Staphylococcus aureus tuf gene fragment 
(SEQ ID NO. 179). Nucleotides in capitals are identical to the selected 
50 sequences or match those sequences. Mismatches are indicated by lower-case 
letters. 

a This sequence was obtained from Genbank accession #Z99104. 

b The SEQ ID NO. refers to previous patent publication WO98/20157. 
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Annex XV: Strategy for th selection of Staphylococcus 
aureus- specif ic and of Staphylococcus 
epid rmidis-specific hybridization prob s from 
tuf s quences. 

5 

SEQ ID 

521 547 .592 617 _ Jjp. 

10 S. aureus TACACCACA TACTGAATTC AAAOCAG . . , TTCTTCtCa AACTATCGtC CACAATT 179 

S. aureus TACACCACA TACTGAATTC AAAOCAG ... TTCTTCtC 178 

S. aureus TACACCACA TACTGAATTC AAAGCAG ... TTCTTCtCa AACTATCGtC CACAATT 176 

S. aureus TACACCACA TACTGAATTC AAAGCAG ... TTCTTCtCa AACTATCGtC CACAATT 177 

S. aureus aureus TACACCACA TACTGAATTC AAAGCAG. . .TTCTTCtCa AACTATCGtC CACAATT 180 

15 S. auricular is TACACCACA cACTaAATTC ActGCAG . . . TTCTTCtCT AACTAcCGtC CACAATT 181 

S. capitis capitis CACACCACA cACTaAATTC AAAGCGG . . . TTCTTCAgT AACTAcCGCC CACAATT 182 

M. caseolyticus TACtCCACA TACTaAATTC AAAGCTG ... TTCTTCACT AACTAcCGCC CtCAGTT 183 

S. COhnii TACACCACA cACaaAcTTt AAAGCGG ... TTCTTCAgT AACTATCGCC CACAATT 184 

S. epidermidis TACACCACA cACaaAATTC AAAGCTG ... TTCTTCACT AACTATCGCC CACAATT 185 

20 S. haemolyticus CACACCtCA cACaaAATTt AAAGCAG ... TTCTTCACa AACTATCGtC CACAATT 186 

S. haemolyticus CACACCtCA cACaaAATTt AAAGCAG ... TTCTTCACa AACTATCGtC CACAATT 189 

S. haemolyticus CACACCtCA CACaaAATTt AAAGCAG ... TTCTTCACa AACTATCGtC CACAATT 190 

5. haemolyticus TACACCtCA cACaaAATTC AAAGCAG ... TTCTTCACT AACTATCGtC CACAATT 188 

S. hominis CACACCtCA cACaaAATTC AAAGCAG ... TTCTTCACT AACTATCGtC CACAATT 195 

25 S. hominis TACACCtCA CACaaAATTC AAAGCAG ... TTCTTCACT AACTATCGtC CACAATT 196 

S. hominis hominis TACACCtCA CACaaAATTC AAAGCAG ... TTCTTCtCT AACTATCGtC CACAATT 191 

S. hominis TACACCtCA cACaaAATTC AAAGCAG ... TTCTTCtCT AACTATCGtC CACAATT 193 

S. hominis TACACCtCA cACaaAATTC AAAGCAG ... TTCTTCtCT AACTATCGtC CACAATT 194 

S. lugdunensis TACACCtCA cACTaAATTt AAAGCTG ... TTCTTCtCa AACTAcCGCC CACAATT 197 

30 S. saprophyticus TACACCACA TACaaAATTC AAAGCGG ... TTCTTCACT AACTAcCGCC CACAATT 198 

S. saprophyticus TACACCACA TACaaAATTC AAAGCGG ... TTCTTCACT AACTAcCGCC CACAATT 199 

S. saprophyticus TACACCACA TACaaAATTC AAAGCGG. . .TTCTTCACT AACTAcCGCC CACAATT 200 

S. scivri sciuri CACACCtCA CACTaAATTC AAAGCTG ... TTCTTCACa AACTAcCGCC CACAATT 201 

. S. wameri TACACCACA TACaaAATTC AAAGCGG... 192 

35 £. wameri TACACCACA TACaaAATTC AAAGCGG ... TTCTTCAgT AACTAcCGCC CACAATT 187 

S. wameri TACACCACA TACaaAATTC AAAGCGG ... TTCTTCAgT AACTAcCGCC CACAATT 202 

S. wameri TACACCACA TACaaAATTC AAAGCGG ... TTCTTCAgT AACTAcCGCC CACAATT 203 

B. subtilis CACtCCACA cAgcaAATTC AAAGCTG ... TTCTTCtCT AACTAcCGtC CtCAGTT - ft 

E. coli CAAgCCgCA cACcaAgTTC gAAtCTG . . . TTCTTCAaa ggCTAcCGtC CgCAGTT 78 

40 L. monocytogenes TACtCCACA cACTaAcTTC AAAGCTG ... TTCTTCAac AACTAcCGCC CACAATT 138 b 

Selected sequences 
for species-specific 
hybridization 

45 probes ACCACA TACTGAATTC AAAG (S. aureus) 585 

(S. epidermidis) TTCACT AACTATCGCC CACA 593 

The sequence numbering refers to the Staphylococcus aureus tuf gene fragment (SEQ ID NO. 
179) . Nucleotides in capitals are identical to the selected sequences or match those 
50 sequences. Mismatches are indicated by lower-case letters. *~ n indicate incomplete 
sequence data. Dots indicate gaps in the sequences displayed. 

a This sequence was obtained from Genbank accession #Z99104. 

b The SEQ ID NO. refers to previous patent publication WO98/20157. 
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Annex XVI: Strategy for the selection of the 
Staphylococcus hominis- specific hybridization 
probe from tuf s quences. 

5 



10 



15 



20 



25 



30 



35 



40 



S. aureus 

S. aureus 

S. aureus 

S. aureus 

5. aureus aureus 

S. auricularis 

S. capitis capitis 

M. caseolyticus 

S. cohnii 

S. epidermidis 

S. haemolyticus 

S. haemolyticus 

S. haemolyticus 

S. haemolyticus 

5. hominis 

5. hominis 

S. hominis hominis 

S. hominis 

S. hominis 

S. lugdunensis 

S. saprophytics 

5. saprophytics 

£. saprophytics 

S. sciuri sciuri 

S. warneri 

5. wameri 

5. warner i 

S. warneri 

B. suJbtilis 

E. coli 

L. monocytogenes 

Selected sequence for 
species-specific 
hybridization probe 



_358 

ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTATgA 
ATC ATcGGTATCc 
ATC ATTGGTtTaA 
ATC ATcGGTATgc 
ATC ATcGGTATgc 
ATC ATTGGTATCc 
ATC ATTGGTATCc 
ATC ATTGGTATCC 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCc 
ATC ATcGGTATgc 
ATC ATcGGTATgc 
ATC ATcGGTATgc 
ATC ATcGGTtTaA 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTcTtc 
ATC gTTGGTATCA 
GTT ATcGGTATCg 



383 

AtGAcACaTC TAA 
AtGAcACaTC TAA 
AtGAcACaTC TAA 
AtGAcACaTC TAA 
AtGAcACaTC TAA 
AAGAcggTTC AAA 
AcGAAACTTC TAA 

ctGAAgaacC AAA 
AAGAAgaTTC CAA 
AcGAAACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AAGAAACTTC TAA 
AAGAtACTTC TAA 
AAGAAACTTC TAA 
AAGAAACTTC TAA 
AAGAAACTTC TAA 
AAGAAACTTC TAA 
AcGAtACTaC TAA 
AAGAAgaaTC CAA 
AAGAAgaaTC CAA 
AAGAAgaaTC CAA 
ctGAAgaaTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AAGAAgagag AAA 
AAGAgACTca GAA 
AAGAAgaaag AAA 



SEQ. 



ATTGGTATCA AAGAAACTTC 



ID„NO.. 

179 

176 

177 

178 

180 

181 

182 

183 

184 

185 

186 

189 

190 

188 

196 

194 

191 

193 

195 

197 

198 

200 

199 

201 

187 

192 

202 

203 

78 
138 b 



597 



The sequence numbering refers to the Staphylococcus aureus tuf gene fragment 
(SEQ ID NO. 179). Nucleotides in capitals are identical to the selected 
sequences or match those sequences. Mismatches are indicated by lower-case 
letters. Dots indicate gaps in the sequences displayed. 

a This sequence was obtained from Genbank accession #Z99104. 

b The SEQ ID NO. refers to previous patent publication WO98/20157. 
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Annex XXI: Specific and ubiquitous primers 

amplification (r cA sequ nces). 



for nucleic acid 



Originating DMA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



20 



25 



30 



919 
92 0 b 

921 
922° 



1605 
1606 



935 
936 b 



937 
938 b 



Universal primers (recA) 

5'-GGI CCI GAR TCI TMI GGI AAR AC 
5' -TCI CCV ATI TCI CCI TCI AIY TC 

5'-TIY RTI GAY GCI GAR CAI GC 

5 ' -TAR AAY TTI ARI GCI YKI CCI CC 

Sequencing primers (recA) 

5'-ATY ATY GAA RTI TAY GCI CC 
5'-CCR AAC ATI AY I CCI ACT TTT TC 

Universal primers (radSl) 

5' -GGI AAR WSI CAR YTI TGY CAY AC 
5' -TCI SIY TCI GGI ARR CAI GG 

Universal primers (dmcl) 

5 ' -ATI ACI GAR GYI TTY GGI GAR TT 
5'-CYI GTI GYI SWI GCR TGI GC 



918 a 
918 a 

918 a 
918 a 



1704 a 
1704 a 



939 a 
939 a 



940 a 
940 a 



437-459 
701-723 

515-534 
872-894 



220-239 
628-650 



568-590 
1126-1145 



1038-1060 
1554-1573 



35 



a Sequences from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XXII i Specific and ubiquitous primers for nucl ic acid 
amplification {speA sequences). 

_____ ________________________ originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

Bacterial species : Streptococcus pyogenes 

994 5'-TGG ACT AAC AAT CTC GCA AGA GG 993 a 60-82 

995 b 5' -ACA TTC TCG TGA GTA ACA GGG T 993 a 173-194 

996 5 ' -ACA AAT CAT GAA GGG AAT CAT TTA G 993 a 400-424 

997° 5'-CTA ATT CTT GAG CAG TTA CCA TT 993 a 504-526 

998 5'-GGA GGG GTA ACA AAT CAT GAA GG 993 a 391-413 

997° 5'-CTA ATT CTT GAG CAG TTA CCA TT 993 a 504-526 



Sequence from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XXXII i Sp cific and ubiquitous primers for nucleic acid 
amplification (toxin sequences). 

Originating DNA fragment 

5 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 Toxin aene: cdtA ~ 

2123 5'-TCT ACC ACT GAA GCA TTA C 2129 a 442-460 

2124 b 5 ' -TAG GTA CTG TAG GTT TAT TG 2129 a 580-599 

15 Toxin gene : cdtB 

2126 5 ' -ATA TCA GAG ACT GAT GAG 2130 a 2665-2682 

2127 b 5 '-TAG CAT ATT CAG AGA ATA TTG T 2130 a 2746-2767 

20 Tpxin gene: etx 3 

1081 5' -ATG TCA GAG GGA TAG ATC CA 1076 a 233-252 
1080 b 5 ' -TAT AGC TAC TGT CAC CAG ACA ATG T 1076 a 394-418 

25 Toxin gene: stx 2 

1078 5 ' -AG? TCT GCG TTT TGT CAC TGT C 1077 a 546-567 

1079 b 5'-CGG AAG CAC ATT GCT GAT T 1077 a 687-705 

30 Toxin gene?: stx 2 and stx 3 

1082 5 ' -TTG ARC RAA ATA ATT TAT ATG TG 1076 a 278-300 
1083 b 5' -TCA TGA TGR CAA TTC AGT AT 1076 a 781-800 

35 ' . 



Sequences from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing, 
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Annex XXXIII: Molecular beacon int rnal hybridization probes for 
specific detection of toxin s gu nc s. 



SEQ ID NO. Nucleotide seguence a 



Originating DNA fragment 

SEQ ID Nucleotide 
NO. position 



10- 



15 



20 



25 



30 



Toxin aene : 



cdtA 



2125 b 5'-CAC GCG GAT TTT GAA TCT CTT CCT CTA 
GTA GCG CGI G 



Toxin aene : 



cdtB 



2128 5'-CAA CGC TGG AGA ATC TAT ATT TGT AGA 

AAC TG£ fin Q 



Toxin genq 
1084 



5 ' -CCA CGC CGC TTT GCT GAT TTT TCA CAT 
GTT ACC CCS TGG 



2012 d 5'-CCG CGG ATT ATT AAA CCG CCC TTC CGC 
GQ-MR-HEG-ATG TCA GAG GGA TAG ATC CA 

Toxin gene : stx 3 

1085 5'-CC& CSC CAC TGT CTG AAA CTG CTC CTG 

TG CC1 QQ 



2129 c 

2130 c 

1076 c 
1076 c 

1077 c 



462-488 

2714-2740 

337-363 
248-264 

617-638 



35 



a Underlined nucleotides indicate the molecular beacon's stem. 

k These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

c Sequences from databases. 

40 & Scorpion primer. 
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Annex XXXIV t Specific and ubiquitous primers for nucleic acid 
amplification (van sequences) . 



Originating DNA fragment 

5 

SEQ ID NO. Nucleotide sequence SEQ id Nucleotide 

NO. position 



-10- 


— Resistance-aene:- - 




— vanA 



















1086 


5 ' -CTA CTC 


CCG 


CCT 


TTT 


GGG 


TT 


1049-1057 a 


513- 


-532 b 




1087 c 


5'-CTC ACA 


GCC 


CGA 


AAC 


AGC 


CT 


1049-1057 a 


699- 


-718 b 


15 


1086 


5 ' -CTA CTC 


CCG 


CCT 


TTT 


GGG 


TT 


1049-1057* 


513- 


-532 b 




1088 c 


5'-TGC CGT 


TTC 


CTG 


TAT 


CCG 


TC 


1049-1057 a 


885- 


-904 b 




1086 


5 ' -CTA CTC 


CCG 


CCT 


TTT 


GGG 


TT 


1049-1057 a 


513- 


>532 b 


20 


1089 c 


5'-ATC CAC 


ACG 1 GGC 


TAG 


ACC 


TC 


1049-1057 a 


933- 


•952 b 


1090 


5'-AAT AGC 


GCG 


GAC 


GAA 


TTG 


GAC 


1049-1057 a 


629- 


-649 b 




1091 c 


5'-AAC GCG 


GCA 


CTG 


TTT 


CCC 


AA 


1049-1057 a 


734- 


-753 b 




1090 


5'-AAT AGC 


GCG 


GAC 


GAA 


TTG 


GAC 


1049-1057 a 


629- 


•649 b 


25 


1089 c 


5'-ATC CAC 


ACG 


GGC 


TAG 


ACC 


TC 


1049-1057 a 


933- 


•952 b 




1092 


5' -TCG GCA 


AGA 


CAA 


TAT 


GAC 


AGC 


1049-1057 a 


662- 


-682 b 




1088 c 


5'-TGC CGT 


TTC 


CTG 


TAT 


CCG 


TC 


1049-1057 a 


885- 


904 b 


30 


Resistance aene: 




vanB 














1095 


5' -CGA TAG 


AAG 


CAG 


CAG 


GAC 


AA 


1117 d 


473 


-492 




1096° 


5' -CTG ATG 


GAT 


GCG 


GAA 


GAT 


AC 


1117 d 


611 


-630 


35 


Resistance genes: 




vanA, 


vanB 










1112 


5'-GGC TGY 


GAT 


ATT 


CAA 


AGC 


TC 


1049-1057, 1117 a 


437- 


456 b 




1113 c 


5'-ACC GAC 


CTC 


ACA 


GCC 


CGA 


AA 


1049-1057, 1117 a 


705- 


724 b 


40 


1112 


5' -GGC TGY 


GAT 


ATT 


CAA 


AGC 


TC 


1049-1057, 1117 a 


437- 


456 b 




1114 c 


5' -TCW GAG 


CCT 


TTT 


TCC 


GGC 


TCG 


1049-1057, 1117 a 


817- 


837 b 



"U 



15 



1115 


5' 


-TTT CGG 


GCT 


GTG 


AGG 


TCG 


GBT 


GHG 


CG 


1049-1057, 1117* 


705-730" 


1114 c 


5' 


-TCW GAG 


CCT 


TTT 


TCC 


GGC 


TCG 






1049-1057. 1117 a 


817-837 b 


1116 


5' 


-TTT CGG 


GCT 


GTG AGG 


TCG 


GBT GHG 


CGG 


1049-1057, 1117 a 


705-731 b 


1114 c 


5' 


-TCW GAG 


CCT 


TTT 


TCC 


GGC 


TCG 






1049-1057, 1117 a 


817-837 b 


1112 


5' 


-GGC TGY 


GAT 


ATT 


CAA 


AGC 


TC 






1049-1057, 1117 a 


437-456 b 


1118 c 


5' 


-TTT TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 




1049-1057,11173 


817-840 b 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the vanA sequence fragment (SEQ ID NO. 1051) . 
15 c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Lasting. 
^ Sequences from databases. 



303 



WO 01/23604 



PCT/CAOO/01150 



Annex XXXIV i Specific and ubiquitous primers for nucleic acid 
amplification (van sequenc 8) (continu d) . 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 Resistance genes : 



vanA, vanB (continued) 



15 



20 



25 



30 



35 



40 



50 



55 



1115 


5' 


-TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT GHG CG 


1049-1057, 1117 a 


705 


-730 b 


1118 c 


5 ' 


-TTT 


TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 




1049-1057 , 1117 a 


817 


-840^ 


1116 


5' 


-TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT 


GHG 


CGG 


1049-1057, 1117 a 


705 


-731 b 


1118 c 


5' 


-TTT 


TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 




1049-1057,1117* 


817 


-840 b 


1119 


5' 


-TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT 


GHG 


C 


1049-1057 ,1117 a 


705 


-729 b 


1118 c 


5* 


-TTT 


TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 




1049-1057, 1117 a 


817 


-840 b 


1120 


5' 


-TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT 


GHG 




1049-1057, 1117 a 


705 


-728 b 


1118 c 


5' 


-TTT 


TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 




1049-1057, 1117 a 


817 


-840 b 


1121 


5' 


-TGT 


TTG 


WAT 


TGT 


CYG 


GYA 


TCC 


C 




1049-1057, 1117 a 


408 


-429 b 


llll c 


5' 


-CTT 


TTT 


CCG 


GCT 


CGW 


YTT 


CCT 


GAT 


G 


1049-1057,1117° 


806 


-830 b 


1112 


5' 


-GGC 


TGY 


GAT 


ATT 


CAA 


AGC 


TC 






1049-1057, 1117 a 


437 


-456 b 


llll c 


5' 


-CTT 


TTT 


CCG 


GCT 


CGW 


YTT 


CCT GAT G 


1049-1057, 1117 a 


806 


-830 b 


1123 


5' 


-TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT G 




1049-1057, 1117 a 


705 


-726 b 


llll c 


5' 


-CTT 


TTT 


CCG 


GCT 


CGW 


YTT 


CCT 


GAT 


G 


1049-1057, 1117 a 


806 


-830 b 


1112 


5' 


-GGC 


TGY 


GAT 


ATT 


CAA 


AGC 


TC 






1049-1057, 1117 a 


437 


-456 b 


1124 c 


5' 


-GAT 


TTG 


RTC 


CAC 


YTC 


GCC 


KAC 


A 




1049-1057, 1117 a 


757 


-778 b 


Resistance aene 






van CI 
















1103 


5' 


-ATC 


CCG 


CTA 


TGA 


AAA 


CGA 


TC 






1058-1059 a 


519 


-538 d 


1104 c 


5' 


-GGA 


TCA 


ACA 


CAG 


TAG 


AAC 


CG 






1058-1059 a 


678 


-697 d 



Resistance genes: 



van CI, vanC2, vanC3 



1097 



5 ' -TCY TCA AAA GGG ATC ACW AAA GTM AC 



1058-1066 a 



1098 c 


5' 


-TCT 


TCA 


AAA 


TCG 


AAA AAG 


CCG 


TC 


1058 


-1066 a 


1099 


5' 


-TCA 


AAA 


GGG 


ATC 


ACW AAA 


GTM 


AC 


1058 


-1066 a 


1100 c 


5' 


-GTA 


AAK 


CCC 


GGC 


ATR GTR 


TTG 


ATT TC 


1058 


-1066 a 


1101 


5' 


-GAC 


GGY 


TTT 


TTY 


GAT TTT 


GAA 


GA 


1058 


-1066 a 


1102 c 


5' 


-AAA AAR 


TCG 


ATK CGA GCM AGA CC 


1058 


-1066 a 



Resistance genes : 



vanC2, vanC3 



1105 5'-CTC CTA CGA TTC TCT TGA YAA ATC A 

1106 c 5 '-CAA CCG ATC TCA ACA CCG GCA AT 



607-632 d 
787-809 d 

6l0-632 d 
976-1001 d 

787-809 d 
922-944 d 



1060-1066, 1140 a 487-511 e 
1060-1066, 1140 a 690-712 e 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the vanA sequence fragment (SEQ ID NO. 1051). 

c These sequences are from the complementary DNA strand of the sequence of the originating fragment 

60 given in the sequence Listing. 

d The nucleotide positions refer to the vanCl sequence fragment (SEQ ID NO. 1058). 

e The nucleotide positions refer to the vanC2 sequence fragment (SEQ ID NO. 1140) . 
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Annex XXXIVt Sp cif ic and ubiquitous primers for nucleic acid 
amplification (van s qu nces) (continued). 

— Originating DNA fragment 



5 


SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 


Nucleotide 










NO. 


position 


-10 


- Resistance- crene 




vanD 


_ 







1591 


5 ' -ATG 


AGG 


TAA TAG AAC GGA TT 


1594 


797-B37 




1592 b 


5 ' -CAG 


TAT 


ttv hfiT aar (Tt 

X 1\. Aul AAIj Lui AAA 






15 


Resistance qene: 


vans 








1595 


5 '-AAA 


TAA 


TGC TCC ATC AAT TTG CTG A 


1599 a 


74-98 




1596 b 


5 ' -ATA 


GTC 


rii fijilv CCC 2VPP Ch.C* Tib.CZ 
wiA AAA nlU LAL AAV» 


1 RQ Qcl 




20 


1597 


5 ' -GAT 


GAA 


TTT GCG AAA ATA CAT GGA 


1599 a 


163-186 




1598 b 


5 ' -CAG 


CCA 


ATT TCT ACC CCT TTC AC 














Sequencing primers 


(vanAB) 




25 


1112 


5'-GGC 


TGY 


GAT ATT CAA AGC TC 


1139 a 


737-756 




llll b 


5'-CTT 


TTT 


CCG GCT CGW YTT CCT GAT G 


1139 a 


1106-1130 










Sequencing primers 


( vanA t vanX, 


vany) 


30 


1150 


5 ' -TGA 


TAA 


TCA CAC CGC ATA CG 


1141 a 


860-879 




1151 b 


5 ' -TGC 


TGT 


CAT ATT GTC TTG CC 


1141 a 


1549-1568 




1152 


5 ' -ATA 


AAG 


ATG ATA GGC CGG TG 


1141 a 


1422-1441 




1153 b • 


5 ' -CTC 


GTA 


TGT CCC TAC AAT GC 


1141 a 


2114-2133 


35 


1154 


5 ' -GTT 


TGA 


AGC ATA TAG CCT CG 


1141 a 


2520-2539 




1155 b 


5' -CAG 


TGC 


TTC ATT AAC GTA GTC 


1141 a 


3089-3109 



a Sequences from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex xxxiv: sp cific and ubiquitous primers for nucleic acid 
amplification (van sequences) (continued). 

Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 Sequencing primer s— ( vanCl) r 

1110 5'-ACG AGA AAG AC A ACA GGA AGA CC 1138 a 122-144 

1109 b 5 ' -ACA TCG TGA TCG CTA AAA GGA GC 1138 a 1315-1337 

15 Sequencing primers (vanC2, vanC3) 

1108 5'-GTA AGA ATC GGA AAA GCG GAA GG 1140 a 1-23 

1107° 5 ' -CTC ATT TGA CTT CCT CCT TTG CT 1140 a 1064-1086 



a Sequences from databases. 

k These sequences are from the complementary DMA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XXXV: Internal hybridization prob 8 
d tection of van sequences. 



for specific 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



20 



25 



Resistance gene: 



vanA 



1170 5' -ACG AAT TGG ACT ACG CAA TT 
2292 5'-GAA TCG GCA AGA CAA TAT G 



Resi stan ce gene: 



vanB 



1171 5' -ACG AGG ATG ATT TGA TTG TC 

2294 5 ' -AAA CGA GGA TGA TTT GAT TG 

2295 5' -TTG AGC AAG CGA TTT CGG 

Re sistanc e g e ne: vanD 

2297 5'-TTC AGG AGG GGG ATC GC 



1049-1057 a 
2293 c 



1117 c 
2296 a 
2296 a 



1594 c 



639-658 b 
583-601 



560-579 
660-679 
614-631 



458-474 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the vanA sequence fragment (SEQ ID NO. 

30 1051). 

c Sequences from databases. 
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Annex XXXVI: Specific and ubiquitous primers for nucleic acid 
amplification (pbp sequences). 



Originating DNA fragment 





^EO ID NO 


Nucleotide 




NO. 


iNJuci eot lue 
position 


10 


Resi-st-ance- 


-aene-: 













1129 
1131 C 


5 ' -ATG 
5 ' -CAT 


ATG 
CTG 


ACH GAM ATG ATG AAA AC 
GAG CTA CRT ARC CAG T 


1004-1018 a 
1004-1018 a 




681-703 b 
816-837 b 


15 


1130 
1131 


5 ' -GAC 
5 ' -CAT 


TAT 
CTG 


CCA AGC ATG CAT TAT G 
GAG CTA CRT ARC CAG T 


1004-1018 a 
1004-1018 a 


456-477 b 
816-837 b 


20 


2015 
2016 c 


5 ' -CCA 
5 ' -TAD 


AGA 
CCT 


AGC TCA AAA ACA TCT G 
GTC CAW ACA GCC AT 


2047 d 
2047 d 


909-930 
1777-1796 








Sequencing primers 


(pbpla) 




25 


1125 
1126 c 


5 ' -ACT 
5 ' -TTA 


CAC 
TGG 


AAC TGG GAT GGA TG 
TTG TGC TGG TTG AGG 


1169 d 
1169 d 


873-892 
2140-2160 


1125 
1128 c 


5' -ACT 
5 ' -GAC 


CAC 
GAC 


AAC TGG GAT GGA TG 
YTT ATK GAT ATA CA 


1169 d 
1169 d 


873-892 
1499-1518 


30 


1127 
1126 c 


5 ' -KCA 
5' -TTA 


AAY 
TGG 


GCC ATT TCA AGT AA 
TTG TGC TGG TTG AGG 


1169 d 
1169 d 


1384-1403 
2140-2160 



Sequencing primers (pbp2b) 





1142 


5' 


-GAT 


CCT 


CTA 


AAT 


GAT 


TCT CAG 


GTG G 


1172 d , 


1-25 


35 


1143 c 


5' 


-CAA 


TTA 


GCT 


TAG 


CAA 


TAG GTG 


TTG G 


1172 d 


1481-1505 




1142 


5' 


-GAT 


CCT 


CTA 


AAT 


GAT 


TCT CAG 


GTG G 


1172 d 


1-25 




1145 c 


5' 


-AAC 


ATA 


TTK 


GGT 


TGA 


TAG GT 




1172 d 


793-812 


40 


1144 


5' 


-TGT 


YTT 


CCA 


AGG 


TTC 


AGC TC 




1172 d 


657-676 




1143 c 


5' 


-CAA 


TTA 


GCT 


TAG 


CAA 


TAG GTG 


TTG G 


1172 d 


1481-1505 



Sequencing primers (pJbp2x) 



\5 



50 



1146 


5' 


-GGG 


ATT ACC TAT GCC AAT ATG 


AT 


1173 d 


219- 


241 


1147 c 


5' 


-AGC 


TGT GTT AGC VCG AAC ATC 


TTG 


1173 d 


1938- 


1961 


1146 


5' 


-GGG ATT ACC TAT GCC AAT ATG 


AT 


1173 d 


219- 


241 


1149 c 


5' 


-TCC 


YAC WAT TTC TTT TTG WG 




1173 d 


1231- 


1250 


1148 


5' 


-GAC 


TTT GTT TGG CGT GAT AT 




1173 d 


711- 


730 


1147 c 


5' 


-AGC 


TGT GTT AGC VCG AAC ATC 


TTG 


1173 d 


1938- 


1961 



>5 

a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the pbpla sequence fragment <SEQ ID NO. 1004). 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
>0 d Sequences from databases. 
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Annex XXXVII: 



Internal hybridization probes 
d tection of pbp b guences. 



for specific 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO, 



Nucleotide 
position 



10 



15 



20 



25 



30 



35 



40 



45 



Resistance gene ; 



pbpla 



1132 
1133 
1134 
1135 
1192 
1193 
1194 
1195 
1196 
1197 
1094 
1214 
1216 
1217 
1218 
1219 
1220 
2017 
2018 
2019 
2020 
2021 
2022 
2023 
2024 
2025 
2026 

2027 
2028 
2029 
2030 
2031 
2032 
2033 
2034 



5 ' -AGT GAA AAR ATG GCT GCT GC 
5 ' -CAT CAA GAA CAC TGG CTA YGT AG 
5 ' -CTA GAT AGA GCT AAA ACC TTC CT 
5 ' -CAT TAT GCA AAC GCC ATT TCA AG 
5'-GGT AAA ACA GGA ACC TCT AAC T 
5' -GGT AAG ACA GGT ACT TCT AAC T 
5 ' -CAT TTC AAG TAA TAC AAC AGA ATC 
5 ' -CAT TTC AAG TAA CAC AAC TGA ATC 
' -GCC ATT TCA AGT AAT ACA ACA GAA 
' -CAA ACG CCA TTT CAA GTA ATA CAA C 

-GGT AAA ACA GGT ACT TCT AAC TA 
' -GGT AAA ACA GGT ACC TCT AAC TA 
-GGT AAG ACT GGT ACA TCA AAC TA 
' -CAA ATG CCA TTT CAA GTA ACA CAA C 
-CAA ACG CCA TTT CAA GTA ACA CAA C 
' -CAA ATG CTA TTT CAA GTA ATA CAA C 
' -CAA ACG CCA TTT CAA GTA ATA CGA C 
' -ACT TTG AAT AAG GTC GGT CTA G 
' -ACA CTA AAC AAG GTT GGT TTA G 
' -ACA CTA AAC AAG GTC GGT CTA G 
5 ' -GTA GCT CCA GAT GAA ATG TTT G 
5 ' -GTA GCT CCA GAC GAA ATG TTT G 
-GTA GCT CCA GAT GAA ACG TTT 
-GTA ACT CCA GAT GAA ATG TTT 
-AGT GAA AAG ATG GCT GCT GC 
-AGT GAG AAA ATG GCT GCT GC 
'-TCC AAG CAT GCA TTA TGC AAA CG 

5 ' -TCG GTC TAG ATA GAG CTA AAA CG 
5 ' -TAT GCT CTT CAA CAA TCA CG 
5 ' -AGC CGT TGA GAC TTT GAA TAA G 
5 ' -CTT AAT GGT CTT GGT ATC G 
-CGT GAC TGG GGT TCT GCT ATG A 
'-CGT GAC TGG GGA TCA TCA ATG 
-CGT GAC TGG GGT TCT GCC ATG 
-ATC AAG AAC ACT GGC TAT GTA 



5' 
5' 
5' 
5' 
5' 
5' 
5' 
5' 
5' 
5' 
5' 
5' 



G 
G 



A 
A 
G 



1004-1018 3 
1004-1018 a 
1004-1018 3 
1004-1018 a 
1004-1018 a 
1004-1018 3 
1004-1018 3 
1004-1018 a 
1004-1018 a 
1004-1018 a 
1004-1018 a 
1004-1018 a 
1004-1018 a 
1004-1018 a 
1004-1018 a 
10O4-lO18 a 
1004-1018 a 

2047 c 
2063 
2064 

2140 c 
2057 

2053 c 
2056 

2048° 

2047 c 

2047 c 

2047 c 
2047 c 
2047 c 
2047 c 
2049 c 
2047 c 
2057 
2050 c 



531-550 b 
606-828 b 
417-439 b 
471-493 b 
759-780 b 
759-780 b 
485-508 b 
485-508 b 
483-506 b 
478-502 b 
759-781 b 
759-781 b 
759-781 b 
478-502 b 
478-502 b 
478-502 b 
478-502 b 
1306-1327 
354-375 
346-367 
1732-1753 
831-852 
805-826 
819-840 
1438-1457 
1438-1457 
1368-1390 



1319- 
1267- 
1296- 
1345- 
1096- 
1096- 
195- 
787- 



1341 

1286 

1317 

1366 

1117 

1117 

216 

808 



50 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the pbpla sequence fragment 



(SEQ ID NO. 



1004) . 

Sequence from databases. 
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Annex XXXVII: 



Internal hybridization probes for specific 
detection of pbp sequences (continued) ♦ 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



Resistance oene : 



pbpla (continued) 



2035 
2036 
2037 
2038 
2039 



5 ' -ATC AAG AAC ACT GGC TAC GTA G 
5 ' -ATC AAG AAC ACT GGT TAC GTA G 
5 ' -ATC AAA AAT ACT GGT TAT GTA G 
5 ' -ATC AAG AAT ACT GGC TAC GTA G 
5' -ATC AAA AAC ACT GGC TAT GTA G 



2051 c 
2047 
2057 
2052 c 
2053 c 



787-808 
1714-1735 
813-834 
757-778 
787-808 



20 
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Annex XXXIX: internal hybridization probe for sp cific 
det ction of mecA. 

____ __ _— originating DNA fragment 

5 — 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 — Resistance gene: mecA 



1177 5 ' -GCT CAA CAA GTT CCA GAT TA 1178 a 1313-1332 

15 



a Sequence from databases. 
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Annex XLi Specific and ubiquitous primers for nucleic acid 

amplification (hexA sequ nc s). 



5 Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

Bacterial species : Streptococcus pneumoniae 

1179 5 ' -ATT TGG TGA CGG GTG ACT TT 1183 a 431-450 

1181 b 5'-AGC AGC TTA CTA GAT GCC GT 1183-1191 c 652-671 d 



15 



20 



Sequencing primers 

1179 5' -ATT TGG TGA CGG GTG ACT TT 1183 a 431-450 

1182 b 5'-AAC TGC AAG AGA TCC TTT GG 1183 a 1045-1064 



a Sequences from databases. 

° These sequences are from the complementary DNA strand of the sequence of the 
25 originating fragment given in the Sequence Listing. 

c These sequences were aligned to derive the corresponding primer, 
d The nucleotide positions refer to the hexA sequence fragment (SEQ ID NO. 
1183) . 
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Annex XLIt Internal hybridization probe for sp cific d tection 
of hexA s quenc s. 



5 Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

Bacterial species : Streptococcus pneumoniae 

1180 a 5'-TCC ACC GTT GCC AAT CGC A 1183-1191 b 629-647 c 

15 



a This sequences is from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing, 
k These sequences were aligned to derive the corresponding primer. 

20 c The nucleotide positions refer to the hexA sequence fragment (SEQ ID NO. 
1183) . 
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Ann x XLXIIs Specific and ubiquitous primers for nucleic acid 
amplification (pep sequence). 

___ _ Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species: Streptococcus pyogenes 

1211 5' -ATT CTT GTA ACA GGC TTT GAT CCC 1215 a 291-314 

1210 b 5'-ACC AGC TTG CCC AAT ACA AAG G ' 1215 a 473-494 



a Sequences from databases. 

k These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XLIV: Sp cific and ubiquitous primers for nucl ic acid 
amplification of S. saprophytics sequences of 
unknown coding potential. 



Originating DNA fragment 

SEQ- ID-.N0 Nucleotide^ sequence^ SEQ ID Nucleotide 

NO^ position — 

Bacterial species : Staphylococcus saprophyticua 

1208 5'-TCA AAA AGT TTT CTA AAA AAT TTA C 74,1093, 169-193 c 

1198 b 

1209 a 5'-ACG GGC GTC CAC AAA ATC AAT AGG A 74,1093, 355-379° 

1198 b 



a This sequence is from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
b These sequences were aligned to derive the corresponding primer. 
c The nucleotide positions refer to the S. saprophytics unknown gene sequence 

fragment (SEQ ID NO. 1198). 
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Annex XLVj Molecular beacon internal hybridization prob s for 
sp cific d tection of antimicrobial ag nts 
resistance gene sequences. 



Originating DNA fragment 

~SE(TID- NO — Nucleotide -sequenced SEQ ID Nucleotide 

NO. position 

Resistance aene : gyrh 

2250 5 ' -CCS ICS GAT GGT GTC GTA TAC CGC GGA GTC 1954 b 218-243 

GCC SAC QQ 

2251 5' -CSS ACC CGT TCT CGC TGC GTT ACA TGC TGG 1954 b 259-286 

TGG CIC CS 

Resistance gene: mech 

1231 5' -GCG AGC CCG AAG ATA AAA AAG AAC CTC TGC 1178 b 1291-1315 
TGC ICS C 

Resistance gene: pare 



1938 b 


5 


' -CCS CSC 
TCT CCS 


ACC 
CSC 


ATT GCT TCG 

ss 


TAC 


ACT GAG 


GAG 


1321 c 


232 


-260 


1939 


5 


' -CGA CCC 
GCC AGC 


GCA 
GGC 


TGG TAG TAT 
CGG GTC G 


CGA 


TAA TGA 


TCC 


1321 c 


317 


-346 


1955 b 


5 


'-CGC GCA 
TCI GCG 


ACC 
CS 


ATT GCT TCG 


TAC 


ACT GAG 


GAG 


1321 c 


235 


-260 


Resistance 


qene: 




vanA 














1239 


5 


'-GCG AGC 
CGC 


GCA 


GAC CTT TCA GCA GAG GAG SCI 


1051 


860 


-880 


1240 


5 


' -GCG A£C 
TCS CTC 


CGG 
SC 


CAA GAC AAT 


ATG 


ACA GCA 


AAA 


1051 


663 


-688 


17 ■? c h a 








<trj» <nTS 















1241 5' -GCG AGS GGG GAA CGA GGA TGA TTT GAT TGS 1117 555-577 

CTC QQ 

Resistance gene: vano 

1593 5' -CCS AGC CAT TTA CCG GAT ACT TGG CTG ICG 1594 835-845 

CTC fifi 



a Underlined nucleotides indicate the molecular beacon's stem. 

b This sequence is from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

c Sequence from databases. 
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Annex XLVIs Molecular beacon internal hybridization probe £ r 
specific detection of S. aureus gen sequences of 
unknown coding potential. 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence* SEQ "ID Nucleotide 

NO. position 



Bacterial species: S. aureus 

1232 5 / -GGA QCC QCG CGA TTT TAT AAA TGA ATG TTG 1244 53-80 

ATA ACC fiSC TC£ 



a Underlined nucleotides indicate the molecular beacon's stem. 



320 



WO 01/23604 



PCT/CA00/0115O 



Annex XLVIIt Mol cular beacon internal hybridization probes for 
specific detecti n of tuf s quences. 



Originating DNA fragment 



SEQ ID NO. 


Nucleotide 


sequence* 


SEQ ID 
NO. 


Nucleotide 
position 


Bacterial 


species: 




Chlamydia pneumoniae 


- 




2091 


5 ' -CGC GAC 
TTC TTG 


TTG 
GTC 


AGA TGG AAC TTA GTG AGC 
GCG 


20 


157-183 


2092 


5' -CSC Qh£ 


GAA 
TCC 


AGA ACT TCC TGA AGG TCG 
AG 


20 


491-516 


Bacterial 


species: 




Chlamydia trachomatis 






2213 


5 ' -CGT GCC 
GAC GCT 


ATT 
GAA 


GAC ATG ATT TCC GAA GAA 
GGC ACG 


1739 b 


412-441 


Bacterial 


SDecies : 




Knterococcus faecalis 






1236 


5 ' -GCG AGC 
GCC ICS 


CGT 

c 


GGT GAA GTT CGC GTT GGT 


883 


370-391 


pacterial 


species: 




EnterococcuB faecium 






1235 


5 '-GCG AGC 
TGC TGG 


CGA 
CTC 


AGT TGA AGT TGT TGG TAT 
GC 


64 


412-437 


pacterial specie?: 




Legionella pneumophila 






2084 c 


5'-CAC GCG 
TTT TGC 


TCA 

gcg 


ACA CCC GTA CAA GTC GTC 


112 


461-486 


Pacterial 


species: 




Mycoplasma pneumoniae 






2096 c 


5' -CGC GAC 
TGT CGC 


CGG 
G 


TAC CAC GGC CAG TAA TCG 


2097 b 


658-679 


Bacterial 


species: 




Neisseria gonorrhoeae 







2177 5 ' -GGC ACG GAC AAA CCA TTC CTG CTG CCT 126 323-357 

ATC GAA ACG TGT TCC CGI GCC 

2178 5' -GGC ACG ACA AAC CAT TCC TGC TGC CTA 126 323-348 

TCG AAC GIG CC 

2179 5' -GGC AGC TCT ACT TCC GTA CCA CTG ACG 126 692-718 

TAA CCG GCT GCC 



a Underlined nucleotides indicate the molecular beacon's stem. 
b Sequence from databases. 

c This sequence is from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XLVXI; Mol cular beacon internal hybridization prob s for 
sp cif ic detection of tuf 8 quences (continued) • 



Originating DNA fragm nt 



SEQ ID NO. Nucleotide sequence 3 SEQ ID Nucleotide 

NO. position 



Bacterial snecies: 




Pseudomonas aeruginosa 






2122 


5' -CCG AGC 
CTG CTC 


GAA 
GG 


TGT AGG AGT CCA GGG TCT 153 , 880, 2138 b ' 


c 280-302 d 


Bacterial species: 




Staphylococcus aureus 






2186 


5 ' -ACG CGC 
TAT CAA 


TCA 


AAG CAG AAG TAT ACG TAT 
ACG CGC 02 


1728 


615-646 


Bacterial arour> 






Staphylococcus Bp. other 


than 5. 


aureus 


1233 


5'-GCG £GC 
CGC CTC 


GTT 
GC 


ACT GGT GTA GAA ATG TTC 


878 


372-394 


Funqal 


species: 






Candida albicans 






2073 


5' -CCG 
AAC 


AGC 
TGG 


AAC 

CTC 


ATG ATT GAA CCA TCC ACC 
GG 


408 


404-429 


Funqal 


specie?: 






Candida dubliniensis 






2074 


5' -CCG 
AAC 


AGC 
TGG 


AAC 
CTC 


ATG ATT GAA GCT TCC ACC 
GG 


414 


416-441 


Fungal. 


Species: 






Candida glabrata 






2110 b 


5'-GCG 
TGG 


GGC 
ATT 


CCT 
CAG 


TAA CGA TTT CAG CGA ATC 
CCC GC 


417 


307-335 


2111 


5'-GCG 
CTT 


GGC 
CCT 


ATG 
GGC 


TTG AAG CCA CCA CCA ACG 
CCG C 


417 


419-447 


Funqal 


species: 






Candida krusei 







2112 b 5'-GCG GGC TTG ATG AAG TTT GGG TTT CCT 422 318-347 
TGA CAA TTG CCC GC 

2113 5'-GCG GGC ACA AGG GTT GGA CTA AGG AAA 422 419-447 

CCA AGG CAG CCC GC 

2114 5'-GCG GGC ATC GAT GCT ATT GAA CCA CCT 422 505-533 

GTC AGA CCG CCC GC 



a Underlined nucleotides indicate the molecular beacon's stem. 
b Sequence from databases. 

c These sequences were aligned to derive the corresponding primer. 
d The nucleotide positions refer to the P. aeruginosa tuf sequence fragment (SEQ 
ID NO. 153) . 
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Annex XLVZI: Molecular beacon int mal hybridization probes £ r 
sp cific det ction of tuf sequences (continued). 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence 9 SEQ ID Nucleotide 

NO. position 



Fungal species : Candida lueitaniae 

2115 b 5'-Q££ GGC GGT AAG TCC ACC GGT AAG ACC 424 304-330 
TTG TTG GCC CGC 

2116 5 ' -GCG GGC GTA AGT CAC CGG TAA GAC CTT 424 476-502 

GTT GGC CCG C 

2117 5' -GCG GGC GAC GCC ATT GAG CCA CCT TCG 424 512-535 

AGA GCC CGC 

Fungal Sffecies: Candida parapailosie 

2118 b 5' -GCG GGC TCC TTG ACA ATT TCT TCG TAT 426 301-330 

CTG TTC TTG GCC CGC 

Fungal species : Candida tropicalis 

2119 5'-GCG GGC TTA CAA CCC TAA GGC TGT TCC 429 357-384 

ATT CGT TGC CCG C 

2120 5 ' -GCG GGC AGA AAC CAA GGC TGG TAA GGT 429 459-487 

TAC CGG AGC CCG C 

Fungal species : Cryptococcus neoformans 

2106 5' -GCG AGC AGA GCA CGC CCT CCT CGC CGC 623 , 1985, 1986 c 226-244 d 

ICG C 

2107 5' -GCG AGC TCC CCA TCT CTG GTT GGC ACG 623,1985, 1986° 390-408 d 

CTC GC 

3figterjal genus: Legionella ep. 

2083 5'-CCG CCG ATG TTC CGT AAA TTA CTT GAI 111-112° 488-519 e 

GAA GGT CGA GCC GGC GG 



a Underlined nucleotides indicate the molecular beacon's stem. 

b This sequence is from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

c These sequences were aligned to derive the corresponding primer. 

d The nucleotide positions refer to the C. neoformans tuf (EF-1) sequence 
fragment (SEQ ID NO. 623). 

e The nucleotide positions refer to the L. pneumophila tuf (EF-1) sequence 
fragment (SEQ ID NO. 112). 
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Annex XLVIIi Molecular beacon int rnal hybridization probes for 
specific detection of tuf sequences (c ntinued) • 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 3 



SEQ ID 
NO. 



Nucleotide 
position 



Fungal germs: 



Candida spY 



2108 5' -JSCS GQZ AAC TTC RTC AAG AAG GTT GGT 

TAC AAC CCfi £££ Q£ 



414 , 417, 
422,424,, 
426,429, 624 b 



52-80 c 



2109 5 / -GCG £££ CCA ATC TCT GGT TGG AAY GGT Same as SE G 

GAC AAG C£C GC ID NO. 2108 



100-125 c 



Bacterial group: 

2121 



Pseudomonads 



5'-CGA ££G CIA GCC GCA CAC CAA GTT C£fi 
GTC fi 



153-155, 
205,880,2137 d , 
2138 d ' b 



598-616 e 



Underlined nucleotides indicate the molecular beacon's stem. 

These sequences were aligned to derive the corresponding primer. 

The nucleotide positions refer to the C. albicans tuf (EF-1) sequence fragment 
(SEQ ID NO. 624) . 

Sequence from databases. 

The nucleotide positions refer to the P. aeruginosa tuf sequence fragment (SEQ 
ID NO. 153) . 
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Annex XLVIIIt Mol cular beacon internal hybridization probes for 
specific detection of ddl and mtl gene sequences. 

____________ _ originating DNA fragment 

SEQ ID NO. Nucleotide sequence 8 SEQ ID Nucleotide 

— . NO. position^ 



pacteyial gpeciep: E. faecium (ddl) 

1237 5 ' -GCG AGC CGC GAA ATC GAA GTT GCT GTA 1242 b 334-359 

TTA GGQ CIC SC 

Bacterial species ; E. faecalis (mtl) 

1238 5'-QCS AfiC GGC GTT AAT TTT GGC ACC GAA 1243 b 631-656 

GAA GAG £2£ S£ 



Underlined nucleotides indicate the molecular beacon's stem. 
Sequence from databases. 
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Annex XLIX: Int rnal hybridization probe for specific detection 
of S. aureus sequences f unknown coding potential. 



^^-^^ Originating DNA fragment 

SEQ ID NO. Nucleotid^^sequence SEQ ID Nucleotide 

- - ^ NO. position 



"Bact^r-iaa-s n p r j g g ; ^ staphylococcus aureus 

12 ^4 5 '-ACT AAA TAA^CG~CTC ATT-CC X244 35-54 
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Annex L; Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistanc g nee 
segu nces ) • 



SEQ ID NO/ 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance gene : 



aac(2')-la 



1344 
1345 b 



5' -AGC AGC AAC GAT GTT ACG CAG CAG 
5'-CCC GCC GAG CAT TTC AAC TfcJ-TG 



1348 a 
1348 a 



163-186 
392-414 



1346 
1347 b 



5 ' -GAT GTT ACG CAG CAG GGC ,AGT C 
5' -ACC AAG. .CAG GTT CGC AGT CAA GTA 



Resistance oene : 



aac (3')- lb 



1348 a 
1348 a 



172-193 
467-490 



1349 
1350 b 



5 ' -CAG CCG ACC AAT GAG TAT CTT GCC 
5' -TAA TCA GGG CAG TTG CGA CTC CTA 



Resistance gene: 



aac(3')-llb 



1351 a 
1351 a 



178-201 
356-379 



1352 
1353 b 



5' -CCA CGC TGA CAG AGC CGC ACC G 
5 ' -GGC CAG CTC CCA TCG GAC CCT G 



1356 a 
1356 a 



383-404 
585-606 



1354 
1355 b 



5'-CAC GCT GAC AGA GCC GCA CCG 
5'-ATG CCG TTG CTG TCG AAA TCC TCG 



Resistance gene: 



aac(3')-IVa 



1356 a 
1356 a 



384-404 
606-629 



1357 
1358 b 



5 '-GCC CAT CCA TTT GCC TTT GC 
5'-GCG TAC CAA CTT GCC ATC CTG AAG 



1361 a 
1361 a 



295-314 
517-540 



1359 
1360 b 



5' -TGC CCC TGC CAC CTC ACT C 

5 ' -CGT ACC AAC TTG CCA TCC TGA AGA 



1361 a 
1361 a 



356-374 
516-539 



Resistance aene : 



aac (3') -Via 



1362 
1363 b 



5' -CGC CGC CAT CGC CCA AAG CTG G 
5'-CGG CAT AAT GGA GCG CGG TGA CTG 



1366 s 
1366 a 



285-306 
551-574 



1364 
1365 b 



5' -TTT CTC GCC CAC GCA GGA AAA ATC 
5 ' -CAT CCT CGA CGA ATA TGC CGC G 



Resistance gene: 



aac(6')-Ia 



1366 a 
1366 a 



502-525 
681-702 



1367 
1368 b 



5 ' -CAA ATA TAC TAA CAG AAG CGT TCA 
5'-AGG ATC TTG CCA ATA CCT TTA T 



1371 a 
1371 a 



56-79 
269-290 



1379 
1380 b 



5 ' -AAA CCT TTG TTT CGG TCT GCT AAT 
5' -AAG CGA TTC CAA TAA TAC CTT GCT 



1371 a 
1371 a 



153-176 
320-343 



a Sequence from databases. 

b These sequences are from the complementary DNA strand 
originating fragment given in the Sequence Listing. 
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Annex Lt Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents r sistance genes 
sequ nc 8) (continued)* 



-ssQ^ID NO. 



Nucleotide sequence 



Resistance gene: 



jaac(6')-lc 



1372 
1373 b 



5' -GCT TTC GTT GCC TTT^GCC GAG GTC 
5 ' -CAC CCC TGT TGC TTC GCC CAC TC 



1374 
1375b 



5' -AGA TAT TGG CTT CGC CGC ACC ACA 
5' -CCC TGT TGC TTC GCC CAC TCC TG 



Resistance gene : 



ant (3')-I* 



1377 
1378 b 



5 ' -GCC GTG GGT CGA TGT TTG 
5' -GCT CGA TGA CGC CAA CTA 



1379 
1380 b 



5' -AGC AGC AAC GAT GTT ACG 
5 ' -CGC TCG ATG ACG CCA ACT 



Resistance gene: 

1382 5 ' -TAG 

1383 b 5 ' -CCC 



Resist ance gen**: 

• 1387 5'-TTA 
1338 b 5'-TAC 



ATG TTA 
CCT CTG 

CAG CAG 
ACC TCT 

ant (4') -la 

ATA TGA TAG GCG GTA AAA AGC 
AAA TTC GAG TAA GAG GTA TT 



aph(3')-I* 

TGC CTC TTC CGA CCA TCA AGC 
GCT CGT CAT CAA AAT CAC TCG 



Originating DNA fragment 

SEQ ID Nucleotide 
NO. position 



1376 a 157-180 

1376 a 304-326 

1376 a 104-127 

1376 s 301-323 



1381 a 100-123 

1381 a 221-244 

1381 a 127-150 

1381 a 222-245 



1386 a 149-172 

1386 a 386-408 

1386 a 151-172 

1386 a 388-409 



1391 a 233-256 
1391 a 488-511 



1384 5 ' -GAT ATG ATA GGC GGT AAA AAG C 

1385 b 5' -TCC CAA ATT CGA GTA AGA GGT A 



1389 5'-GAA TAA CGG TTT GGT TGA TGC GAG 1391 a 468-491 

1390 b 5 ' -ATG GCA AGA TCC TGG TAT CGG TCT 1391 a 669-692 

Resistfrflcg gene: aph(3')-iia 

1392 5 ' -TGG GTG GAG AGG CTA TTC GGC TAT 1396 a 43-66 

1393 b 5' -CAG TCC CTT CCC GCT TCA GTG AC 1396 a 250-272 

1394 5'-GAC GTT GTC ACT GAA GCG GGA AGG 1396 a 244-267 

1395 b 5' -CTT GGT GGT CGA ATG GGC AGG TAG 1396 a 386-409 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L: Specific and ubiquit us primers for nucleic acid 
amplification (antimicrobial ag nte resistanc gen s 
sequences) (continued) . 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 
NO. position 

Resistance gene: aph(3')-nia 

1397 5'-GTG GGA GAA AAT GAA AAC CTA T 1401 a 103-124 

1398 b 5'-ATG GAG TGA AAG AGC CTG AT 1401 a 355-374 

1399 5'-ACC TAT GAT GTG GAA CGG GAA AAG 1401 a 160-183 

1400 b 5'-CGA TGG AGT GAA AGA GCC TGA TG 1401 a 354-376 

Resistance gene: aph(3';-VIa 

1402 5 '-TAT TCA ACA ATT TAT CGG AAA CAG 1406 a 18-41 

1403 b 5' -TCA GAG AGC CAA CTC AAC ATT TT 1406 s 175-197 

1404 5 '-AAA CAG CGT TTT AGA GCC AAA TAA 1406 a 36-59 

1405 b 5'-TTC TCA GAG AGC CAA CTC AAC ATT 1406 a 177-200 

Resistance gene : blaCARB 

1407 5'-CCC TGT AAT AGA AAA GCA AGT AGG 1411 a 351-374 

1408 b 5'-TTG TCG TAT CCC TCA AAT CAC C 1411 a 556-577 

1409 5' -TGG GAT TAC AAT GGC AAT CAG CG 1411 a 205-227 

1410 b 5'-GGG GAA TAG GTC ACA AGA TCT GCT T 1411 a 329-353 

Resistance gene: blaCMY-2 

1412 5 '-GAG AAA ACG CTC CAG CAG GGC 1416 a 793-813 

1413 b 5 '-CAT GAG GCT TTC ACT GCG GGG 1416 a 975-995 

1414 5' -TAT CGT TAA TCG CAC CAT CAC 1416 a 90-110 

1415 b 5'-ATG CAG TAA TGC GGC TTT ATC 1416 a 439-459 

Resistance genes : blaCTX-M-1, blaCTX-M-2 

1417 5 ' -TGG TTA ACT AYA ATC CSA TTG CGG A 1423 a 314-338 

1418 b 5'-ATG CTT TAC CCA GCG TCA GAT T 1423 a 583-604 

Resistance gene : blaCTX-M-1 

1419 5'-CGA TGA ATA AGC TGA TTT CTC ACG 1423 a 410-433 

1420 b 5' -TGC TTT ACC CAG CGT CAG ATT ACG 1423 a 580-603 

1421 5 ' -AAT TAG AGC GGC AGT CGG GAG GAA 1423 a 116-139 

1422 b 5' -GAA ATC AGC TTA TTC ATC GCC ACG 1423 a 405-428 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequenc Listing. 

329 



WO 01/23604 



PCT/CAOO/01150 



Annex Lt Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance g nes 
seguenc s) (continued)* 

~ ~ ~ *" Originating DNA fragment 



SEQID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance aene: 



blaCTX-M-2 



1424 


5'- 


-GTT 


AAC 


GGT 


GAT 


GGC 


GAC 


GCT 


AC 




1428 a 


30- 


-52 


1425 b 


5'- 


-GAA 


TTA 


TCG 


GCG 


GTG 


TTA 


ATC 


AGC 




1428 a 


153- 


-176 


14lO 




-CAC 


GCT 


CAA 


TAC 


CGC 


OAT 




A 






Oil)- 


-531 


1427 b 


5'- 


•TTA 


TCG 


CCC 


ACT 


ACC 


CAT 


GAT 


TTC 




1428 a 


687- 


-710 


Resistance 


..qene : 






blaIMP 
















1429 


5 ' - 


-TTT 


ACG 


GCT 


AAA 


GAT 


ACT 


GAA 


AAG 


T 


1433 a 


205- 


-229 


1430 b 


5'- 


•GTT 


TAA 


TAA 


AAC 


AAC 


CAC 


CGA 


ATA 


AT 


1433 a 


513- 


-538 


1431 


5'- 


•TAA 


TTG 


ACA 


CTC 


CAT 


TTA 


CGG 


CTA 


A 


1433 a 


191- 


-215 


1432 b 


5'- 


•ACC 


GAA 


TAA 


TAT 


TTT 


CCT 


TTC 


AGG 


CA 


1433 a 


497- 


-522 


Resistance 


qene: 






blaOXA2 
















1434 


5'- 


•CAC 


AAT 


CAA 


GAC 


CAA 


GAT 


TTG 


CGA 


T 


1438 a 


319- 


-343 


1435 b 


5'- 


•GAA 


AGG 


GCA 


GCT 


CGT 


TAC 


GAT 


AGA 


G 


1438 a 


532- 


-556 


Resistance 


gene: 






blaOXAlO 
















1436 


5'- 


-CAG 


CAT 


CAA 


CAT 


TTA 


AGA 


TCC 


CCA 




1439 a 


194- 


-217 


1437 b 


5'- 


•CTC 


CAC 


TTG 


ATT 


AAC 


TGC 


GGA 


AAT 


TC 


1439 a 


479- 


-504 


Resistance 


aene: 






blaPER-1 
















1440 


5 '- 


AGA 


CCG 


TTA 


TCG 


TAA 


ACA 


GGG 


CTA 


AG 


1442 a 


281- 


-306 


1441 b 


5'- 


TTT 


TTT 


GCT 


CAA 


ACT 


TTT 


TCA 


GGA 


TC 


1442 a 


579- 


-604 



Resistance gene : blaPER-2 

1443 5'-CTT CTG CTC TGC TGA TGC TTG GC 1445 a 32-54 

1444 b 5' -GGC GAC CAG GTA TTT TGT AAT ACT GC 1445 a 304-329 

Resistance genes: blaPER-1, blaPER-2 



1446 


5* 


-GGC CTG 


YGA TTT GTT ATT TGA ACT GGT 


1442 a 


414 


-440 


1447 b 


5' 


-CGC TST 


GGT CCT GTG GTG GTT TC 


1442 a 


652 


-674 


1448 


5' 


-GAT CAG 


GTG CAR TAT CAA AAC TGG AC 


1442 a 


532 


-557 


1449 b 


5' 


-AGC WGG 


TAA CAA YCC TTT TAA CCG CT 


1442 a 


671 


-696 



Sequence from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance g nes 
sequ nc s) (continued) • 

— Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Resistance gene : blaSHV 



1883 5'-AGC 
1884 b 5' -GTA 


CGC 
TCC 


TTG AGC AAA 
CGC AGA TAA 


TTA 
ATC 


AAC TA 
ACC AC 


1900 a 
1900 a 


71 
763 


-93 
-785 


1885 5' -AGC 
1ftR4 b S ' -GTA 


GAA 
TCC 


AAA CAC CTT 
CGC AGA TAA 


GCC 
ATC 


GAC 
ACC 


AC 


1900 a 
1900 a 


313 
763 


-333 
-785 


Resistance aene: 




blaTEM 














1906 5'-CCT 
1907 b 5' -CAC 


TAT 
CTA 


TCC CTT 
TCT CAG 


TTT 
CGA 


TGC 
TCT 


GG 
GTC 


T 


1927 a 
1927 a 


27 
817 


-46 
-838 


1908 5 ' -AAC 
1907 b 5' -CAC 


AGC 
CTA 


GGT AAG 
TCT CAG 


ATC 
CGA 


CTT 
TCT 


GAG 
GTC 


AG 
T 


1927 a 
1927 a 


148 
817 


-170 
-838 


Resistance qene: 




catJ 
















2145 5'-GCA 
2146 b 5'-GGG 


AGA 
GCG 


TGT GGC 
AAG AAG 


GTG 
TTG 


TTA 
TCC 


CGG 
ATA 


T 

TT 


2147 a 
2147 a 


363 
484 


-384 
-506 


Resistance gene: 




catU 
















2148 5' -CAG 
2149 b 5 ' -ATC 


ATT 
AGG 


AAA TGC 
TAA ATC 


GGA 
ATC 


TTC 
AGC 


AGC C 
GGA TA 


2150 a 
2150 a 


67 
151 


-88 
-173 


Resistance gene: 




catUI 














2151 5 ' -ATA 
2152 b 5'-TAC 


TTT 
ACA 


CAG CAT 
ACT CTT 


TAC 
GTA 


CTT 
GCC 


GGG 
GAT 


TT 
TA 


2153 a 
2153 a 


419 
603 


-441 
-625 



Resistance gene: catP 

2154 5' -CGC CAT TCA GAG TTT AGG AC 2156 a 178-197 

2155 b 5 ' -TTC CAT ACC GTT GCG TAT CAC TT 2156 a 339-361 

Resistance gene: cat 

2157 5 ' -CCA CAG AAA TTG ATA TTA GTG TTT TAT 2159 a 89-115 

2158 b 5'-TCG CTA TTG TAA CCA GTT CTA 2159 a 201-221 

2160 5' -TTT TGA ACA CTA TTT TAA CCA GC 2162 a 48-70 

2161 b 5 ' -GAT TTA ACT TAT CCC AAT MC CT 2162 a 231-253 



Sequence from databases. 

These sequences are from the complementary DNA strand of the s quence of the 
originating fragment given in the Sequence Listing. 

331 



WO 01/23604 



PCT/CAOO/01150 



Annex L: Specific and ubiquitous primers for nucl ic acid 
anplificati n (antimicrobial agents resistance gen s 
sequenc s) (continu d) . 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

Pepi stance geqe : dfrA 

1450 5'-ACC ACT GGG AAT ACA CTT GTA ATG GC 1452 a 106-131 

1451 b 5' -ATC TAC CTG GTC AAT CAT TGC TTC GT 1452 a 296-321 

Resistance gene : dhfrla 

1457 5 ' -CAA AGG TGA ACA GCT CCT GTT T 1461 a 75-96 

1458 b 5'-TCC GTT ATT TTC TTT AGG TTG GTT AAA 1461 a 249-275 

1459 5' -AAG GTG AAC AGC TCC TGT TT 1461 a 77-96 

1560 b 5' -GAT CAC TAC GTT CTC ATT GTC A 1461 a 207-228 

Resistance genes : dhfrla, dhfrXV 

1453 5'-ATC GAA GAA TGG AGT TAT CGG RAA TG 1461 a ' 27-52 

1454 b 5' -CCT AAA AYT RCT GGG GAT TTC WGG A 1461 a 384-408 

1455 5'-CAG GTG GTG GGG AGA TAT ACA AAA 1461 a 290-313 

1456 b 5' -TAT GTT AGA SRC GAA GTC TTG GKT AA 1461 a 416-441 

Resistance gene : dhfrlb 

1466 5' -AAG CAT TGA CCT ACA ATC AGT GT 1470 a 98-120 

1467 b 5 ' -AAT ACA ACT ACA TTG TCA TCA TTT GAT 1470 a 204-230 

1468 5'-CGT TAC CCG CTC AGG TTG GAC ATC AA 1470 a 183-208 

1469 b 5 '-CAT CCC CCT CTG GCT CGA TGT CG 1470 a 354-376 

Resistance gene: dhfrv 

147i~~ 5 ' -GAT AAT GAC AAC GTA ATA GTA TTC CC 1475 a " 208-233 

1472 b 5 ' -GCT CAA TAT CAA TCG TCG ATA TA 1475 a 342-364 

1473 5'-TTA AAG CCT TGA CGT ACA ACC AGT GG 1475 a 95-120 

1474 b 5 ' -TGG GCA ATG TTT CTC TGT AAA TCT CC 1475 a 300-325 

Resistance cgnQp; dhfrlb, dhfrV 

1462 5 ' -GCA CTC CCY AAT AGG AAA TAC GC 1470 a 157-179 

1463 b 5' -AGT GTT GCT CAA AAA CAA CTT CG 1470 s 405-427 

1464 5'-ACG TTY GAA TCT ATG GGM GCA CT 1470 a 139-161 

1465 b 5 ' -GTC GAT AAG TGG AGC GTA GAG GC 1470 a 328-350 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents r sistance g n s 
sequ nces) (continued). 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ id 
NO. 



Nucleotide 
position 



Resistance gene : dhfrVT 

1476 5' -GGC GAG CAG CTC CTA TTC AAA G 



1480 a 



79-100 



1477 b 


5 ' -TAG 


GTA 


AGC TAA TGC CGA TTC AAC A 


1480 a 


237-261 


1478 
1479 b 


5' -GAG 
5 ' -GCG 


AAT 
AAA 


GGA GTA ATT GGC TCT GGA TT 
TAC ACA ACA TCA GGG TCA T 


1480 a 
1480 a 


31-56 
209-233 


Resistance 


gene : 




dhfrVII 






1485 
1486 b 


5 '-AAA 
5 ' -CAT 


ATG 
TTG 


GCG TAA TCG GTA ATG GC 
AGC TTG AAA TTC CTT TCC TC 


1489 a 
1489 a 


32-54 
189-214 


1487 
1488 b 


5'-AA? 
5 ' -AGA 


CGA 
CTA 


AAA TAT GCA GTA GTG TCG AG 
TTG TAG ATT TGA CCG CCA 


1489 a 
1489 a 


166-191 
294-317 


Resistance 


aenes : 




dhfrVII, dhfrXVII 






1481 
1482 b 


5'-RTT 
5'-TAA 


ACA 
TTT 


GAT CAT KTA TAT GTC TCT 
ATA TTA GAC AWA AAA AAC TG 


1489 a 
1489 a 


268-291 
421-446 


1483 
1484 b 


5 ' -CAR 
5'-TKC 


YGT 
AAA 


CAG AAA ATG GCG TAA TC 
GCR WTT TCT ATT GAA GGA AA 


1489 a 
1489 a 


23-45 
229-254 


JR^f5iS5tence 


gene: 




dhfrVJJI 






1490 
1491 b 


5 ' -GAC 
5'-TCG 


CTA 
CCT 


TGA GAG CTT GCC CGT CAA A 
TCG TAC AGT CGC TTA ACA AA 


1494 a 
1494 a 


144-168 
376-401 


1492 
1493° 


5 ' -CAT TTT AGC TGC CAC CGC CAA TGG TT 
5'-GCG TCG CTG ACG TTG TTC ACG AAG A 


1494 a 
1494a " 


18-43 
245-269 


Reei stance gene: 


dhfrIX 






1495 
1496 b 


5'-TCT CTA AAC ATG ATT GTC GCT GTC 
5' -CAG TGA GGC AAA AGT TTT TCT ACC 


1499 a 
1499 a 


7-30 
133-156 


1497 
1498 b 


5'-CGG ACG ACT TCA TGT GGT AGT CAG T 
5' -TTT GTT TTC AGT AAT GGT CGG GAC CT 


1499 a 
1499 a 


171-195 
446-471 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance genes sequ nces) 
(continued) • 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 
NO. position 



Resistance gene : dhfrXII 

1500 5'-ATC GGG TTA TTG GCA ATG GTC CTA 

1501 b 5'-GCG GTA GTT AGC TTG GCG TGA GAT T 

1502 5 '-GCG GGC GGA GCT GAG ATA TAC A 

1503 b 5'-AAC GGA GTG GGT GTA CGG AAT TAC AG 

Resistance aene : dhfrXIII 

1505 5' -ATT TTT CGC AGG CTC ACC GAG AGC 

1506 b 5' -CGG ATG AGA CAA CCT CGA ATT CTG CTG 

Resistance gene : dhfrXV 

1508 5 ' -AGA ATG TAT TGG TAT TTC CAT CTA TCG 

1509 b 5 ' -CAA TGT CGA TTG TTG AAA TAT GTA AA 

1510 5' -TGG AGT GCC AAA GGG GAA CAA T 

1511 b 5'-CAG ACA CAA TCA CAT GAT CCG TTA TCG 

Resistance aene : dhfrXVII 

1513 5' -TTC AAG CTC AAA TGA AAA CGT CC 

1514 b 5 ' -GAA ATT CTC AGG CAT TAT AGG GAA T 

1515 5 ' -GTG GTC AGT AAA AGG TGA GCA AC 

1516 b 5'-TCT TTC AAA GCA TTT TCT ATT GAA GG 

2102 5'-CAC CTT CAC CCT GAC CGA CG 2105 a 822-841 

2103 b 5 ' -CGA ACC AGC GGA AAT AGT TGG AC 2105 a 948-970 

Resistance genes: ereX, ereA2 

1528 5'-AAC TTG AGC GAT TTT CGG ATA CCC TG 153 0 a 80-105 

1529 b 5' -TTG CCG ATG AAA TAA CCG CCG ACT 153 0 a 317-340 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 



1504 a 50-73 

1504 a 201-225 

1504 a 304-325 

1504 a 452-477 



1507 a 106-129 
1507 a 413-439 



1512 a 215-241 

1512 a 336-361 

1512 a 67-88 

1512 a 266-292 



1517 a 201-223 

1517 a 381-405 

1517 a 66-88 

1517 a 232-257 



334 



WO 01/23604 



PCT/CA00/01150 



Annex Ls Specific and ubiquitous prim rs for nucl ic acid 
amplification (antimicrobial agents resistance g nes 
sequences} (continu d). 



Originating DNA fragment 



SEQ ID Nucleotide 
NO. position 



Resistance gene : ereB 



1531 


5' 


-TCT 


TTT 


TGT TAC 


GAC 


ATA CGC TTT 


T 


1535 a 


152 


-176 


1532 b 


5' 


-AGT 


GCT 


TCT TTA 


TCC 


GCT GTT CTA 




1535 a 


456 


-479 


1533 


5 ' 


-CAG 


CGG 


ATA AAG 


AAG 


CAC TAC ACA 


TT 


1535 a 


A £ 1 

461 


A O C 

-486 


1534 b 


5' 


-CCT 


CCT 


GAA ATA 


AAG 


CCC GAC AT 




1535 a 


727 


-749 


Resistance 


aene 






gyrA 














1340 




-GAA 


CAA 


GGT ATG 


ACA 


CCG GAT AAA 


T 


1299 a 


163 


-188 


1341 b 


5' 


-GAT 


AAC 


TGA AAT 


CCT 


GAG CCA TAC 


G 


1299 a 


274 


-299 


1936 


5' 


-TAC 


CAC 


CCG CAC 


GGC 






1954 a 


205 


-219 


1937 b 


5' 


-CGG 


AGT 


CGC CGT 


CGA 


TG 




1954 a 


309 


-325 


1942 


5' 


-GAC 


TGG 


AAC AAA 


GCC 


TAT AAA AAA 


TCA 


1954 a 


148 


-174 


1937 b 


5' 


-CGG 


AGT 


CGC CGT 


CGA 


TG 




1954 a 


309 


-325 


2040 


5' 


-TGT 


GAC 


CCC AGA 


CAA 


ACC C 




2054 a 


33 


-51 


2041 b 


5' 


-GTT 


GAG 


CGG CAG 


CAC 


TAT CT 




2054 a 


207 


-226 


Resistance 


q<?ne: 






inhA 














2098 


5' 


-CTG 


AGT 


CAC ACC 


GAC 


AAA CGT C 




2101 a 


910 


-931 


2099 b 


5' 


-CCA 


GGA 


CTG AAC 


GGG 


ATA CGA A 




2101 a 


1074 


-109! 



Resistance genes : linA, liaA' 



± 3 J b 




-AUA 


TUT 


ATT AAU 


TUli 


AAA 


ALA 


ALA 


A 








1537 b 


5' 


-CTT 


TGT 


AAT TAG 


TTT 


CTG 


AAA 


ACC 


A 


1540 a 


352 


-376 


1538 


5' 


-TTA 


GAA 


GAT ATA 


GGA 


TAC 


AAA 


ATA 


GAA G 


1540 a 


187 


-214 


1539 b 


5' 


-GAA 


TGA 


AAA AGA 


AGT 


TGA 


GCT 


T 




1540 a 


404 


-425 


asi stance oene 






linB 


















1541 


5' 


-TGA 


TAA 


TCT TAT 


ACG 


TGG 


GGA 


ATT 


T 


1545 a 


246 


-270 


1542 b 


5' 


-ATA 


ATT 


TTC TAA 


TTG 


CCC 


TGT 


TTC 


AT 


1545 a 


359 


-384 


1543 


5' 


-GGG 


CAA 


TTA GAA 


AAT 


TAT 


TTA 


TCA 


GA 


1545 a 


367 


-392 


1544 b 


5' 


-TTT 


TAC 


TCA TGT 


TTA 


GCC 


AAT 


TAT 


CA 


1545 a 


579 


-604 



SEQ ID NO, Nucleotide sequence 



a S quence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance g n s 
sequ nces) (continued) • 

_______ .._ — ™' ^ Originating DNA fragment 

-SEO-ID-NO.. Nucleotide sequence SEQ ID Nucleotide 

N0 position 



Resistance 


gene: 




mefA 
















1546 
1547 b 


5' -CAA 
5' -TAA 


GAA 
TTC 


GGA ATG 
CCA AAT 


GCT GTA 
AAC CCT 


CTA 
AAT 


C 

AAT 


AGA 


1548 a 
1548 a 


625 
816 


-646 
-842 


Resistance aene: 




mefE 
















1549 
1550 b 


5 ' -GCT 
5 ' -TAG 


TAT 
CAA 


TAT TAG 
GTG ACA 


GAA GAT 
TGA TAC 


TAG 
TTC 


GGG 
CGA 


GC 


1551 a 
1551 a 


815 
1052 


-840 
-1075 


Resistance aenes: 




mefA, 


mefE 














1552 
1553 b 


5'-GGC 
5' -CAA 


AAG 
TGC 


CAG TAT 
TAC GGA 


CAT TAA 
TAA ACA 


TCA 
ATA 


CTA 
CTA 


TC 


1548 a 
1548 a 


50 
318 


-73 
-343 


1554 
1555 b 


5 ' -AGA 
5 ' -TAG 


AAA 
TAA 


TTA AGC 
AAA CCA 


CTG AAT 
ATG ATT 


ATT 
TAC 


TAG 
ACC 


GAC 
G 


1548 a 
1548 a 


1010 
1119 


-1035 
-1143 


Resistance 


aenes: 




xnphAg 


mphK 














1556 
1557 b 


5 ' -ACT 
5'-GAA 


GTA 
CGG 


CGC ACT 
CAG GCG 


TGC AGC 
ATT CTT 


CCG 
GAG 


ACA 
CAT 


T 


1560 a 
1560 a 


33 
214 


-57 
-237 


1558 
1559 b 


5'-GTG 
5 ' -GCC 


GTG 
GCA 


GTG CAT 
GCG AGG 


GGC GAT 
TAC TCT 


CTC T 
TCG TTA 




1560 a 
1560 a 


583 
855 


-604 
-878 


Resistance 


aene: 




mupA 
















2142 


5 ' -GCC 


TTA 


ATT TCG 


GAT AGT 


GC 






2144 a 


1831 


-1850 



2143° 5 '-GAG AAA GAG CCC AAT TAT CTA ATG T 2144 a 2002-2026 



Resistance gene: pare 



1342 


5' 


-GAT GTT ATT GGT CAA TAT 


CAT 


CCA 




1321 a 


205 


-229 


1343 b 


5' 


-AAG AAA CTG TCT CTT TAT 


TAA 


TAT 


CAC GT 


1321 a 


396 


-425 


1934 


5' 


-GAA CGC CAG CGC GAA ATT 


CAA AAA 


G 


1781 


67 


-91 


1935 b 


5' 


-AGC TCG GCA TAC TTC GAC 


AGG 






1781 


277 


-297 


2044 


5' 


-ACC GTA AGT CGG CCA AGT 


CA 






2055 a 


176 


-195 


2045 b 


5' 


-GTT CTT TCT CCG TAT CGT 


C 






2055 a 


436 


-454 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequ nee Listing, 
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Annex L: Sp cific and ubiquitous primers for nucleic acid 

amplification (antimicrobial agents resistance gen s sequences) 
(continued) . 

_ — originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 
NO. position 



Resistance aene: 




ppflo-like 








2163 5'-acc 
2164 b 5' -CAA 


TTC 
CGA 


ATC CTA 
CAC CAG 


CCG 
CAC 


ATG 
TGC 


TGG GTT 
CAT TG 


2165 a 
2165 a 


922-945 
1136-1158 


Resistance aene: 




rpoB 












2065 5 ' -CCA 
2066 b 5' -CAC 


GGA 
CGA 


CGT GGA 
CAG CGA 


GGC 
GCC 


GAT 
GAT 


CAC A 
CAG A 


2072 a 
2072 a 


1218-1239 
1485-1506 


Resistance aene: 




satG 












1581 5'-AAT 
1582 b 5' -GGC 


TGG 
AAA 


GGA CTA 
TCA GTC 


CAC 
AGT 


CTA 
TCA 


TTA TGA TG 
GGA GT 


1585 a 
1585 a 


93-118 
310-332 


1583 5'-CGA 
1584 b 5' -TCA 


TTG 
CCT 


GCA ACA 
ATT TTT 


ATA 
ACG 


CAC 
CCT 


TCC TG 
GGT AGG AC 


1585 a 
1585 a 


294-316 
388-413 


Resistance qene: 




suJIJ 












1961 5'-GCT 
1962 b 5' -GGA 


CAA 
CAA 


GGC AGA 
GGC GGT 


TGG 
TGC 


CAT 
GTT 


TCC C 
TGA T 


1965 a 
1965 a 


222-243 
496-517 


1963 5 '-CAT 
1964 b 5'-ATC 


TCC 
TGC 


CGT CTC 
CTG CCC 


GCT 
GTC 


CGA 
TTG 


CAG T 
C 


1965 a 
1965 a 


237-258 
393-411 


Resistance aene: 




tetB 












1966 5 '-CAT 
1967 b 5 ' -CAG 


GCC 
CAA 


AGT CTT 
TAA GTA 


GCC 
ATC 


AAC 
CAG 


G 

CGA TG 


1970 a 
1970 a 


66-84 
242-264 



1968 5 ' -GGA GAG ATT TCA CCG CAT AG 197 0 a 457-476 

1969 b 5'-AGC CAA CCA TCA TGC TAT TCC A 1970 a 721-742 

Resistance aene : tetM 



1586 


5' 


-ATT CCC ACA ATC TTT TTT ATC AAT AA 


1590 a 


361- 


-386 


1587 b 


5' 


-CAT TGT TCA GAT TCG GTA AAG TTC 


1590 a 


501- 


•524 


1588 


5' 


-GTT TTT GAA GTT AAA TAG TGT TCT T 


1590 a 


957- 


•981 


1589 b 


5' 


-CTT CCA TTT GTA CTT TCC CTA 


1590 a 


1172- 


•1192 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial ag nts resistance g nes 
sequences) (continu d) . 

— Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 
NO. position 



Resistance aene : vatB 



1609 
1610 b 


5'- 
5'- 


•GCC 
•CCT 


CTG 
GGC 


ATC CAA 
ATA ACA 


ATA 
GTA 


GCA 
ACA 


TAT 
TTC 


A 
TG 




1613 a 
1613 a 


11 
379 


-32 
-401 


1611 
1612 b 


5'- 
5'- 


-TGG 
-ACA 


GAA 
ACT 


AAA GCA 
GAA TTC 


ACT 
GCA 


CCA 
GCA 


TCT 
ACA 


C 

AT 




1613 a 
1613 a 


301 
424 


-322 
-446 


Resistance 


gene: 






vatC 


















1614 
1615 b 


5'- 
5'« 


•CCA 
-ATT 


ATC 
AGT 


CAG AAG 
TTA TCC 


AAA 
CCA 


TAT 
ATC 


ACC 
AAT 


C 

TCA 




1618 a 
1618 a 


26 

177 


-47 
-200 


1616 
1617 b 


5'- 
5'- 


-ATA 
-GCC 


ATG 
AAC 


AAT GGG 
AAC TGA 


GCT 
ATA 


AAT 
AGG 


CAT 
ATC 


CGT 
AAC 


AT 


1618 a 
1618 a 


241 
463 


-266 
-486 


Resistance oene: 






vga 


















1619 
1620 b 


5'- 
5'- 


-AAG 
-TGT 


GCA 
ACC 


AAA TAA 
CGA GAC 


AAG 
ATC 


GAG 
TTC 


CAA 
ACC 


AGC 
AC 




1623 a 
1623 a 


641 
821 


-664 
-843 


1621 
1622 b 


5'- 
5'- 


-AAT 
-CGA 


TGA 
TTT 


AGG ACG 
TGA CAG 


GGT 
ATG 


ATT 
GCG 


GTG 
ATA 


GAA AG 
ATG AA 


1623 a 
1623 a 


843 
975- 


-868 
1000 


Resistance 


gene: 






vgaB 


















1624 
1625 b 


5'- 
5'- 


-TTC 
•TTT 


TTT 
TCG 


AAT GCT 
TAT TCT 


CGT 
TCT 


AGA 
TGT 


TGA 
TGC 


ACC TA 
TTT C 


1628 a 
1628 a 


354 
578 


-379 
-602 


1626 


5'- 


-AGG 


AAT 


GAT TAA 


GCC 


CCC 


TTC 


AAA 


AA 


1628 a 


663 


-688 



1627 b 5 ' -TTA CAT TGC GAC CAT GAA ATT GCT CT 1628 a 849-874 

Resistance genes : vyb, vgh 

1629 5' -AAG GGG AAA GTT TGG ATT ACA CAA CA 1633 a 73-98 

1630 b 5 ' -GAA CCA CAG GGC ATT ATC AGA ACC 1633 a . 445-468 

1631 5' -CGA CGA TGC TTT ATG GTT TGT 1633 a 576-596 

1632 b 5' -GTT AAT TTG CCT ATC TTG TCA CAC TC 1633 a 850-875 



a Sequence from databases. 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex L; Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial ag nts r sistance genes 
s qu nces) (continu d) • 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance oene: 



vgbB 



1634 
1635 b 



5'-TTA ACT TGT CTA TTC CCG ATT CAG G 1882 a 23-47 

5'-GCT GTG GCA ATG GAT ATT CTG TA 1882 a 267-289 



1636 
1637 b 



5 ' -TTC CTA CCC CTG ATG CTA AAG TGA 
5' -CAA AGT GCG TTA TCC GAA CCT AA 



Resistance gene: 



Sequencing primers 
gyrA 



1882 a 
1882 a 



155-178 
442-464 



1290 


5' 


-GAY 


TAY 


GCI 


ATG 


ISI 


GTI 


ATH 


GT 


1299 a 


70-83 


1292 b 


5' 


-ARI 


SCY 


TCI 


ARI 


ATR 


TGI 


GC 




1299 a 


1132-1152 


1291 


5' 


-GCI 


YTI 


CCI 


GAY 


GTI 


MGI 


GAY GG 


1299 a 


100-123 


1292 b 


5' 


-ARI 


SCY 


TCI 


ARI 


ATR 


TGI 


GC 




1299 a 


1132-1152 


1293 


5' 


-ATG 


GCT 


GAA 


TTA 


CCT 


CAA 


TC 




1299 a 


1-21 


1294 b 


5' 


-ATG 


ATT 


GTT 


GTA 


TAT 


CTT 


CTT 


CAA C 


1299 a 


2626-2651 


1295 b 


5' 


-CAG 


AAA 


GTT 


TGA 


AGC 


GTT 


GT 




1299 a 


1255-1275 


1296 


5' 


-AAC 


GAT 


TCG 


TGA 


GTC 


AGA 


TA 




1299 a 


1188-1208 


1297 


5' 


-CGG 


TCA 


ACA 


TTG 


AGG 


AAG 


AGC 


T 


1300 a 


29-51 


1298 b 


5' 


-ACG 


AAA 


TCG 


ACC 


GTC 


TCT 


TTT 


TC 


1300 a 


415-437 


isistance 


gene: 




gyrB 














13m 




-CTT 


Mf?T 


AWT 




err 


fJST 


ATH 


TA 


1 ^07^ 


OR 



1302 b 


5' 


-TAI 


ADI GGI 


GGI KKI GCI ATR TA 


1307 a 


1600-1623 


1303 


5' 


-GGI 


GAI GAI 


DYI MGI GAR GG 


1307 a 


955-975 


1304 b 


5' 


-CIA 


RYT TIK 


YIT TIG TYT G 


1307 a 


1024-1043 


1305 


5' 


-ATG 


GTG ACT 


GCA TTG TCA GAT G 


1307 a 


1-23 


1306 b 


5' 


-GTC 


TAC GGT 


TTT CTA CAA CGT C 


1307 a 


1858-1888 



Sequence from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex Li Specific and ubiquitous prim rs for nucleic acid 
ajtiplif ication (antimicrobial agents resistance gen s 
sequences) (continued). 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Sequencing primers (continued) 



Resistance 


aene 






parC 














1308 




-ATG 


TAY 


GTI 


ATI 


ATG 


GAY 


MGI GC 


1320 a 


67 


-90 




0 


-ATT 


ZITV 
nl I 


TTR 


TTI 


PPY 


TTI 


cry tt 


J- J Z> VJ 




tUiU 


IJlU 


•j 


-ATT 

nil 


ATT 
nil 


TSI 


ATI 


ACY 


TCR 


TC 


1320 a 


1112 


-1132 


1311 b 


5' 


-GAR 


ATG 


AAR 


ATI 


MGI 


GGI 


GAR CA 


1320 a 


1288 


-1311 


1312 


5' 


-AAR 


TAY 


ATI 


ATI 


CAR 


GAR 


MGI GC 


1321 a 


67 


-90 


ijij 




-AMI 


AY I 


CKR 


TGI 


GGI 


TTI 


TTY TT 


1321 a 


2212 


-2235 


1314 


5' 


-TAI 


GAI 


TTY 


ACI 


GAI 


SMI 


CAR GC 


1321 a 


1228 


-1251 


1315 b 


5' 


-ACI 


ATI 


GCI 


TCI 


GCY 


TGI 


KSY TC 


1321 a 


1240 


-1263 


1316 


5' 


-GTG 


AGT 


GAA 


ATA 


ATT 


CAA 


GAT T 


1321 a 


1- 


■23 


1317 b 


5' 


-CAC 


CAA 


AAT 


CAT 


CTG 


TAT 


CTA C 


1321 a 


2356 


-2378 


1318 


5' 


-ACC 


TAY 


TCS 


ATG 


TAC 


GTR 


ATC ATG GA 


1320 a 


58 


-84 


1319 b 


5' 


-AGR 


TCG 


TCI 


ACC 


ATC 


GGY 


AGY TT 


1320 a 


832 


-855 


Resistance 


gene 






parE 














1322 


5' 


-RTI 


GAI 


AAY 


ISI 


GTI 


GAY 


GAR G 


1328 a 


133 


-155 


1325 b 


5' 


-RTT 


CAT 


YTC 


ICC 


IAR 


ICC 


YTT 


1328 a 


1732 


-1752 


1323 


5' 


-ACI 


AWR 


SAI 


GGI 


GGI 


ACI 


CAY G 


1328 a 


829 


-850 
























a *s r\ 



D \ - UJl" ULl" UL' 1~ 5WK "tC1~ L'Cf 1 .3 Z B " 1ZOU-1JUZ 



5'-TGA TTC AAT ACA GGT TTT AGA G 1328 a 27-49 

5' -CTA GAT TTC CTC CTC ATC AAA T 1328 a 1971-1993 



Sequence from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 



1326 
1327 b 
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Ann x LI i Internal hybridizati n probes for sp cific d t ction 
of antimicrobial ag nts resistanc genes sequences. 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Resistance aene : aph3 'Via 

2252 5 ' -CCA CAT ACA GTG TCT CTC 1406 a 149-166 

Resistance gene; biaSHV 



1886 


5' 


-GAC 


GCC 


CGC 


GCC 


ACC 


ACT 




1900 a 


484-501 


1887 


5' 


-GAC 


GCC 


CGC 


GAC 


ACC 


ACT 


A 


1899 a 


514-532 


1888 


5' 


-GAC 


GCC 


CGC 


AAC 


ACC 


ACT 


A 


1901 a 


514-532 


1889 


5' 


-GTT 


CGC 


AAC 


TGC 


AGC 


TGC 


TG 


1899 a 


593-612 


1890 


5' 


-TTC 


GCA 


ACG 


GCA 


GCT 


GCT 


G 


1899 a 


594-612 


1891 


5' 


-CCG 


GAG 


CTG 


CCG 


AIC 


GGG 




1902 a 


692-709 


1892 


5' 


-CGG 


AGC 


TGC 


CAA 


RCG 


GGG 




1903 a 


693-710 


1893 


5' 


-GGA 


GCT 


GGC 


GAR 


CGG 


GGT 




1899 a 


694-711 


1894 


5' 


-GAC 


CGG 


AGC 


TAG 


CGA 


RCG 




1904 a 


690-707 


1895 


5' 


-CGG 


AGC 


TAG 


CAA 


RCG 


GGG 


T 


1905 a 


693-711 


1896 


5' 


-GAA 


ACG 


GAA 


CTG 


AAT 


GAG 


GCG 


1899 a 


484-504 


1897 


5' 


-CAT 


TAC 


CAT 


GGG 


CGA 


TAA 


CAG 


1899 a 


366-386 


1898 


5' 


-CCA 


TTA 


CCA 


TGA 


GCG 


ATA 


ACAG 


1899 a 


365-386 


distance 


oene 






blaTEM 










1909 


5' 


-ATG 


ACT 


TGG TTA 


AGT 


ACT 


CAC C 


1928 a 


293-314 


1910 


5' 


-ATG 


ACT 


TGG TTG 


AGT 


ACT 


CAC C 


1927 a 


293-314 


1911 


5' 


-CCA 


TAA 


CCA TGG 


GTG 


ATA 


ACA C 


1928 a 


371-392 


1912 


5' 


-CCA 


TAA 


CCA TGA 


GTG 


ATA 


ACA C 


1927 a 


371-392 


1913 


5' 


-CGC 


CTT 


GAT CAT TGG 


GAA 


CC 


1928 a 


475-494 


1914 


5' 


-CGC 


CTT 


GAT CGT 


TGG 


GAA 


CC 


1927 a 


475-494 


1915 


5' 


-CGC 


CTT 


GAT 


AGT 


TGG 


GAA 


CC 


1929 a 


475-494 



1916 


5' 


-CGT 


GGG 


TCT 


TGC 


GGT 


ATC 


AT 




1927 a 


712-731 


1917 


5' 


-CGT 


GGG 


TCT 


GGC 


GGT 


ATC 


AT 




1930 a 


712-731 


1918 


5' 


-GTG 


GGT 


CTC 


ACG 


GTA 


TCA 


TTG 




1927 a 


713-733 


1919 


5' 


-CGT 


GGG 


TCT 


CTC 


GGT 


ATC 


ATT 




1931 a 


712-732 


1920 


5' 


-CGT 


GGI 


TCT 


CGC 


GGT 


ATC 


AT 




1927 a 


712-731 


1921 


5' 


-CGT 


GGG 


TCT 


AGC 


GGT 


ATC 


ATT 




1932 a 


713-733 


1922 


5' 


-GTT 


TTC 


CAA 


TGA 


TTA 


GCA 


CTT 


TTA 


1927 a 


188-211 


1923 


5' 


-GTT 


TTC 


CAA 


TGA 


TAA 


GCA 


CTT 


TTA 


1927 a 


188-211 


1924 


5' 


-GTT 


TTC 


CAA 


TGC 


TGA 


GCA 


CTT 


TT 


1932 a 


188-210 


1925 


5' 


-CGT 


TTT 


CCA 


ATG 


ATG 


AGC 


ACT 


TT 


1927 a 


187-209 


1926 


5' 


-GTT 


TTC 


CAA 


TGG 


TGA 


GCA 


CTT 


TT 


1933 a 


188-210 


2006 


5' 


-TGG 


AGC 


CGG 


TGA 


GCG 


TGG 






1927 a 


699-716 



a Sequence from databases. 
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Annex LI: 



Internal hybridization probes for sp cific d tection of 
antimicrobial ag nts resistance gen s s quences 
(continu d) . 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide seguence 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance gene: 



blaTEM (continued) 



2007 


5 ' -TGG 


AGC 


CAG 


TGA 


GCG 


TGG 




2010 a 


699-716 


2008 


5 ' -TCT 


GGA 


GCC 


GAT 


GAG 


CGT 


G 


1929 a 


697-715 


2009 


5 ' -CTG 


GAG 


CCA 


GTA 


AGC 


GTG 


G 


2011 a 


698-716 


2141 


5 ' -CAC 


CAG 


TCA 


CAG 


AAA 


AGC 




1927 a 


311-328 



Resistance gene: 



dhfrJa 



2253 5 '-CAT TAC CCA ACC GAA AGT A 

Resistance gene: embB 



Resistance aene : inhA 

2100 5' -GCG AGA CGA TAG GTT GTC 

Resistance gene: pare 



1461 a 



2101 a 



158-176 



2104 


5' 


-CTG 


GGC 


ATG GCI 


CGA 


GTC 




2105 a 


910-927 


distance 


aene: 


» 




gyrA 












1333 


5' 


-TCA 


TGG 


TGA CTT 


ATC 


TAT TTA TG 


1299 a 


240-263 


1334 


5' 


-CAT 


CTA 


TTT ATA 


AAG 


CAA 


TGG TA 


1299 a 


251-274 


1335 


5' 


-CTA 


TTT 


ATG GAG 


CAA 


TGG 


T 


1299 a 


254-273 


1940 


5' 


-GTA 


TCG 


TTG GTG 


ACG 


TAA 


T 


1299 a 


206-224 


1943 


5' 


-GCT 


GGT 


GGA CGG 


CCA 


G 




1954 a 


279-294 


1945 


5' 


-CGG 


CGA 


CTA CGC 


GGT 


AT 




1954 a 


216-232 


1946 


5' 


-CGG 


CGA 


CTT CGC 


GGT 


AT 




1954 a 


216-232 


1947 


5' 


-CGG 


TAT 


ACG GCA 


CCA 


TCG 


T 


1954 a 


227-245 


1948 


5' 


-GCG 


GTA 


TAC AAC 


ACC 


ATC 


G 


1954 a 


226-244 


1949 


5' 


-CGG 


TAT 


ACG CCA 


CCA 


TCG 


T 


1954 a 


227-245 


2 042 


5' 


-CAC 


GGG 


GAT TTC 


TCT 


ATT 


TA 


2054 a 


103-122 


2 043 


5' 


-CAC 


GGG 


GAT TAC 


TCT 


ATT 


TA 


2054 a 


103-122 



1017-1034 



1336 


5' 


-TGG 


AGA CTA CTC 


AGT GT 


1321 a 


232-249 


1337 


5' 


-TGG 


AGA CTT CTC 


AGT GT 


1321 a 


232-249 


1338 


5' 


-GTG 


TAC GGA GCA 


ATG 


1321 a 


245-260 


1339 


5' 


-CCA 


GCG GAA ATG 


CGT 


1321 a 


342-357 


1941 


5' 


-GCA 


ATG GTC CGT 


TTA AGT 


1321 a 


253-270 


1944 


5' 


-TTT 


CGC CGC CAT 


GCG TTA C 


1781 


247-265 


1950 


5' 


-GGC 


GAC ATC GCC 


TGC 


1781 


137-151 


1951 


5' 


-GGC 


GAC AGA GCC 


TGC TA 


1781 


137-153 



Sequence from databases. 
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Annex LZ: Internal hybridization probes for specific detection of 
antimicrobial ag nts resistance genes sequences 
(continued) • 



Originating DNA fragment 



SEQ ID NO. 


Nucleotide 


sequence 




SEQ ID 
NO. 


Nucleotide 
position 


Resistance 


aene: 




parC (continued) 






1952 


G> 1 -PPT 




ATG GAG CGA 


TGG T 


1781 


147-165 


1953 


5 ' -CGC 


CTG 


CTA TAA AGC 


GAT GGT 


1781 


145-165 


2046 


5'-ACG 


GGG 


ATTTTT CTA 


TCT AT 


2055 a 


227-246 


Resistance 


qene ; 




rpoB 








2067 


5 ' -AGC 


TGA 


GCC AAT TCA 


TGG 


2072 a 


1304-1321 


2068 


5 ' -ATT 


CAT 


GGA CCA GAA 


CAA C 


2072 a 


1314-1332 


2069 


5' -CGC 


TGT 


CGG GGT TGA 


CCC 


2072 a 


1334-1351 


2070 


5'-GTT 


GAC 


CCA CAA GCG 


CCG 


2072 a 


1344-1361 


2071 


5 ' -CGA 


CTG 


TCG GCG CTG 


GGG 


2072 a 


1360-1377 


Resistance 


aene: 




tetW 








2254 


5 ' -ACC 


TGA 


ACA GAG AGA 


AAT G 


1590 a 


1062-1080 



a Sequence from databases. 
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Annex LII: Molecular beacon internal hybridization probes for 
sp cific detection of atpD sequences. 

— Originating DNA fragment 

SEQ ID NO. Nucleotide sequence 9 SEQ ID Nucleotide 

NO. position 



Bacterial species : Bacteroldes fragllis 

2136 5 * -CCA ACG CGT CCT CAA TCA TTT CTA ACT TCT 929 353-382 

ATG GCC G££ GH GG 

Bacterial snecies : Bordetella pertussis 

2182 5'-GCG CGC CAA CGA CTT CTA CCA CGA AAT GGA 1672 576-605 

AGA GTC GCG CGC 

Bacterial group : Casapylobacter jejuni and C. coli 

2133 5 • -CCA CGC ACA WAA ACT TGT TTT AGA AGT 1576, 44-73 d 

AGC AGC WCA GCG 2GG 1600,1849, 

1863,2139 b ' c 



Funaal 


species : 


Candida glabra t a 








2078 


5 ' -CCG AGC 
TCG G 


CTT GGT CTT CGG CCA AAT GAA CGC 


463 


442- 


-463 


Fungal 


specie?: 


Candida krvsel 








2075 


5' -CCG AGC 
TAG GTG 


CAG GTT CTG AAG TCT CTG CAT TAT 
CTC GG 


468 


720- 


-748 


funqal 


species: 


Candida lusitaniae 








2080 


5' -CCG AGC 
G 


CGA AGA GGG CCA AGA TGT CGC 2CG 


470 


520- 


-538 


Funaal 


species: 


Candida parapsilosis 









2079 5' -CCG AGC GTT CAG TTA CTT CAG TCC AAG CCG 472 837-860 

GCT CGG 

Funaal species : Candida tropicalis 

2077 5' -CCG AGC AAC CGA TCC AGC TCC AGC TAC GCT 475 877-897 

CGG 

Bacterial species: Klebsiella pneumoniae 

2281 5'-CCC CCA GCT GGG CGG CGG TAT CGA TGG GGG 317 40-59 

a Underlined nucleotides indicate the molecular beacon's stem, 
k Sequence from databases. 

c These sequences were align d to derive the corresponding primer. 
d The nucleotide positions refer to the C jejuni atpD sequence fragment (SEQ ID 
NO. 1576) . 
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Annex LII: Molecular beacon int rnal hybridization probes for 
sp cific det ction of atpD s qu nc s (continued). 

„_ ^^^^^^^^^^^^^ Originating DNA fragment 



SEQ ID NO. Nucleotide sequence a SEQ ID Nucleotide 

NO. position 



Funoal genus: Candida sp. 

2076 5'-£CG &GC YGA YAA CAT TTT CAG ATT CAC CCA 460-478, 697-723 

RGC GCI Cfifi 663 b 



a Underlined nucleotides indicate the molecular beacon's stem, 
k These sequences were aligned to derive the corresponding primer. 
c The nucleotide positions refer to the C. albicans atpD sequence fragment (SEQ 
ID NO. 460) . 
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Annex LIXIs Internal hybridization prob e for specific det ction of 
atpD sequences. 

— . Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species : Acinetobacter bamoannii 

2169 5'-CCC GTT TGC GAA AGG TGG 243 304-321 

Bacterial species : Klebsiella pneumoniae 

2167 5'-CAG CAG CTG GGC GGC GGT 317 ' 36-53 
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Annex LIVi Internal hybridization probes for specific d t ction of 
ddl and mtl sequenc s. 

^^^^^^^^ - Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species : Enterococcus faecium (ddl) 

2286 5'-AGT TGC TGT ATT AGG AAA TG 2288 a 784-803 

2287 5'-TCG AAG TTG CTG TAT TAG GA 2288 a 780-799 

Bacterial species : Enterococcus faecalls mtl) 

2289 5'-CAC CGA AGA AGA TGA AAA AA 1243® 264-283 

2290 5'-TGG CAC CGA AGA AGA TGA 1243 a 261-278 

2291 5 ' -ATT TTG GCA CCG AAG AAG A 1243 a 257-275 



a Sequence from databases. 
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1. A method for generating a repertory of nucleic acids oftuf y fus t atpD 
and/or recA genes from which are derived probes or primers, or both, useful for 
the detection of one, more than one related microorganisms, or substantially all 
microorganisms_of_a_ group_selected_from_algae,-archaea,-bacteria,-fungi-and- 
parasites, which comprises the step of: 

amplifying the nucleic acids of a plurality of determinedalgal, 
archaeal, bacterial, fungal and parasitical species with any combination of 
the primer pairs defined in SEQ ID NOs.: 543, 556-574, 636-655, 664, 
681-683, 694, 696-697, 699-700, 708, 812-815, 911-917, 919-922, 935- 
938, 1203-1207, 1212-1213, 1221-1229, 1605-1606, 1974-1984, 1999- 
2003,2282-2285. 

2. A method for generating a repertory of nucleic acid sequences, which 
comprises the steps of: 

reproducing the method of claim 1 , and 
adding the step of: 

sequencing said nucleic acids. 

3. A method for generating sequences of probes, or primers, or both, 
useful for the detection of one, more than one related microorganisms, or 
substantially all microorganisms of a group selected from algae,archaea, bacteria, 
fungi and parasites, which comprises the steps of: 

*a *i J ~r o — J 

rcpruuucnig uic uicuiuu ui ciaim z., anu 

adding the steps of: 

aligning a subset of nucleic acid sequences of said 

repertory, 

locating nucleic acid stretches that are present in the 
nucleic acids of strains or representatives of said one, more than one 
related microorganisms, or substantially all microorganisms of said 
group, and not present in the nucleic acid sequences of other 
microorganisms, and 



348 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



deriving consensus nucleic acid sequences useful as 
probes or primers from said stretches. 

4. A bank of nucleic acids comprising the repertory of nucleic acids 
obtained from the method of claim 1 . 



5. A bank of nucleic acid sequences comprising the repertory of nucleic 
acid sequences obtained from the method of claim 2. 

6. A method for generating sequences of probes, or primers, or both, 
useful for the detection of one, more than one related microorganisms, or 
substantially all microorganisms of a group selected from algae 3 archaea, bacteria, 
fungi and parasites, which comprises the steps of. 

aligning a subset of nucleic acid sequences of the bank as 
defined in claim 5, 

locating nucleic acid sequence stretches that are present in the 
nucleic acid sequences of strains or representatives of said one, more than 
one related microorganisms, or substantially all microorganisms of said 
group, and not present in the nucleic acid sequences of other 
microorganisms, and 

deriving consensus nucleic acid sequences useful as probes or 
primers from said stretches. 

7. A method for generating probes, or primers or both, useful for the 

Ania/^+i nk-n r\f s\r*a tvi/vr/a tVior* And r^nictA mir-rArvrrtonicmc rvr oiKctnnti otlir oil 
uvtv^uuii \jl v/iiv, uiv/iv man iwiaivu un^i i/v/igamoiuo, vi ouuottuitiwujr w.11 

microorganisms of a group selected from algae, archaea, bacteria, fungi and 
parasites, which comprises the steps of: 

reproducing the method of claim 3 or 6, and 
adding the step of: 

synthesising said probes or primers upon the nucleic acid 
sequences thereof. 

8. A nucleic acid used for universal detection of any one of alga, 
archaeon, bacterium, fungus and parasite which is obtained from the method of 
claim 7. 
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9. A nucleic acid used for universal detection as set forth in claim 8, 
which has a nucleic acid sequence of at least 12 nucleotides capable of 
hybridizing with said any one of alga, archaeon, bacterium, fungus and parasite 
and with any one of SEQ ID NOs.: 543, 556-574, 636-655, 658-661, 664, 681- 

_683,_69_4,_696,J59^ 

1207, 1212-1213, 1221-1229, 1605-1606, 1974-1984, 1999-2003, 2282-2285. 

10. A nucleic acid used for the specific and ubiquitous detection and for 
identification of any one of a algal, archaeal, bacterial, fungal and parasitital 
species, genus, family and group, which is obtained from the method of claim 7. 

11. A nucleic acid as set forth in claim 10 having any one of the 
nucleotide sequences which are defined in SEQ ED NOs.: 

539, 540 for the detection and/or identification of Mycobacteriaceae 

family 

541,542,544, for the detection and/or identification of Pseudomonads 
2121 group 

545, 546 for the detection and/or identification of Corynebacterium 

sp. 

547 3 548, 1202 for the detection and/or identification of Streptococcus sp. 
549, 550, 582, 583, for the detection and/or identification of Streptococcus 
625,626,627,628, agalactiae 
i iyy 

551,552,2166, for the detection and/or identification of Neisseria 

2173,2174,2175, gonorrhoeae 

2176,2177,2178, 

2179 

553, 575, 605, 606, for the detection and/or identification of Staphylococcus sp. 
707,1175,1176 

554,555,2213 for the detection and/or identification of Chlamydia 
trachomatis 
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576, 63 1, 632, 633, for the detection and/or identification of Candida sp. 
634, 635, 1163, 

1164, 1167, 2076, 
2108,2109 

577, 1 1 56, 1 160 for the detection and/or identification of Candida albicans 
2073 

578rl 166711687 for the~detection~an^ 
2074 

579, 2 1 68 for the detection and/or identification of Escherichia coli 

580,603,1174, for the detection and/or identification of Enterococcus 
1236, 1238,2289, faecalis 
2290, 2291 

581 for the detection and/or identification of Haemophilus 

influenzae 

584, 585, 586, 587, for the detection and/or identification of Staphylococcus 

588, 1232, 1234, aureus 

2186 

589,590,591,592, for the detection and/or identification of Staphylococcus 
593 epidermidis 

594, 595 for the detection and/or identification of Staphylococcus 

haemolyticus 

596, 597, 598 for the detection and/or identification of Staphylococcus 

hominis 

599,600,601,695, for the detection and/or identification of Staphylococcus 
1208, 1209 saprophyticus 

602, 1235, 1237, for the detection and/or identification of Enterococcus 
1696, 1697, 1698, faecium 
1699,1700,1701, 
2286, 2287 

604 for the detection and/or identification of Enterococcus 

gallinarum 

620,1122 for the detection and/or identification of Enterococcus 

casseliflavus, E.flavescens and E. gallinarum 
629,630,2085, for the detection and/or identification of Chlamydia 
2086, 2087, 2088, pneumoniae 
2089,2090,2091, 
2092 
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636, 637, 638, 639, for the detection and/or identification of at least the 

640,641,642 following: 

Abiotrophia adiacens, Abiotrophia defectiva, Acinetobacter 
baumannii, Acinetobacter Iwoffi, Aerococcus viridans, 
Bacillus anthracis, Bacillus cereus, Bacillus subtilis, 
Brucella abortus, Burkholderia cepacia, Citrobacter 
diversus, Citrobacter freundii, Enterobacter aerogenes, 
Enterobacter agglomerans, Enterobacter cloacae, 

— Enterococcus avium^~~~Entermoccus casseliflavus~ 

Enterococcus dispar, Enterococcus durans, Enterococcus 
faecalis, Enterococcus faecium, Enterococcus flavescens, 
Enterococcus gallinarum, Enterococcus mundtii, 
Enterococcus raffinosus. Enterococcus solitarius, 
Escherichia coli, Gemella morbillorum, Haemophilus 
ducreyi, Haemophilus haemolyticus, Haemophilus 
influenzae, Haemophilus parahaemolyticus, Haemophilus 
parainjluenzae, Hafnia atvei, Kingella kingae, Klebsiella 
oxytoca, Klebsiella pneumoniae, Legionella pneumophila, 
Megamonas hypermegale, Moraxella atlantae, Moraxella 
catarrhalis, Morganella morganii, Neisseria gonorrheae, 
Neisseria meningitidis, Pasteurella aerogenes, Pasteurella 
multocida, Peptostreptococcus magnus, Proteus mirabilis, 
Providencia aicalifaciens, Providenciarettgeri, Providencia 
rustigianii, Providencia stuartii, Pseudomonas aeruginosa, 
Pseudomonas fluorescens, Pseudomonas stutzeri, 
Salmonella boneori, Salmonella choleraesuis, Salmonella 
enteritidis, Salmonella gallinarum, Salmonella 
typhimurium, Serratia liquefaciens, Serratia marcescens, 
Shigella flexneri, Shigella sonnei, Staphylococcus aureus, 
Staphylococcus capitis Staphylococcus epidermidis, 
Staphylococcus haemolyticus, Staphylococcus hominis, 
Staphylococcus lugdunensis, Staphylococcus saprophyticus, 
Staphylococcus simulans, Staphylococcus warneri, 
Stenotrophomonas maltophilia, Streptococcus 
acidominimus, Streptococcus agalactiae, Streptococcus 
anginosus, Streptococcus bovis, Streptococcus constellatus, 
Streptococcus cricetus, Streptococcus cristatus, 
Streptococcus dysgalactiae, Streptococcus equi, 
Streptococcus ferus, Streptococcus gordonii, Streptococcus 

intermedin^: Strp.ntnmr.riix mnrnrpp.^ J?frp.ntnfyy*nijx rnitix . 

Streptococcus mutans t Streptococcus oralis, Streptococcus 
parasanguinis, Streptococcus parauberis, Streptococcus 
pneumoniae, Streptococcus pyogenes, Streptococcus ratti, 
Streptococcus salivarius, Streptococcus sanguinis, 
Streptococcus sobrinus, Streptococcus uberis, Streptococcus 
vestibularis, Vibrio cholerae, Yersinia enter ocolitica, 
Yersinia pestis, Yersinia pseudotuberculosis. 

656, 657, 27 1 , for the detection and/or identification of Enterococcus sp. 
1136,1137 

701, 702 for the detection and/or identification of Leishmania sp. 
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703, 704, 705, 706, for the detection and/or identification of Entamoeba sp. 
793 

794, 795 for the detection and/or identification of Trypanosoma cruzi 

796, 797, 808, 809, for the detection and/or identification of Clostridium sp. 
810,811 

~~ 798; 79978OO78OI" for thendete^ 
802,803,804,805, parvum 
806, 807 

816,817,818,819 for the detection and/or identification of Giardia sp. 
820,821,822 for the detection and/or identification oi Trypanosoma 

brucei 

823, 824 for the detection and/or identification of Trypanosoma sp. 

825, 826 for the detection and/or identification of Bordetella sp. 

923, 924, 925, 926, for the detection and/or identification of Trypanosomatidae 
927, 928 family 

933, 934 for the detection and/or identification of Enterobacteriaceae 

group 

994, 995, 996, 997, for the detection and/or identification of Streptococcus 
998,999, 1000, pyogenes 
1001,1200, 1210, 
1211 

1157,2079,2118 for the detection and/or identification of Candida 
parapsilosis 

1158,1159, 2078, for the detection and/or identification of Candida glabrata 
2110,2111 

1 1 60, 2077, 2119, for the detection and/or identification of Candida tropicalis 
2120 

1161, 2075, 2112, for the detection and/or identification of Candida krusei 
2113,2114 

1162 for the detection and/or identification of Candida 

guilliermondii 

1 1 62, 2080, 2 1 1 5 for the detection and/or identification of Candida lusitaniae 
2116,2117 

1165 for the detection and/or identification of Candida 

zeylanoides 

1201 for the detection and/or identification of Streptococcus 

pneumoniae 
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1233 for the detection and/or identification ofStaphylococcus sp. 

other than S. aureus 
1329, 1330, 1331, for the detection and/or identification oiKlebsiella 
1332, 2167, 2281 pneumoniae 

1661,_1665 for_the_detection and/_orJdentification_of.&cAenc/z/a.co/z 

and Shigella sp. 

1690,1691,1692, for the detection and/or identification of Acinetobacter 
1693, 2169 baumanii 

1694,1695,2122 for the detection and/or identification of Pseudomonas 
aeruginosa 

1971,1 972, 1 973 for the detection and/or identification of Cryptococcus sp. 
208 1 , 2082, 2083 for the detection and/or identification of Legionella sp. 
2084 for the detection and/or identification of Legionella 

pneumophila 

2093,2094,2095, for the detection and/or identification of Mycoplasma 
2096 pneumoniae 

2106,2107 for the detection and/or identification of Cryptococcus 

neoformans 

2131,2132,2133 for the detection and/or identification of Campylobacter 
jejuni and C coli 

2134, 2135, 2136 for the detection and/or identification of Bacteroides fragilis 
2170 for the detection and/or identification of Abiotrophia 

adiacens 

L 1 / 1 JLOl L11C UCLCCLlUli OlJU/Ul lUCllllllUttUUll Ul \JCfUeilU 

2172 for the detection and/or identification of Enter ococcus sp., 

Gemella sp., A. adiacens 
2180,2181,2182 for the detection and/or identification of Bordetella 

pertussis. 

12. A method for detecting the presence in a test sample of a 
microorganism that is an alga, archaeum, bacterium, fungus or parasite, which 
comprises: 

a) putting in contact any test sample tuf or atpD or recA nucleic acids 
and nucleic acid primers and/or probes, said primers and/or probes having 
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been selected to be sufficiently complementary tohybridize to one or more 
tuf or atpD or recA nucleic acids that are specific to said group of 
microorganisms; 

b) allowing the primers and/or probes and any test sample tuf or 
atpD or recA nucleic acids to hybridize under specified conditions such as 

said-primere_and^_probes_hyM 

acids of said microorganism and does not detectably hybridize to tuf or 
atpD or recA sequences from other microorganisms; and, 

c) testing for hybridization of said primers and/or probes to any 
test sample tuf or atpD or recA nucleic acids. 

13. The method of claim 12 wherein c) is based on a nucleic acid target 
amplification method. 

14. The method of claim 12 wherein c) is based on a signal amplification 
method. 

15. The method of any one of claims 12 to 14 wherein said primers 
and/or probes that are sufficiently complementary are perfectly complementary. 

16. The method of any one of claims 12 to 14 wherein said primers 
and/or probes that are sufficiently complementary are not perfectly 
complementary. 

i/. rv iiicuiuu iui uic apci/iiic ucicv/iiuii oiiu/ui luciiLinvaiiun wi a 

microorganism that is an algal, archaeal, bacterial, fungal or parasitical species, 
genus, family or group in any sample, using a panel of probes or amplification 
primers or both, each individual probe or primer being derived from a nucleic 
acid which has a nucleotide sequence of at least 12 nucleotides in length capable 
of hybridizing with the nucleic acids of said microorganism and with a nucleic 
acid having any one of the nucleotide sequences defined in SEQ ID NOs.: 

539 5 540 for the detection and/or identification oiMycobacteriaceae 

family 

541, 542, 544, 2121 for the detection and/or identification ofPseudomonads group 
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545, 546 for the detection and/or identification of Corynebacterium 

sp. 

547, 548, 1202 for the detection and/or identification of Streptococcus sp. 
549, 550, 582, 583, for the detection and/or identification of Streptococcus 
625,626,627,628, agalactiae 
1T99 

551,552,2166, for the detection and/or identification of Neisseria 
2 1 73, 2 1 74, 2 1 75, gonorrhoeae 
2176,2177,2178, 
2179 

553, 575, 605, 606, for the detection and/or identification of Staphylococcus sp. 
707,1175,1176 ] 
554,555,2213 for the detection and/or identification of Chlamydia 
trachomatis 

576, 63 1, 632, 633, for the detection and/or identification of Candida sp. 
634, 635, 1163, 
1164,1167,2076, 
2108,2109 

577, 1 156, 1 160 for the detection and/or identification of Candida albicans 
2073 

578,1166,1168, for the detection and/or identification of Candida 
2074 dubliniensis 

579, 2 1 68 for the detection and/or identification of Escherichia coli 

580,603,1174, for the detection and/or identification of Enterococcus 
1236, 1238,2289, faecalis 
2290, 2291 

581 for the detection and/or identification of Haemophilus 

influenzae 

584, 585, 586, 587, for the detection and/or identification of Staphylococcus 

588, 1232, 1234, aureus 

2186 

589,590,591,592, for the detection and/or identification of Staphylococcus 
593 epidermidis 

594, 595 for the detection and/or identification of Staphylococcus 

haemolyticus 

596, 597, 598 for the detection and/or identification of Staphylococcus 

hominis 
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599, 600, 601, 695, for the detection and/or identification of Staphylococcus 
1208, 1209 saprophyticus 

602, 1235, 1237, for the detection and/or identification of Enterococcus 
1696, 1697, 1698, faecium 
1699, 1700, 1701, 
2286, 2287 

604 — for_the_detection_and/or_identificati on-oLEnterococcus~ 

gallinarum 

620,1122 for the detection and/or identification of Enterococcus 

casseliflavus, E.flavescens and E. gallinarum 
629, 630, 2085, for the detection and/or identification of Chlamydia 
2086, 2087, 2088, pneumoniae 
2089,2090, 2091, 
2092 

636, 637, 638, 639, for the detection and/or identification of at least the following: 
640, 641, 642 Abiotrophia adiacens, Abiotrophia defectiva, Acinetobacter 

baumannii, Acinetobacter Iwoffi, Aerococcus viridans, 
Bacillus anthracis t Bacillus cereus, Bacillus subtilis, Brucella 
abortus, Burkholderia cepacia, Citrobacter diversus, 
Citrobacter freundii, Enterobacter aerogenes, Enterobacter 
agglomerans, Enterobacter cloacae, Enterococcus avium, 
Enterococcus casselijlavus, Enterococcus dispar, 
Enterococcus durans, Enterococcus faecalis, Enterococcus 
faecium, Enterococcus flavescens, Enterococcus gallinarum, 
Enterococcus mundtii, Enterococcus raffinosus, 
Enterococcus solitarius, Escherichia coli, Gemella 
morbillorum, Haemophilus ducreyi, Haemophilus 
haemolyticus, Haemophilus influenzae, Haemophilus 
parahaemolyticus,Haemophilusparainfluenzae, Hqfhia alvei, 
Kingella kingae, Klebsiella oxytoca, Klebsiella pneumoniae, 
Legionella pneumophila, Megamonas hypermegale, 
Moraxella atlantae, Moraxella catarrhal is, Morganella 
morganii, Neisseria gonorrheae, Neisseria meningitidis, 
Pastpurplla np.mopjtps Ppxfpurplln mijltnr.ida 

Peptostreptococcus magnus, Proteus mirabilis, Providencia 
alcalifaciens, Providencia rettgeri, Providencia rustigianii, 
Providencia stuartii, Pseudomonas aeruginosa, 

Pseudomonas fluorescens, Pseudomonas stutzeri, 
Salmonella bongori, Salmonella choleraesuis, Salmonella 
enteritidis, Salmonella gallinarum, Salmonella typhimurium, 
Serratia liquefaciens, Serratia marcescens, Shigella flexneri, 
Shigella sonnei, Staphylococcus aureus, Staphylococcus 
capitis Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Staphylococcus hominis, Staphylococcus 
lugdunensis, Staphylococcus saprophyticus, Staphylococcus 
simulans, Staphylococcus warneri, Stenotrophomonas 
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maltophilia, Streptococcus acidominimus, Streptococcus 
agalactiae^ Streptococcus anginosus, Streptococcus bovis, 
Streptococcus constellatus, Streptococcus cricetus, 
Streptococcus cristatus, Streptococcus dysgalactiae, 
Streptococcus equi, Streptococcus ferus, Streptococcus 
gordonii, Streptococcus intermedins, Streptococcus 
macacae, Streptococcus mitis, Streptococcus mutans, 

Streptococcus oralisT~~ Streptococcus parasanguinis; 

Streptococcus parauberis, Streptococcus pneumoniae, 
Streptococcus pyogenes, Streptococcus ratti, Streptococcus 
salivarius, Streptococcus sanguinis, Streptococcus sobrinus, 
Streptococcus uberis, Streptococcus vestibularis, Vibrio 
cholerae, Yersinia enterocolitica, Yersinia pestis, Yersinia 
pseudotuberculosis. 

656, 657, 271, for the detection and/or identification of Enterococcus sp. 

1136,1137 

701, 702 for the detection and/or identification of Leishmania sp. 

703, 704, 705, 706, for the detection and/or identification of Entamoeba sp. 
793 

794, 795 for the detection and/or identification of Trypanosoma cruzi 

796, 797, 808, 809, for the detection and/or identification of Clostridium sp. 
810,811 

798, 799, 800, 801, for the detection and/or identification of Cryptosporidium 
802,803,804,805, parvum 
806, 807 

816,817,818,819 for the detection and/or identification of Giardia sp. 
820, 821, 822 for the detection and/or identification of Trypanosoma 

brucei 

823, 824 for the detection and/or identification of Trypanosoma sp. 

SO 5 fnr tho A(*t(*nt\f\n cjnH/nr identification nf Rnvrlotalln cn 

923, 924, 925, 926, for the detection and/or identification of Trypanosomatidae 
927, 928 family 

933, 934 for the detection and/or identification of Enterobacteriaceae 

group 

994, 995, 996, 997, for the detection and/or identification of Streptococcus 
998,999,1000, pyogenes 
1001, 1200, 1210, 
1211 

1157,2079,2118 for the detection and/or identification of Candida 
parapsilosis 
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1158,1159, 2078, for the detection and/or identification of Candida glabrata 
2110,2111 

1 1 60, 2077, 2 1 1 9, for the detection and/or identification of Candida tropicalis 
2120 

1161, 207 5,2112, for the detection and/or identification of Candida krusei 

2113,2114 

1162 for the detection and/or identification of Candida 

guilliermondii 

1 1 62, 2080, 2115 for the detection and/or identification of Candida lusitaniae 
2116,2117 

1165 for the detection and/or identification of Candida 

zeylanoides 

1201 for the detection and/or identification of Streptococcus 

pneumoniae 

1233 for the detection and/or identification oiStaphylococcus sp, 

other than S. aureus 
1329, 1330, 133 1, for the detection and/or identification of Klebsiella 
1 332, 2 1 67, 228 1 pneumoniae 

1661 , 1 665 for the detection and/or identification of Escherichia coli 

and Shigella sp. 

1690, 1691, 1692, for the detection and/or identification of Acinetobacter 
1693,2169 baumanii 

1694,1695,2122 for the detection and/or identification of Pseudomonas 
aeruginosa 

1971,1 972, 1 973 for the detection and/or identification of Cryptococcus sp. 
208 1 , 2082, 2083 for the detection and/or identification of Legionella sp. 
2084 for the detection and/or identification of Legionella 

pneumophila 

2093, 2094, 2095, for the detection and/or identification of Mycoplasma 

^f\rm 

pneumoniae 

2106,2107 for the detection and/or identification of Cryptococcus 

neoformans 

2131,2132,2133 for the detection and/or identification of Campylobacter 
jejuni and C. coli 

2134, 2135, 2136 for the detection and/or identification of Bacteroides fragilis 
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2 1 70 for the detection and/or identification ofAbiotrophia adiacens 

2171 for the detection and/or identification of Gemella sp. 

2172 for the detection and/or identification of Enterococcus sp, 

Gemella sp., A. adiacens 

2 1 80, 2 1 8 1 , 2 1 82 for the detection and/or identification ofBordetella pertussis, 



said method comprising the step of contacting the nucleic acids of the sample 
with said primers or probes under suitable conditions of hybridization or of 
amplification and detecting the presence of hybridized probes or amplified 
products as an indication of the presence of said specificalgal, archaeal, bacterial, 
fungal or parasitical species, genus, family or group. 

18. A method for the universal detection of any bacterium, fungus or 
parasite in a sample, using a panel of probes or amplification primers or both, 
each individual probe or primer being derived from a nucleic acid as defined in 
claims 8 or 9, the method comprising the step of contacting the nucleic acids of 
the sample with said primers or probes under suitable conditions ofhybridization 
or of amplification and detecting the presence of any alga, archaeon, bacterium, 
fungus or parasite. 

19. A method as set forth in claim 17 or 18, which further comprises 
probes or primers, or both, for the detection of at least one antimicrobial agent 
resistance gene. 

00 A mat\*r>A on cot -fvvrth in Moitn 17 18 f\r 10 nrViJoV* -fiiftViov nnmnncop 

probes or primers, or both, for the detection of at least one toxin gene. 

21. A method as set forth in claim 19 or 20, wherein the probes or 
primers for the detection of said antimicrobial agent resistance gene or toxin gene 
have at least 12 nucleotides in length capable ofhybridizing with an antimicrobial 
agent resistance gene and/or toxin gene selected from SEQ ID NOs.: 

1078, 1079, 1085 for the detection and/or identification of the£. coli Shiga- 
like toxin 2 (sfi^) gene 



360 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



1080, 1081, 1084, 
2012 

1082, 1083 



1086, 1087 
1089 , 109 0 



1092, 1170 
1240, 2292 
1095, 1096 
1241,2294 
1111, 1112 
1114, 1115 
1118, 1119 
1121,1123 
1103, 1104 
1110 

1105, 1106 
1108 

1097, 1098 
1100, 1101 
1150, 1153 
1155 

1094, 1125 
1127, 1128 
1130, 1131 
1133, 1134 
1192, 1193 
1195, 1196 
1214, 1216 



1088, 
IQ91,_ 



1239, 

1171, 

2295 

1113, 

1116, 

1120, 

1124 

1109, 

1107, 

1099, 
1102 
1154, 

1126, 
1129, 
1132, 
1135, 
1194, 
1197, 
1217, 



nio inn i nnn 

ixio, i£iy, i/,z.u, 
2015, 2016, 2017, 
2018 ; 2019, 2020, 
2021,2022, 2023, 
2024, 2025, 2026, 
2027, 2028, 2029, 
2030, 2031,2032, 
2033,2034, 2035, 
2036, 2037, 2038, 
2039 



for the detection and/or identification of the£. coli Shiga- 
like toxin 1 (stxj) gene 

for the detection and/or identification ofE. coli Shiga-like 
toxins 1 and 2 (stx) genes 

for the detection and/or identification of thevan^ resistance 
gene 



for the detection and/or identification of thevanB resistance 
gene 
for the 



detection 
resistance genes 



and/or identification of the vanAB 



for the detection and/or identification of the vanCl 
resistance gene 

for the detection and/or identification of the vanC2 and 
vanC3 resistance genes 

for the detection and/or identification of the vanCl, vanC2 
and vanCS resistance genes 

for the detection and/or identification of the vanAXY 
resistance genes 

for the detection and/or identification of the£ pneumoniae 
pbpla gene 
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for the detection and/or identification of theS. pneumoniae 
pbp2b gene 

for the detection and/or identification of the<S. pneumoniae 
pbp2x gene 

for the detection and/or identification of IhemecA resistance 

gene 

1290, 1291, 1292, for the detection and/or identification of thegyrA resistance 

1293, 1294, 1295, gene 

1296, 1297, 1298, 

1333, 1334, 1335, 

1340, 1341, 1936, 

1937, 1940, 1942, 

1943, 1945, 1946, 

1947, 1948, 1949, 

2040,2041,2042, 

2043,2250, 2251 

1301 , 1302, 1303, for the detection and/or identification of thegyrB resistance 
1304, 1305, 1306 gene 

1308, 1309, 1310, for the detection and/or identification of theparC resistance 

1311,1312,1313, gene 

1314, 1315, 1316, 

1317, 1318, 1319, 

1336, 1337, 1338, 

1339, 1342, 1343, 

1934, 1935, 1938, 

1939, 1941, 1944, 

1950, 1951, 1952, 

1953, 1955,2044, 

2045, 2046 

i Tin nn no^ r — j~*~„*: — „„j/ — :a — „r*u it : — 

idll, uzj, uxt, iui uic uciciHiun iuiu/oi luciumcauiui ui uie/zurxi icsisuuiuc 

1325, 1326, 1327 gene 

1344, 1345, 1346, for the detection and/or identification of the aac(2')-Ia 

1347 resistance gene 

1349, 1350 for the detection and/or identification of the aac(3')-Ib 
resistance gene 

1352, 1353, 1354, for the detection and/or identification of the aac(3')-IIb 

1355 resistance gene 

1357, 1358, 1359, for the detection and/or identification of the aac(3')-TVa 

1360 resistance gene 

1362, 1363, 1364, for the detection and/or identification of the aac(3')-VIa 

1365 resistance gene 



362 



1142,1143, 1144, 
1145 

1146, 1147, 1148, 
1149 

1177, 1231 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



1367, 1368, 1369, for the detection and/or identification of the aac(6')-Ia 

1370 resistance gene 

1372, 1373, 1374, for the detection and/or identification of the aac(6')-Ic 

1375 resistance gene 

1377, 1378, 1379, for the detection and/or identification of the ant(3')-Ia 



1380 resistance gene 

1382, 1383, 1384, for the detection and/or identification of the ant(4')-Ia 
1385 resistance gene 

1387, 1388, 1389, for the detection and/or identification of the aph(3')-Ia 
1390 resistance gene 

1392, 1393, 1394, for the detection and/or identification of the aph(3')-IIa 
1395 resistance gene 

1397, 1398, 1399, for the detection and/or identification of the aph(3')-IIIa 
1400 resistance gene 

1402, 1403, 1404, for the detection and/or identification of the aph(3')-VIa 
1405,2252 resistance gene 

1407, 1408, 1409 for the detection and/or identification of the blaCARB 
1410 resistance gene 

1412, 1413, 1414, for the detection and/or identification of the blaCMY-2 
1415 resistance gene 

1417, 1418 for the detection and/or identification of the blaCTX-M- 

7and blaCTX-M-2 resistance genes 

1419, 1420, 1421, for the detection and/or identification of theblaCTX-M-1 

1422 resistance gene 

1424, 1425, 1426, for the detection and/or identification of the blaCTX-M-2 
1427 resistance gene 

1429,1430,1431, for the detection and/or identification of the blaIMP 
1432 resistance gene 

1434, 1435 for the detection and/or identification of the blaOXA2 

resistance gene 

1436, 1437 for the detection and/or identification of the blaOXAlO 

resistance gene 

1440, 1441 for the detection and/or identification of the blaPER-1 

resistance gene 

363 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



1443, 1444 for the detection and/or identification of the blaPER-2 

resistance gene 

1446, 1447, 1448, for the detection and/or identification of theblaPER-1 and 
1449 blaPER -2 resistance genes 

1 45_0,_1 451 for the d etection and/or identification_of ihtdfrA, resistance. 

gene 

1453, 1454, 1455, for the detection and/or identification of the dhfrla and 
1456 dhfrXV resistance genes 

1457, 1458, 1459, for the detection and/or identification of the dhfrla 
1460, 2253 resistance gene 

1462, 1463, 1464, for the detection and/or identification of the dhfrlb and 
1465 dhfrV resistance genes 

1466, 1467, 1468, for the detection and/or identification of the dhfrlb 
1469 resistance gene 

147 1 , 1 472, 1473, for the detection and/or identification of the <#*/r ^resistance 
1474 gene 

1476, 1477, 1478, for the detection and/or identification of the dhfrVI 
1479 resistance gene 

1481, 1482, 1483, for the detection and/or identification of the dhfrVII and 
1484 dhfrXVIl resistance genes 

1485,1486, 1487, for the detection and/or identification of the dhfrVII 
1488 resistance gene 

1490, 1491, 1492, for the detection and/or identification of the dhfrVIH 

l*t7J icaiaLcuiv^ gene 

1495, 1496, 1497, for the detection and/or identification of the dhfrIX 

1498 resistance gene 

1500,1501,1502, for the detection and/or identification of the dhfrXIl 

1503 resistance gene 

1505, 1506 for the detection and/or identification of the dhfrXIII 
resistance gene 

1508,1509,1510, for the detection and/or identification of the dhfrXV 

1511 resistance gene 

1513,1514,1515, for the detection and/or identification of the dhfrXVIl 

1516 resistance gene 
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1528, 1529 for the detection and/or identification of theere^and ereA2 

resistance genes 

1531, 1532, 1533, for the detection and/or identification of iheereB resistance 
1534 gene 

1536,. 1 5 3J7 ,_1 5 38, Jbrjhe_de^ _ofJhelinA_andJinA- 

1539 resistance genes 

1541, 1542, 1543, for the detection and/or identification of thclinB resistance 
1544 gene 

1546, 1547 for the detection and/or identification of thzme/A resistance 

gene 

1549, 1550 for the detection and/or identification of XhemeJE resistance 

gene 

1552, 1553, 1554, for the detection and/or identification of ihtmefA and mefE 
1555 resistance genes 

1556, 1557, 1558, for the detection and/or identification of the mphA and 
1559 mphK resistance genes 

1581, 1582, 1583, for the detection and/or identification of thesatG resistance 
1584 gene 

1586, 1587, 1588, for the detection and/or identification of the tetM resistance 
1589,2254 gene 

1 591 , 1 592, 1 593, for the detection and/or identification of ihevanD resistance 
2297 gene 

1595, 1596, 1597, for the detection and/or identification of XhevanE resistance 

1 (DO narta 
1J70 

1 609, 1 6 1 0, 1 6 1 1 , for the detection and/or identification of the vatB resistance 

1612 gene 

1 6 1 4, 1 6 1 5, 1 6 1 6, for the detection and/or identification of the vatC resistance 

1617 gene 

1619, 1620, 1621, for the detection and/or identification of thevga resistance 

1622 gene 

1624, 1 625, 1 626, for the detection and/or identification of XhevgaB resistance 

1627 gene 

1629, 1630, 1631, for the detection and/or identification of thevgfc and vgh 

1632 resistance genes 
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1634, 1635, 1636, for the detection and/or identification of thevgbB resistance 
1637 gene 

1883, 1884, 1885, for the detection and/or identification of the blaSHV 
1886, 1887, 1888, resistance gene 
1889, 1890, 1891, 

1892 , 1893, 1894, 

1895, 1896, 1897, 
1898 

1906, 1907, 1908, for the detection and/or identification of the blaTEM 

1909, 1910, 1911, resistance gene 

1912, 1913, 1914, 

1915, 1916, 1917, 

1918, 1919, 1920, 

1921, 1922, 1923, 

1924, 1925, 1926, 

2006, 2007, 2008, 

2009, 2141 

1961, 1962, 1963, for the detection and/or identification of thesulIJ resistance 
1964 gene 

1966, 1967, 1968, for the detection and/or identification of thete/5 resistance 
1969 gene 

2065, 2066, 2067, for the detection and/or identification of XhtrpoB resistance 

2068, 2069, 2070, gene 

2071 

2098, 2099, 21 00 for the detection and/or identification of theinhA resistance 
gene 

2 1 02, 2 1 03, 2 1 04 for the detection and/or identification of theembB resistance 
gene 

21 23, 2 1 24, 2125 for the detection and/or identification of theC. difficile cdtA 

IUMJ1 gene 

2126, 2127, 2128 for the detection and/or identification of theC. difficile cdtB 
toxin gene 

2142,2143 for the detection and/or identification of the mupA 

resistance gene 

2145, 2146 for the detection and/or identification of ih&catl resistance 

gene 

2148, 2149 for the detection and/or identification of fhecatll resistance 

gene 
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2 1 5 1 , 2 1 52 for the detection and/or identification of thecatlll resistance 

gene 

2 1 54, 2 1 55 for the detection and/or identification of theca/P resistance 

gene 

2-157,-2158,-2160, for_the-detection-and/or-identification-of-theca/-resistance- 

2161 gene 

2163,2164 for the detection and/or identification of the ppflo-like 

resistance gene. 



22. A composition of matter comprising a specific nucleic acid as set 
forth in claim 10 or 11, which is specific for a bacterial, fungal or parasitical 
species, genus, family, or group, or a nucleic acid as set forth in claim 8 or 9 
which is universal for a bacterium, fungus or parasite, or both specific and 
universal nucleic acids, in conjunction with a nucleic acid sequence of at least 12 
nucleotides capable of hybridizing with an antimicrobial agent resistance gene 
and/or toxin gene. 

23. A composition as set forth in claim 22, wherein the nucleic acid 
capable of hybridizing with an antimicrobial agent resistance gene and/or toxin 
gene is any one of: 

1078, 1079, 1085 for the detection and/or identification of the£. coli Shiga- 
like toxin 2 (stxj) gene 
1080, 1081, 1084, for the detection and/or identification of the£. coli Shiga- 
2012 like toxin 1 (stx } ) gene 

1082, 1083 for the detection and/or identification of£. coli Shiga-like 

toxins 1 and 2 (six) genes 
1086, 1087, 1088, for the detection and/or identification of thevaw^ resistance 
1089, 1090, 1091, gene 
1092,1170, 1239, 
1240, 2292 

1 095, 1 096, 1171, for the detection and/or identification of IhevanB resistance 
1241,2294,2295 gene 

1111,1112,1113, for the detection and/or identification of the vanAB 

1114,1115,1116, resistance genes 

1118,1119,1120, 

1121,1123,1124 
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1103, 


ll 04, 


1109, 


1110 






1105, 


ll 06, 


1107, 


1108 






1097, 


1098, 


1099, 


1100, 


1101, 


1102 


1150, 


1153, 


1154, 


1155 






1094, 


1125, 


1126, 


1127, 


1128, 


1129, 


1130, 


1131, 


1132, 


1133, 


1134, 


1135, 


1192, 


1193, 


1194, 


1195, 


1196, 


1197, 


1214, 


1216, 


1217, 


1218, 


1219, 


1220, 


2015, 


2016, 


2017, 


2018, 


2019, 


2020, 


2021, 


2022, 


2023, 


2024, 


2025, 


2026, 


2027, 


2028, 


2029, 


2030, 


2031, 


2032, 


2033, 


2034, 


2035, 


2036 


2037 


2038 


2039 






1142, 


1143, 


1144, 


1145 






1146, 


1147, 


1148, 


1149 

i 1 nn 


1 1 





for the detection and/or identification of the vanCl 
resistance gene 

for the detection and/or identification of the vanC2 and 
vanC3 resistance genes 

for the detection and/or identification of the vanCl, vanC2 
and vanC3 resistance genes 



11 / /, 1ZJ1 



1290, 
1293, 
1296, 
1333, 
1340, 
1937, 
1943, 
1947, 
2040, 
2043, 



1291, 
1294, 
1297, 
1334, 
1341, 
1940, 
1945, 
1948, 
2041, 
2250, 



1292, 
1295, 
1298, 
1335, 
1936, 
1942, 
1946, 
1949, 
2042, 
2251 



for the detection and/or identification of the vanAXY 
resistance genes 

for the detection and/or identification of theS. pneumoniae 
pbpla gene 



for the detection and/or identification of theS. pneumoniae 
pbp2b gene 

for the detection and/or identification of theiS, pneumoniae 
pbplx gene 

j _ x _ _a: if : j — _r"i.i --t 

ior tne ueiecuon anu/or luenuiicauon oi mcmec/i resistance 
gene 

for the detection and/or identification of thzgyrA resistance 
gene 
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1301, 1302, 1303, for the detection and/or identification of thegyrB resistance 

1304,1305, 1306 gene 

1 308, 1 309, 1310, for the detection and/or identification of theparC resistance 

1311,1312,1313, gene 
1314,1315,1316, 
1317, 1318, 1319, 

1336; 1 -337; 133 8; 



1339, 1342, 1343, 
1934, 1935, 1938, 
1939, 1941,1944, 
1950, 1951, 1952, 
1953, 1955,2044, 
2045, 2046 

1 322, 1 323, 1324, for the detection and/or identification of theparE resistance 

1325, 1326, 1327 gene 

1344, 1345, 1346, for the detection and/or identification of the aac(2')-Ia 

1347 resistance gene 

1349, 1350 for the detection and/or identification of the aac(3')-lb 
resistance gene 

1352, 1353, 1354, for the detection and/or identification of the aac(3')-IIb 

1355 resi stance gene 

1357, 1358, 1359, for the detection and/or identification of the aac (3') -IVa 

1360 resistance gene 

1362, 1363, 1364, for the detection and/or identification of the aac(3')-VIa 

1365 resistance gene 

1367,1368,1369, for the detection and/or identification of the aac(6')-Ia 

1370 resistance gene 

1372, 1373, 1374, for the detection and/or identification of the aac(6')-lc 

1375 resistance gene 

1377, 1378, 1379, for the detection and/or identification of the ant(3')-la 

1380 resistance gene 

1382, 1383, 1384, for the detection and/or identification of the ant(4')-Ia 

1385 resistance gene 

1387, 1388, 1389, for the detection and/or identification of the aph(3')-Ia 

1390 resistance gene 

1392, 1393, 1394, for the detection and/or identification of the aph(3')-IIa 

1395 resistance gene 

1397, 1398, 1399, for the detection and/or identification of the aph(3')-IIIa 

1400 resistance gene 
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1402, 1403, 1404, for the detection and/or identification of the aph(3')-Vla 

1405, 2252 resistance gene 

1407, 1408, 1409 for the detection and/or identification of the blaCARB 

1410 resistance gene 

1412 , 1413,1414 , for th e det e ction and/or identification of the blaCMY-2 



1415 resistance gene 

1417, 1418 for the detection and/or identification of the blaCTX-M- 

7and blaCTX-M -2 resistance genes 
1419, 1420, 1421, for the detection and/or identification of the blaCTX-M-1 
1422 resistance gene 

1424, 1425, 1426, for the detection and/or identification of the blaCTX-M-2 
1427 resistance gene 

1429,1430,1431, for the detection and/or identification of the blaIMP 
1432 resistance gene 

1434, 1435 for the detection and/or identification of the blaOXA2 

resistance gene 

1436, 1437 for the detection and/or identification of the blaOXAW 

resistance gene 

1440, 1441 for the detection and/or identification of the blaPER-1 

resistance gene 

1443, 1444 for the detection and/or identification of the blaPER-2 

resistance gene 

1446, 1447, 1448, for the detection and/or identification of the blaPER-1 and 



1/1 /in l/«dcd „„r.:r,4*~~~~ ~~~~~ 

1450, 1451 for the detection and/or identification of ftitdfrA resistance 

gene 

1453, 1454, 1455, for the detection and/or identification of the dhfrla and 
1 456 dhfrXV resistance genes 

1457, 1458, 1459, for the detection and/or identification of the dhfrla 
1 460, 2253 resistance gene 

1462, 1463, 1464, for the detection and/or identification of the dhfrlb and 
1465 dhfrV resistance genes 

1466, 1467, 1468, for the detection and/or identification of the dhfrlb 
1469 resistance gene 
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1471 , 1472, 1473, for the detection and/or identification of the tflz/rFresistance 

1474 gene 

1476, 1477, 1478, for the detection and/or identification of the dhfrVI 

1 479 resistance gene 

1_481,_1482,_L483, forJhe_detection_and/or_identification_^ 

1 484 dhfrXVII resistance genes 

1485, 1486, 1487, for the detection and/or identification of the dhfrVU 

1488 resistance gene 

1490, 1491, 1492, for the detection and/or identification of the dhfrVIIl 

1493 resistance gene 

1495, 1496, 1497, for the detection and/or identification of the dhfrIX 

1498 resistance gene 

1500, 1501, 1502, for the detection and/or identification of the dhfrXIl 

1503 resistance gene 

1505, 1506 for the detection and/or identification of the dhfrXlII 
resistance gene 

1508,1509,1510, for the detection and/or identification of the dhfrXV 

1511 resistance gene 

1513,1514,1515, for the detection and/or identification of the dhfrXVII 

1516 resistance gene 

1528, 1529 for the detection and/or identification of theere^and ereA2 
resistance genes 

1531, 1532, 1533, for the detection and/or identification of IheereB resistance 

1 SI A rrck*+e* 

1536, 1537, 1538, for the detection and/or identification of the linA and linA' 
1539 resistance genes 

1541, 1542, 1543, for the detection and/or identification of the/mfl resistance 
1544 gene 

1546, 1547 for the detection and/or identification of themefA resistance 

gene 

1549, 1550 for the detection and/or identification of theme/E resistance 

gene 

1552, 1553, 1554, for the detection and/or identification of XhcmefA and meJE 
1555 resistance genes 

371 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



1556, 1557,1558, for the detection and/or identification of the mphA and 
1559 mphK resistance genes 

1581, 1582, 1583, for the detection and/or identification of thesa/G resistance 
1584 gene 

1586, 1587, 1588, for the detection and/or identification of the tetM resistance 
1589, 2254 g ene 



1591, 1592, 1593, for the detection and/or identification of thevanD resistance 

2297 gene 

1595, 1 596, 1 597, for the detection and/or identification of thevanE resistance 

1598 gene 

1609, 1610, 161 1, for the detection and/or identification of thevatB resistance 

1612 gene 

1614, 1615, 1616, for the detection and/or identification of theva/C resistance 

1617 gene 

1619, 1620, 1621, for the detection and/or identification of thevga resistance 

1622 gene 

1624, 1 625, 1626, for the detection and/or identification of thevgaB resistance 

1627 gene 

1629, 1630, 1631, for the detection and/or identification of the vgb and vgh 

1632 resistance genes 

1634, 1635, 1636, for the detection and/or identification of thevgbB resistance 

1637 gene 

1883,1884,1885, for the detection and/or identification of the blaSHV 

1 886, 1 887, 1 888, resistance gene 

1889, 1890, 1891, 

1892, 1893, 1894, 

1895, 1896, 1897, 

1898 

1906, 1907, 1908, for the detection and/or identification of the blaTEM 

innn 1 n i n inn 

iyvy, iyiv, lyii, resistance gene 

1912, 1913, 1914, 

1915,1916, 1917, 

1918, 1919, 1920, 

1921, 1922, 1923, 

1924, 1925, 1926, 

2006, 2007, 2008, 

2009,2141 

1961, 1962, 1963, for the detection and/or identification of thesuffl resistance 

1964 gene 
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1966, 1967, 1968, for the detection and/or identification of thetetB resistance 
1969 gene 

2065, 2066, 2067, for the detection and/or identification of thcrpoB resistance 

2068, 2069, 2070, gene 

2071 

2098^2099^ 100— —for-tJie-detection-and/or-identifieati 

gene 

2 1 02, 2 1 03 , 2 1 04 for the detection and/or identification of thcembB resistance 

gene 

2123, 2124, 2125 for the detection and/or identification of theC difficile cdtA 
toxin gene 

2126,2127,2128 for the detection and/or identification of the C difficile cdtB 

toxin gene 

2142,2143 for the detection and/or identification of the mupA 

resistance gene 

2145, 2146 for the detection and/or identification of thtcatl resistance 

gene 

2148, 2149 for the detection and/or identification of ihecatll resistance 

gene 

2151,2152 for the detection and/or identification of thecatffl resistance 

gene 

2154, 2155 for the detection and/or identification of thecatP resistance 

gene 

2157, 2158, 2160, for the detection and/or identification of thecat resistance 
2161 gene 

2163,2164 for the detection and/or identification of the ppfloAike 



resistance gene. 



24. A nucleic acid having at least 12 nucleotides in length, capable of 
hybridizing with the nucleotide sequence of any one of thttuf sequences defined 
in SEQ ID NOs.: 1-73, 75-241, 399-457, 498-529, 612-618, 621-624, 675, 677, 
717-736, 779-792, 840-855, 865, 868-888, 897-910, 932, 967-989, 992, 1266- 
1287, 1518-1526, 1561-1575, 1578-1580, 1662-1664, 1666-1667, 1669-1670, 
1673-1683, 1685-1689, 1786-1843, 1874-1881, 1956-1960, 2183-2185, 2187- 
2188, 2193-2201, 2214-2249, 2255-2272. 
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25. A nucleic acid having at least 12 nucleotides in length, capable of 
hybridizing with the nucleotide sequence of any one of the atpD sequences 
defined in SEQ ID NOs.: 242-270, 272-398, 458-497, 530-538, 663, 667, 673, 
674, 676, 678-680, 737-778, 827-832, 834-839, 856-862, 866-867, 889-896, 929- 
931,_9A1-966,J24 

1649-1660, 1671, 1684, 1844-1848, 1849-1865,2189-2192. 

26. A nucleic acid having at least 12 nucleotides in length, capable of 
hybridizing with the nucleotide sequence of any one of the recA sequences 
defined in SEQ ID NOs.: 990-991, 1003, 1288-1289, 1714, 1756-1763, 1866- 
1873 and 2202-2212. 

27. A nucleic acid having at least 12 nucleotides in length, capable of 
selectively hybridizing with the nucleotide sequence of any one of the 
antimicrobial agent resistance gene sequences defined in SEQ ID NOs.: 1004- 
1075, 1255, 1607-1608, 1648, 1764-1785,2013-2014,2056-2064, 2273-2280. 

28. The nucleic acid sequences of the nucleic acids of any one of claims 
24 to 27. 

29. The use of a nucleic acid having at least 12 nucleotides in length 
capable of hybridizing with the nucleic acids of any one of the antimicrobial 
agent resistance genes sequences defined in SEQ ID NOs.: 1004-1075, 1255, 

1 ci\n uno *\cao tiza noc inn a nn<£ nntA 007^5 oocn -T/>«. *U<* 

detection and identification of microbial species. 

30. The use of a nucleic acid having at least 12 nucleotides in length 
capable of hybridizing with the nucleic acids of any one of the toxin genes 
defined in SEQ ID NOs.: 1078-1085, 2012 and 2123 to 2128 for the detection and 
identification of microbial species. 

31. A repertory of hexA nucleic acids used for the detection and/or 
identification of Streptococcus pneumoniae, which repertory is created by 
amplifying 
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the nucleic acids of any streptococcal species with any combination of 
primers SEQlDNOs.: 1179, 1181 and 1182. 

32. A repertory as defined in claim 31, which comprises the nucleic 
acids having a nucleotide sequence defmed in SEQ ID NOs.: 1 184 to 1 191. 



33. A repertory of nucleic acid sequences derived from the repertory of 
claim 31 or 32. 

34. A nucleic acid used for the specific and ubiquitous detection and for 
identification of Streptococcus pneumoniae, which is derived from the repertory 
of claim 3 1 . 

35. A nucleic acid as set forth in claim 34 which has a nucleic acid 
sequence of at least 12 nucleotides capable of hybridizing with said any 
Streptococcus pneumoniae and with any one of SEQ ID NOs.: 1 1 84 to 1 1 87. 

36. A nucleic acid as set forth in claim 34, which has a nucleic acid 
sequence of at least 12 nucleotides capable of hybridizing with the nucleic acids 
of Streptococcus pneumoniae and with any one of the nucleic acids having SEQ 
ID NOs.: 1179,1180,1181,1182. 

37. A peptide derived from the translation of the nucleic acids from the 
repertory obtained from the method of claim 1,31 or 32, or of the nucleic acids 

u^iui^u in an^y uiic ui viaiuio ^-r wj a* i , aiiu 

38. A peptide sequence derived from the peptide of claim 37. 

39. A recombinant vector comprising a nucleic acid obtained from the 
method of claim 1, 31 or 32, or from the nucleic acids defined in any one of 
claims 24 to 27, 35 and 36. 

40. A recombinant vector as defined in claim 39 which is an expression 
vector. 
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41. A recombinant host cell comprising the recombinant vector defined 
in claim 39 or 40. 

42. The use of the nucleic acid sequences defined in claim 28 or 33, or 
obtained from the method of claim 2 and of the protein sequences deduced from 
said_nucleic acid_sequences,_forJhe_de 

said microorganisms. 

43. The use as defined in claim 42, wherein said therapeutic agent is an 
antimicrobial agent, a vaccine or a genie therapeutic agent. 

44. A method for identification of a microorganism in a test sample, 
comprising the steps of: 

a) obtaining a nucleic acid sequence for a tuf, atpD, and/or recA 
genes of said microorganisms, and 

b) comparing said nucleic acid sequence with the nucleic acid 
sequences of a bank as defined in claim 5, said repertory comprising a 
nucleic acid sequence obtained from the nucleic acids of said 
microorganism, whereby said microorganism is identified when said 
comparison results in a match between said sequences. 
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2) INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetoJbacter baumannii 

(B) STRAIN: ATCC 19606 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



CAAACTCGTG AGCACATCCT TCTTTCTCGT CAGGTAGGTG TACCTTACAT 50 

CATCGTATTC TTAAACAAAT GCGACCTTGT TGATGACGAA GAATTACTTG 100 

20 AATTAGTAGA AATGGAAGTA CGTGAACTTC TTTCTACTTA TGACTTCCCA 150 

GGTGATGACA CTCCAGTAAT CCGTGGTTCA GCTCTTGCAG CGCTTAACGG 200 

TGAAGCTGGT CCTTACGGTG AAGAATCAGT TCTTGCTCTT GTAGCAGCAC 250 

TTGACTCTTA CATCCCAGAG CCAGAGCGTG CAATCGACAA AGCATTCTTG 300 

ATGCCAATCG AAGACGTATT CTCAATTTCT GGTCGTGGTA CAGTAGTAAC 350 

25 AGGCCGTGTT GAAGCTGGTA TCATCAAAGT TGGTGAAGAA GTAGAGATCG 400 

TTGGTATTAA AGATACAGTT AAAACAACTG TAACTGGCGT AGAAATGTTC 450 

CGTAAACTTC TTGACGAAGG CCGTGCAGGT GAGAACTGTG GTATCTTACT 500 

TCGTGGTACT AAGCGTGAAG AAGTACAACG TGGTCAAGTA CTTGCTAAAC 550 

CAGGTACAAT CAAGCCGCAC ACTAAATTCG ACGCAGAAGT ATACGTACTT 600 

30 TCTAAAGAAG AAGGTGGTCG TCACACTCCA TTCTTAAATG GTTACCGTCC 650 

ACAGTTCTAC TTCCGTACAA CTGACGTAAC TGGTGCRATC CAGTTGAAAG 700 

AAGGCGTTGA AATGGTAATG CCAGGTGACA ACGTTGAAAT GTCAGTAGAA 750 



35 

2) INFORMATION FOR SEQ ID NO : 2 

( i ) SEQUENCE CHARACTER I ST I CS : 
(A) LENGTH: 82 6 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
{ uT % tufuTIugyT" binear 

(ii) MOLECULE TYPE: Genomic DNA 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

CGGTGCGATC CTCGTGGTCG CCGCGACCGA CGGCCCCATG GCCCAGACCC 50 

GCGAGCACGT CCTGCTCGCC CGTCAGGTCG GCGTTCCCAC CATCCTCATC 100 

GCCCTCAACA AGTCCGACAT GGTTGACGAC GAGGAAATGA TGGAACTGGT 150 

55 CGAGGAGGAG TGCCGCGACC TGCTGGAGTC CCAGGACTTC GATCGCGATG 200 

CCCCGATCGT CCAGGTTTCC GCTCTGAAGG CCCTCGAGGG CGACGCGGAG 250 

TGGGTTGCCA AGATCGAGGA GCTCATGGAG GCTGTGGATT CCTACATCCC 300 

CACCCCCGAG CGCGATATGG ACAAGCCCTT CCTCATGCCG ATCGAGGACG 350 

TCTTCACGAT CACAGGTCGT GGCACGGTCG TCACGGGGCG TGTTGAGCGT 400 

60 GGCAAGCTGC CGATCAACTC CGAGGTCGAG ATCCTCGGTA TCCGTGATCC 450 

3 
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CCAGAAGACC 
AGGCATGGGC 
GATGAGGTTG 
TCACACCGAG 
5 GCCGTCACAA 
ACCACGGACG 
CATGCCTGGC 
CCATGGAGCC 



ACGGTCACCG 
CGGCGAGAAC 
AGCGCGGCCA 
TTCGAGGGCC 
CCCGTTCTTC 
TGACCGGCGT 
GACACCACCG 
CGGCTGGGCT 



GCATCGAGAT 
TGTGGCCTGC 
GGTTGTGGCC 
AGGTTTACAT 
TCGAACTACC 
CATCACCCTC 
AGATCTCCGT 
TCGCCA 



GTTCCACAAG 
TGCTGCGCGG 
ATTCCCGGCT 
CCTCAAGAAG 
GTCCGCAGTT 
CCCGAGGGCA 
TCAGCTGATC 



TCGATGGACG 
TACCAAGCGC 
CCATCACGCC 
GAAGAGGGCG 
CTACTTCCGT 
CCGACATGGT 
CAGCCCATCG 



500 
550 
600 
650 
700 
750 
800 
826 



10- 



2) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aerococcus viridans 

(B) STRAIN: ATCC 11563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



TGGTGCGATC 
GTGAGCACAT 

30 TTCTTAAACA 
TGAAATGGAA 
ATCTACCTGT 
GCTCAAGAAG 
TCCAGAACCA 

35 ATGTATTCTC 
CGTGGTGAAG 
ACAAATCGGT 
TAGACTACGC 
CAACGTGAAG 

40 CACTCCACAT 
AAGGTGGACG 



TTAGTAGTAT 
CCTTTTAGCT 
AAGTTGACCA 
GTTCGTGACT 
AATCGCTGGT 
CTAAAATCAT 
GAACGTGACA 
AATCACTGGT 
TTCGTACAGG 
AAATCAGTTG 
TCAAGCTGGT 
ACATCCAACG 
ACTAAATTTA 
TCATACACCA 



CTGCTGCTGA 
GGCCAAATCG 
AGTTGACGAT 
TATTATCTGA 
TCTGCTTTAT 
GGAATTAATG 
ACGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GACAACATCG 
TGGTCAAGTA 
AAGCGCAAGT 
TTCTTAACTA 



TGGTCCAATG 
GTGTTCCTGC 
GAAGAATTAC 
GTACAACTAC 
TAGCATTACA 
GAAGCTGTAG 
ATTCATGATG 
TTGCTACAGG 
GACATCGTTG 
TGAAATGTTC 
GTGCATTATT 
TTGGCTGCTC 
TTACGTTTTA 
ACTACCGTCC 



CCACAAACTC 
ATTCGTAGTA 
TAGAATTAGT 
CCAGGTGACG 
AGGCGATGAA 
ACTCTTACAT 
CCAATTGAGG 
TCGTGTTGAA 
GTATTGCTGA 
CGTAAAAACT 
ACGTGGTGTT 
CTGGTTCAAT 
TCTAAAGAAG 
ACAATTCTAC 



TTCCGTACTA CTCjACATTAU IWl'UTTATC ACT1TACCJAU AAUACUTAUU 
TATGGTTATG CCTGGTGACA ACGTTGATAT GGACGTTGAA TTGATTCACC 
CAGTTGCGAT CGAAGATGGT ACTAAATTCT CTATC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 



7SU 
800 
835 



45 



2) INFORMATION FOR SEQ ID NO: 4 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AchromoJbacter xylosoxidans subsp. 
60 denitrificans 
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(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

5 CCTGGTGGTG TCGGCCGCTG ACGGCCCGAT GCCGCAAACG CGCGAACACA 50 

TCCTGCTGAG CCGCCAGGTT GGCGTGCCGT ACATCATCGT CTTCCTGAAC 100 

AAGGCCGACA TGGTTGACGA CGCCGAGCTG CTTGAGCTGG TGGAAATGGA 150 

AGTTCGCGAR CTGCTGAGCA AGTACGACTT CCCGGGCGAC GACACCCCGA 200 

TCGTGAAGGG TTCGGCCAAG CTGGCGCTGG AAGGCGACAA GGGCGAACTG 250 

10 GGCGAACAGG CCATCATGGC GCTGGCCGCT GCGCTGGACT CGTACATCCC 300 



40 



GACGCCTGAG CGTGCCGTTG ACGGCGCGTT CCTGATGCCG GTTGAAGACG 350 

TGTTCTCGAT CTCGGGTCGC GGCACCGTGG TGACCGGCCG TATCGAACGC 400 

GGCATCATCA AGGTCGGCGA GGAAATCGAA ATCGTCGGTC TGGTGCCGAC 450 

GGTGAAGACG ACCTGCACGG GCGTGGAAAT GTTCCGCAAG CTGCTGGACC 500 

15 AAGGTCAAGC CGGCGACAAC GTGGGCATCC TRCTGCGCGG CACCAAGCGT 550 

GAAGACGTCC AGCGCGGCCA GGTTCTGGCC AAGCCGGGCT CGATCACCCC 600 

GCACACGGAC TTCACGTCCG AGGTGTACAT CCTGTCCAAG GAAGAAGGCG 650 

GCCGTCACAC TCCGTTCTTC CAAGGCTATC GTCCCCAGTT CTACTTCCGC 700 

ACGACGGACG TGACGGGCAC GATCGAGCTG CCGGCCGACA AGGAAATGGT 750 

20 CCTGCCGGGC GACAACGTGG CCATGACGGT CAAGCTGCTG GCTCCGATCG 800 

CCATGGAAGA AGGCCTGCGT TCGCCAC 827 



25 2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Anaerorhabdus furcosus 

(B) STRAIN: ATCC 25662 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 



TGGATCAATC CTAGTAGTTG CTGCAACTGA TGGACCAATG CCTCAAACTC 50 

GTGAACATAT CTTACTTGCT CGTCAAGTAG GTGTTCCAAG AATGGTTGTA 100 

TTCTTGAACA AATGCGACAT GGTTGAAGAT GAAGAATTAA TCGACCTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTAAGTGC TTACGGTTTC GAAGGTGATG 200 

45 ATACACCAGT TATCCGTGGT TCTGCATTAA AATCTCTTGA AGGAAATGCT 250 

GATTGGGAAG CAAAAGTTGC TGAATTAATG GATGCAGTTG ACTCTTGGAT 300 

TCCAACTCCA ACTCATGAAA CAGACAAACC ATTCTTAATG GCTGTTGAAG 350 

ATGTATTCAC AATTACAGGT CGTGGTACAG TTGCTACTGG ACGTGTTGAA 400 

CGTGGACACT TAAACCTTAA CGAAGAAGTT GAAATCGTTG GTATTCATGA 450 

50 TACTAAGAAA TCAGTTGTTA CTGGTATCGA AATGTTCCGT "AAATTATTAG 500 

ACTATGCTGA AGCAGGAGAC AACATTGGTG CATTATTACG TGGTGTTTCT 550 

CGTGATGAAA TCGAACGTGG ACAATGTCTA GCTAAACCTG GATCAGTTAC 600 

TCCACATACA GCTTTCAAAG CTCAAGTATA CGTATTAACT AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC GTAACTAACT ACCGTCCTCA ATTCTATTTC 700 

55 CGTACAACTG ACGTAACAGG AGTTGTTAAA CTTCCTGAAG GTACTGAAAT 750 

GGTTATGCCT GGAGACAACA TCGAAATGAT CGTTGAATTA ATCGCTCCAA 800 

TCGCTGTTGA ACAAGGAACT AAG 823 
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2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracic 

(B) STRAIN: 4229 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACTC 50 

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTACCTTA CATCGTTGTA 100 

TTCTTAAACA AATGCGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

20 AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 250 

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 300 

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 350 

ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400 

25 CGCGGTATCG TTAAAGTTGG TGACGTAGTA GAAATCATCG GTCTTGCTGA 450 

AGAAAATGCT TCTACAACTG TAACTGGTGT AGAGATGTTC CGTAAACTTC 500 

TTGACCAAGC TCAAGCTGGA GACAACATCG GTGCTTTACT TCGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA CTTGCAAAAA GCGGTTCTGT 600 

AAAAGCTCAC GCTAAATTCA AAGCTGAAGT TTTCGTATTA TCTAAAGAAG 650 

30 AAGGTGGACG TCACACTCCA TTCTTCGCTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTATCATC CAATTACCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGACA ACATCGAAAT GACTATCGAA CTTATCGCTC 800 

CAATCGCTAT CGAAGAGGGA ACTAA 825 



35 



50 



2) INFORMATION FOR SEQ ID NO: 7 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

IcT' S iRjwDtiUNti&i:~ Double 
(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACAC 50 

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTTCCTTA CATCGTTGTA 100 

55 TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 250 

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 300 

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 350 

60 ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400 
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CGCGGTATCG 
AGAAAATGCT 
TTGACCAAGC 
GCTCGTGAAG 
AAAAGCTCAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCTAT 



TTAAAGTTGG 
TCTACAACTG 
TCAAGCTGGA 
ACATCCAACG 
GCTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGACA 
CGAAGAGGGA 



TGACGTAGTA 
TAACTGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGAAGT 
TTCTTCGCTA 
TGGTATCATC 
ACATTGAAAT 
ACTAAATTC 



GAAATCATCG 
AGAGATGTTC 
GTGCTTTACT 
CTTGCAAAAA 
TTTCGTATTA 
ACTACCGTCC 
CAATTACCAG 
GACTATCGAA 



GTCTTGCTGA 
CGTAAACTTC 
TCGTGGGGTT 
GCGGTTCTGT 
TCTAAAGAAG 
TCAGTTCTAC 
AAGGTACTGA 
CTTATCGCTC 



450 
500 
550. 
600 
650 
700 
750 
800 
829 
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15 



20 



25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

CGGTGCTATC ATCGTAGTTG CTGCTACTGA TGGTCCTATG CCTCAAACTC 
GCGAGCACAT CCTTTTGGCT CGTCAGGTAA ACGTTCCGAG ATTGGTTGTA 
TTCATGAACA AGTGTGACAT GGTTGACGAC GAGGAAATGT TGGAATTGGT 



TGAGATGGAG ATGAGAGAGT 
ACACTCCGAT CATCCGTGGT 



CAATGGGAAG ATAAAGTAAT 
TCCTCTGCCT CCGCGCGAAA 
ACGTATTCTC AATCACGGGT 
ACAGGTATTG TTAAGGTTGG 
TGCTGGTAAG AAATCTGTTG 
TGGATCAAGG TGAGGCTGGT 
GATAAGAATG AGATCAAGCG 
TAAAGAGCAT TCTAAGTTCA 

AAGGTGGTCG TCACACTCCG 
ATCCGTACAT TGGATGTAAC 



TGCTTTCATT CTATCAATTC 
TCTGCTCTTG GTGCATTGAA 
GGAGTTGATG GAAGCTTGTG 
TCGACAAGCC GTTCTTGATG 
CGTGGTACTG TTGCTACAGG 
TGAGGAAGTT CAGATCATCG 
TTACAGGTGT TGAGATGTTC 
GATAACGTTG GTTTGTTGCT 
TGGTATGGTA ATCTGCCACC 
AGGCTGAGGT TTATATCTTG 
TTCCACAACA AATATCGTCC 
TGGTGAGATC ACTTTGCCGG 



GACGGTGACA 
CGGTGATGCT 
ATACTTGGAT 
CCGGTTGAGG 
TCGTATCGAG 
GTCTTGGCGC 
CGTAAGTTAT 
TCGCGGTATC 
CGGGTCAGGT 
AAGAAAGAGG 

TC^GTTOTAT 
AAGGAACTGA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 



45 



AATGGTAATG CCGGGTGATA ACGTAACGAT 
CGGTAGCATG TAGCGTAG 



CGAGGTTGAG TTGATCTATC 



700 
750 
800 
818 



2) INFORMATION FOR SEQ ID NO: 9 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus casseliflavus 

7 
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(B) STRAIN: R763 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

5 GGTCCTATGC CTCAAACACG TGAACACATC TTGTTATCAC GTAACGTTGG 50 

TGTACCATAC ATCGTTGTTT TCTTAAACAA AATGGATATG GTTGATGACG 100 

AAGAATTACT AGAATTAGTT GAAATGGAAG TTCGTGACTT ATTGTCAGAA 150 

TATGACTTCC CAGGCGACGA TGTTCCTGTA ATCGCTGGTT CTGCTTTGAA 200 

AGCTCTTGAA GGCGATGCTT CATACGAAGA AAAAATCATG GAATTAATGG 250 

10 CT GCAGTT GA CGAATACGTT CCAAC T CCAG AACGTGACAC TGACAAACCA 300 . 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 350 

TGCTACAGGC CGTGTTGAAC GTGGACAAGT TCGCGTTGGT GACGAAGTTG 400 

AAATCGTTGG TATTGCTGAA GAAACTGCTA AAACAACTGT AACTGGTGTT 450 

GAAATGTTCC GTAAATTGTT AGACTATGCT GAAGCAGGGG ATAACATTGG 500 

15 TGCATTGCTA CGTGGTGTTG CTCGTGAAGA CATCCAACGT GGACAAGTAT 550 

TGGCTAAAGC TGGTACAATC ACACCTCATA CAAAATTTAA AGCTGAAGTT 600 

TACGTTTTAA CAAAAGAAGA AGGTGGACGT CACACACCA 639 



20 



2) INFORMATION FOR SEQ ID NO: 10 



( i ) SEQUENCE CHARACTER I ST I CS : 
(A) LENGTH: 692 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



35 



45 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG 197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 



GAACACATTC TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT 
CTTAAACAAA GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG 
AAATGGAAGT TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT 
40 GTACCTGTAA TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA 
CTATGAGCAA AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC 



CAACACCAGA 
GTATTCTCAA 
TGGTCAAATC 
AATCAAGCAA 
GACTACGCTG 
ACGTGATGAC 
CACCACATAC 
GGTGGTCGTC 
CCGTACTACT 



ACGTGATTCT 
TCACTGGTCG 
AAAGTCGGTG 
AACAACTGTT 
AAGCTGGTGA 
GTACAACGTG 
AAAATTCAAA 
ATACACCATT 
GACGTAACTG 



GACAAACCAT 
TGGTACTGTT 
AAGAAATCGA 
ACTGGTGTAG 
CAACATTGGT 
GTCAAGTTTT 
GCGGATGTTT 
CTTCACTAAC 
GTGTTGTTAA 



TCATGATGCC 
GCTACAGGCC 
AATCATCGGT 
AAATGTTCCG 
GCATTATTAC 
AGCTGCTCCT 
ACGTTTTATC 
TACCGCCCAC 
CTTACCAGAA 



AGTTGAGGAC 
GTGTTGAACG 
ATGCAAGAAG 
TAAATTATTA 
GTGGTGTTTC 
GGTACTATTA 
TAAAGATGAA 
AATTCTATTT 
GG 



50 
100 
150 
200 
250 



300 
350 
400 
450 
500 
550 
600 
650 
692 



2) INFORMATION FOR SEQ ID NO: 11 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

8 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Bacteroides ova t us 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

10 CGGTGCTATC_AT.CG.TTT.GTG _CTGCAACT.GA_TGGT.CCGATG_ CCT.CAAACT.C 5.0. 

GCGAACACAT TCTGTTAGCT CGTCAGGTAA ACGTACCTCG TCTGGTTGTA 100 

TTCTTGAACA AATGCGATAT GGTAGACGAC GAAGAAATGT TGGAACTCGT 150 

TGAAATGGAA ATGAGAGAAC TCCTTTCATT CTATGATTTC GATGGTGACA 200 

ATACTCCTAT CATCCGTGGT TCTGCTCTTG GCGCATTGAA CGGTGTTGAA 250 

AAATGGGAAG ACAAAGTTAT GGAACTGATG GATGCAGTTG ATAACTGGAT 300 

TCCACTGCCT CCGCGCGATG TTGATAAACC ATTCTTGATG CCGGTTGAAG 350 

ACGTGTTCTC TATCACAGGT CGTGGTACTG TAGCAACAGG TCGTATCGAA 400 

ACAGGTGTCA TCCACGTTGG TGATGAAGTC GAAATTCTTG GTTTAGGTGA 450 

AGATAAGAAA TCAGTTGTAA CTGGTGTTGA AATGTTCCGT AAACTGTTGG 500 

ATCAAGGTGA AGCTGGTGAC AACGTAGGTC TTTTGCTTCG TGGTATTGAC 550 

AAGAACGAAA TCAAACGTGG TATGGTTCTT TGTAAACCAG GTCAGATTAA 600 

ACCGCACTCT AAATTCAAAG CTGAGGTTTA TATCTTGAAG AAAGAAGAAG 650 

GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA GTTCTACTTG 700 

CGTACTATGG ACTGTACAGG TGAAATCACT TTGCCGGAAG GAACAGAAAT 750 

GGTAATGCCG GGTGATAACG TAACTATTAC AGTTGAGTTG ATTTACCCAG 800 

TAGCATTGAA CCCGGGCTTC G 821 



15 



20 



25 



30 2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 
IB)~ STRAIN :" " ATCC* 4 9882* " 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



TGGTGCGATT TTGGTTGTTT CAGCTGCTGA TGGTCCGATG CCTCAAACAC 50 

GTGAGCATAT TCTTCTTGCC CGTCAGGTTG GTGTTCCAGC GATTGTTGTT 100 

TTTCTTAATA AGGTTGATCA GGTTGATGAT GCTGAGCTTT TGGAGCTTGT 150 

TGAGCTTGAA GTTCGGGAGT TATTGTCGAA ATATGATTTT CCAGGAGACG 200 

50 ATATTCCGAT CGTTAAAGGT TCTGCTTTGG CAGCGCTTGA AGATAAAGAT ' 250 

AAAAGCATTG GTGAAGATGC GGTTCGTCTT TTGATGAGTG AAGTTGATAA 300 

TTATATACCG ACGCCTGAAC GTCCTGTTGA TCAGCCGTTT TTGATGCCAA 350 

TTGAAGATGT TTTTTCGATT TCGGGTCGTG GAACTGTTGT GACGGGTCGT 400 

GTTGAGCGTG GTGTTATTAA GGTTGGTGAA GAAGTTGAGA TTATCGGCAT 450 

55 TCGTCCAACT TCTAAGACAA CAGTTACAGG GGTTGAAATG TTCCGCAAGC 500 

TTTTAGATCA GGGGCAAGCG GGTGATAATA TTGGAGCGCT GCTTCGTGGT 550 

ATTGATCGTG AAGGGATTGA GCGTGGACAA GTTTTGGCGA AGCCTGCTTC 600 

GGTTACACCT CATACGAGAT TTAAAGCAGA GGCTTACATT TTGACGAAAG 650 

ATGAAGGTGG TCGTCATACT CCATTTTTCA CGAATTATCG TCCTCAGTTT 700 

60 TATTTCCGTA CTACGGATGT AACGGGAATT GTTACGCTTC CAGAAGGTAC 750 

9 
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AGAGATGGTT ATGCCTGGTG ATAATGTTGC TATGGATGTC TCTCTGATTG 800 
TTCCAATTGC CATGGAAGAA AAACTTCGTT TTGCTATC 838 



2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 839 bases 
10 (B) TYPE: Nucleic acid 

(_C)— STRANDEDNESS :- Double- 

(D) TOPOLOGY: Linear 



15 



40 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TGGCGCCATC CTTGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTCCCGAA GATCCTCGTC 100 

GCTCTGAACA AGTGCGATAT GGTCGACGAC GACGAGCTCA TCGAGCTCGT 150 

25 TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAATGGCTTC GATCGCGATT 200 

GCCCGGTCAT CCACGTGTCC GCTTACGGCG CACTGCACGA TGACGCTCCG 2 50 

GACCACGAGA AGTGGGTTGA GCAGATCAAG AAGCTCATGG ACGCCGTCGA 3 00 

TGACTACATC CCGACCCCGG TCCACGATCT GGACAAGCCG TTCCTGATGC 350 

CGATCGAAGA TGTCTTCACC ATCTCCGGCC GTGGCACCGT GGTGACCGGC 4 00 

30 CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTCG AGATCGTCGG 450 

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGCAGATGGA CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550 

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 600 

CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 

35 AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGCCGGAAGG 750 

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 800 

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG ■ 839 



2) INFORMATION FOR SEQ ID NO: 14 



( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium dentium 

(B) STRAIN: ATCC 27534 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTGCCGCG TATCCTCGTC 100 

60 GCCCTGAACA AGTGCGATAT GGTCGACGAC GAAGAGCTCA TCGAGCTCGT 150 

10 
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TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAACGGCTTC GATCGCGATT 2 00 

GCCCGGTCAT CCACACCTCC GCCTACGGCG CGCTGCACGA TGACGCTCCG 2 50 

GACCACGACA AGTGGGTTGA GTCCGTCAAG GAACTCATGA AGGCCGTCGA 3 00 

CGAGTACATC CCGACCCCGA CCCACGATCT GGACAAGCCG TTCCTGATGC 350 

5 CGATCGAAGA TGTGTTCACC ATCTCCGGCC GTGGCACCGT GGTTACCGGC. 400 

CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTTG AGATCGTCGG 450 

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGCAGATGGA CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550 

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 6 00 

10 CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 



AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGCCGGAAGG 750 

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 8 00 

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG 839 



15 



2) INFORMATION FOR SEQ ID NO: 15 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 
30 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

TGGCGCGATC CTGGTGGTTT CGGCTGCTGA CGGCCCGATG CCGCAGACCC 50 

35 GCGAGCACAT CCTGCTTGCC CGTCAGGTTG GCGTTCCGGC GATCGTCGTG 100 

TTCCTCAACA AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 

TGAACTGGAA GTGCGCGAAC TTCTGTCGAA GTACGAATTC CCCGGCGACG 200 

AAATCCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCC 250 

AAGGAACTGG GCGAAGATGC CATCCGCAAC CTGATGGACG CGGTTGACAG 300 

40 CTACATTCCG ACCCCGGAAC GCCCGATCGA CCAGCCGTTC CTGATGCCGA 350 

TCGAAGACGT GTTCTCGATC TCCGGCCGTG GTACGGTTGT GACGGGTCGC 400 

CAAGGCGACG ACGAAGACCA CGGTTACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTCGACCA GGGCCAGGCT GGCGACAACA TTGGCGCGCT GATCCGCGGC 550 

45 GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

TGTGAAGCCG CACACCAAGT TTAAGGCAGA AGCCTATATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTCA CCAACTACCG TCCGCAGTTC 700 

TACTTCCGTA CGACGGACGT GACGGGTGTT GTGACGCTTC CGGCTGGCAC 750 

GGAAATGGTC ATGCCTGGCG ATAACGTCGC CATGGACGTT ACCCTGATCG 800 

50 TGCCGATCGC CATGGAAGAG AAGCTTCGCT TCGCTATC 838 



2) INFORMATION FOR SEQ ID NO: 16 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

11 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Burkholderia cepacia 

(B) STRAIN: LSPQ 2217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

10 GGCAGCAGAC G GCCCGATG C CGCAA A CGCG TGAGCA CATC CTGC TGGCGC 50 

GTCAGGTTGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA GTGCGACATG 100 

GTGGACGACG CCGAACTGCT CGAGCTGGTC GAGATGGAAG TTCGCGAACT 150 

CCTGTCGAAG TACGACTTCC CGGGCGACGA CACGCCGATC GTGAAGGGTT 200 

CGGCGAAGCT GGCGCTGGAA GGCGACACGG GCGAGCTGGG CGAAGTGGCG 250 

15 ATCATGAGCC TGGCCGACGC GCTGGACACG TACATCCCGA CGCCGGAGCG 300 

TGCAGTTGAC GGCGCGTTCC TGATGCCGGT GGAAGACGTG TTCTCGATCT 350 

CGGGCCGCGG TACGGTGGTG ACGGGTCGTG TCGAGCGCGG CATCGTGAAG 400 

GTCGGCGAAG AAATCGAAAT CGTCGGTATC AAGCCGACGG TGAAGACGAC 450 

CTGCACGGGC GTTGAAATGT TCCGCAAGCT GCTGGACCAA GGTCAAGCAG 500 

20 GCGACAACGT TGGTATCCTG CTGCGCGGCA CGAAGCGTGA AGACGTGGAG 550 

CGTGGCCAGG TTCTGGCGAA GCCGGGTTCG ATCACGCCGC ACACGCACTT 600 

CACGGCTGAA GTGTACGTGC TGAGCAAGGA CGAAGGCGGC CGTCACACGC 650 

CGTTCTTCAA CAACTACCGT CCGCAGTTCT ACTTCCGTAC GACGGACGTG 700 

ACGGGCTCGA TCGAGCTGCC GAAGGACAAG GAAATGGTGA TGCCGGGCGA 750 

25 CAACGTGTCG ATCACGGTGA A 771 



2) INFORMATION FOR SEQ ID NO: 17 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

45 GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCCCAG TACGACTTCC CGGGCGACGA 200 

TACTCCAATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

50 AGTGGGAAGC TAAAATCGTT GAGCTGGCTG GCTACCTGGA TTCTTACATC 300 

CCTGAGCCAG AGCGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

55 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCAGG CTCTATCAAG 600 

CCACACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGC ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

60 GTAATGCCTG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

12 
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CGCGATGGAT GACGGTCTGC GTTTCGCAA 



5 2) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

JLO (C) STRAND.EDNE.S.S_:_D_0_ub.l.e___ 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



CGCTATCCTG GTTGTTGCTG CGACTGACGG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGATGACA 200 

25 CTCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAAGCAGAG 250 

TGGGAAGCTA AAATYGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 3 00 

AGAACCAGTA CGTGCAATCG AYCTGCCGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 4 00 

GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 4 50 

30 TGCGAAATCT ACCTGTACCG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT 700 

35 ACAACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 750 

AATGCCAGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCG 824 



40 

2) INFORMATION FOR SEQ ID NO: 19 

( i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH.: 827 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

CGCTATTCTG GTTGTTGCTG CAACTGACGG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGC CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTAGA 150 

60 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCA GGCGATGATA 200 

13 
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CCCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAAGCAGAG 250 

TGGGAAGCTA AAATCGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 300 

AGAACCAGTA CGTGCAATCG ACCTGCCGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TKACCGGTCG TGTAGAGCGC 400 

5 GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 450 

TGCGAAATCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650 

10- — GGGGTGATAG-TGCGTTCTTC-AARGGCTACC-GTCCACAGTT-CTACT-TCCGT^ — -700 

ACCACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 750 

AATGCCAGGT GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTGGCAA 827 



15 



2) INFORMATION FOR SEQ ID NO: 20 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



GCGGAGCTAT CCTAGTCGTT TCAGCTACAG ACGGAGCTAT GCCACAAACT 50 

AAAGAACATA TCTTGCTAGC TCGCCAGGTT GGAGTTCCTT ATATCGTTGT 100 

35 TTTCTTGAAT AAAGTAGATA TGATCTCTCA AGAAGATGCT GAACTTATTG 150 
ACCTTGTTGA GATGGAACTT AGTGAGCTTC TTGAAGAAAA AGGCTACAAA - 200 

GGATGCCCTA TTATCCGTGG TTCTGCTTTG AAAGCTCTTG AAGGTGATGC 250 

AAATTATATC GAAAAAGTTC GAGAACTTAT GCAAGCTGTG GATGACAACA 300 

TCCCTACACC AGAAAGAGAA ATTGATAAGC CTTTCTTAAT GCCTATCGAA 350 

40 GACGTATTCT CAATCTCTGG TCGTGGTACT GTGGTTACAG GAAGAATCGA 4 00 

GCGTGGAATC GTTAAAGTTT CTGATAAAGT TCAGCTCGTG GGATTAGGAG 450 

AGACTAAAGA AACAATCGTT ACTGGAGTCG AAATGTTCAG GAAAGAACTT 500 

CCTGAAGGTC GTGCAGGAGA AAACGTTGGT TTACTCCTCA GAGGTATTGG 550 

AAAGAACGAT GTTGAAAGAG GTATGGTGGT TTGTCAGCCT AACAGCGTGA 600 

AGCCTCATAC GAAATTTAAG TCAGCTGTTT ACGTTCTTCA GAAAGAAGAA 650 

GGCGGACGTC ATAAGCCTTT CTTCAGCGGA TACAGACCTC AGTTCTTCTT 700 

CCGTACTACA GACGTGACAG GAGTCGTAAC TCTTCCTGAA GGAACTGAAA 750 

TGGTAATGCC TGGAGATAAC GTTGAGCTTG ATGTTGAGCT CATTGGAACA 800 

GTTGCTCTTG AAGAAGGAAT GAGATTTGCA A 831 



2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

14 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia psittaci 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

TGGAGCGATT CTCGTTGTTT CCGCTACTGA CGGTGCGATG CCTCAGACCA 50 
AAGAACATAT TCTTTTGGCG AGACAGGTTG GTGTTCCTTA CATCGTTGTT 100 
10 TTCCTTAACA AAATCGATAT GATTTCTCAA GAAGATGCTG AGCTCGTAGA 150 



CTTAGTTGAA ATGGAATTGT CCGAACTTCT AGAAGAAAAA GGTTATAAAG 200 

GTTGCCCAAT TATCCGTGGT TCTGCTTTGA AAGCCTTAGA AGGTGATGCA 250 

AGCTACGTTG AAAAAATTCG CGAGTTAATG CAAGCAGTGG ATGATAACAT 3 00 

CCCTACTCCA GAGCGTGAAG TTGATAAGCC TTTCTTAATG CCTATCGAAG 3 50 

15 ACGTATTCTC TATTTCTGGT CGTGGTACTG TGGTCACAGG ACGTATCGAG 400 

CGTGGAATCG TTAAAGTGGG TGATAAAGTA CAGATTGTTG GTTTAAGAGA 450 

TACTAGAGAG ACAATTGTTA CCGGTGTGGA AATGTTCAGA AAAGAACTTC 500 

CAGAAGGTCA AGCAGGGGAA AACGTTGGTT TGCTCCTCAG AGGTATCGGT 550 

AAGAATGACG TTGAACGTGG TATGGTTATC TGCCAACCTA ATAGCGTGAA 600 

20 ATCTCACACA CAATTTAAAG GTGCTGTCTA CATTCTACAA AAAGAAGAGG 650 

GTGGACGTCA TAAACCTTTC TTTACCGGAT ACAGACCTCA GTTCTTCTTC 700 

CGTACAACAG ATGTTACAGG TGTTGTAACT CTCCCAGAAG GTACAGAGAT 750 

GGTTATGCCA GGCGATAACG TTGAATTCGA AGTTCAATTA ATTAGCCCAG 800 

TAGCTCTAGA AGAAGGTATG AGATTT 826 

25 



2) INFORMATION FOR SEQ ID NO: 22 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
40 (B) STRAIN: LGV 12 

(x'i) SEQUEN^ 22 " 

GGGGCTATTC TAGTAGTTTC TGCAACAGAC GGAGCTATGC CTCAAACTAA 50 

45 AGAGCATATT CTTTTGGCAA GACAAGTTGG GGTTCCTTAC ATCGTTGTTT 100 

TTCTCAATAA AATTGACATG ATTTCCGAAG AAGACGCTGA ATTGGTCGAC 150 

TTGGTTGAGA TGGAGTTGGC TGAGCTTCTT GAAGAGAAAG GATACAAAGG 200 

GTGTCCAATC ATCAGAGGTT CTGCTCTGAA AGCTTTGGAA GGGGATGCTG 250 

CATACATAGA GAAAGTTCGA GAGCTAATGC AAGCCGTCGA TGATAATATC 300 

50 CCTACTCCAG AAAGAGAAAT TGACAAGCCT TTCTTAATGC ' CCATTGAGGA 350 

CGTGTTCTCT ATCTCCGGAC GAGGAACTGT AGTAACTGGA CGTATTGAGC 400 

GTGGAATTGT TAAAGTTTCC GATAAAGTTC AGTTGGTCGG TCTTAGAGAT 450 

ACTAAAGAAA CGATTGTTAC TGGGGTTGAA ATGTTCAGAA AAGAACTCCC 500 

AGAAGGTCGT GCAGGAGAGA ATGTTGGATT GCTCCTCAGA GGTATTGGTA 550 

55 AGAACGATGT GGAAAGAGGA ATGGTTGTTT GCTTGCCAAA CAGTGTTAAA 600 

CCTCATACAC GGTTTAAGTG TGCTGTTTAC GTTCTGCAAA AAGAAGAAGG 650 

TGGACGACAT AAGCCTTTCT TCACAGGATA TAGACCTCAA TTCTTCTTCC 700 

GTACAACAGA CGTTACAGGT GTGGTAACTC TGCCTGAGGG AGTTGAGATG 750 

GTCATGCCTG GGGATAACGT TGAGTTTGAA GTGCAGTTGA TTAGCCCTGT 800 

60 GGCTTTAGAA GAAGGTATGA GA 822 

15 
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2) INFORMATION FOR SEQ ID NO: 23 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOG Y: Linea r 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
15 (A) ORGANISM: Chryseobacterium meningosepticum 

(B) STRAIN: CDC B7681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

20 CGGAGCTATC TTAGTATGTG CTGCTACAGA TGGTCCAATG CCTCAAACTA 50 

GAGAACACAT CCTACTTTGC CGTCAGGTAA ACGTACCTAG AATTGTTGTG 100 

TTCATGAACA AAGTTGACAT GGTAGATGAT CCAGAATTGT TAGAGCTTGT 150 

TGAGCTTGAA CTTAGAGATC TATTATCTAC TTACGAATAT GATGGTGATA 200 

ACTCTCCAGT AATTCAAGGT TCTGCTCTTG GTGCTCTTAA CGGTGATGCT 250 

25 AAGTGGGTAG CTACTGTAGA AGCTCTAATG GATGCTGTTG ATACTTGGAT 300 

CGAGCAACCA GTAAGAGATT CTGATAAGCC ATTCCTTATG CCAATCGAAG 350 

ACGTATTCTC TATTACAGGT AGAGGTACTG TAGCAACTGG TAGAATCGAG 400 

GCTGGTGTAA TCAACACAGG TGATCCTGTT GACATCGTAG GTATGGGTGA 450 

CGAGAAGTTA ACTTCTACTA TTACAGGTGT TGAGATGTTT AGAAAAATCC 500 

30 TAGACAGAGG TGAAGCTGGT GATAACGTAG GTCTATTGTT GAGAGGTATT 550 

GAAAAGACTG ACATCAAGAG AGGTATGGTT ATCGCTAAGA AAGATTCAGT 600 

TAAGCCACAC AAGAAATTCA AAGCTGAGGT TTATATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCCACAACA AATACCGTCC TCAGTTCTAT 700 

GTAAGAACTA CTGACGTTAC AGGTGAAATC TTCTTACCAG AAGGTGTAGA 750 

35 AATGGTAATG CCTGGTGATA ACTTAACTAT CACTGTAGAA TTGTTACAAC 800 

CAATCGCTCT TAACGAGGGT CTTAGATTCG CGATC 835 



40 2) INFORMATION FOR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter amalonaticus 

(B) STRAIN: ATCC 25405 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

55 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 
GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 
TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 
AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 
60 ACACCCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

16 
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GAGTGGGAAG CGAAAATCAT CGAACTGGCC GGCTTCCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00- 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

5 GACTGCCAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCWCCATCAA 600 

GCCGCACACC ATGTTCGAAT CYGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

10 CG TACAACTG A CGTGACTGG CACCA TCGAA CT G CCGGAAG G CGTTGAGA T 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGG 816 



15 

2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter braakii 

(B) STRAIN: ATCC 43162 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

CGCGATCCTG GTTGTTGCTG CAACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGY CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTAGA 150 

35 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAWGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATCCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCT ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 40 0 

40 GGTATCATCA AAGTTGGTGA AGAAGTTGAA ATCGTTGGTA TCAARGACAC 450 

TGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG T ATCAAGCGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGTAC CATCGAACTG CCGGAAGGCG TTGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCGC 825 



2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

17 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 

(B) STRAIN: ATCC 27156 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

1.0_ __TT_CCTGAACA .AAT5CGACAT_G_GTTj3A 15_0_ 

TGAGATGGAA GTGCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCT 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATYGTTG GTATCAAAGA 450 

GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GYTCCATCAA 600 

20 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATYCTGTCY AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 

25 



2) INFORMATION FOR SEQ ID NO : 27 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citroibacter farmeri 
40 (B) STRAIN: ATCC 51112 

{ xi j sequence" des!:rTpYio¥:~~ seq id" n67 2 7~ " 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTAGA 150 

GATGGAAGTT CGTGAACTGC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCAGGC TTCCTGGATT CTTACATCCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCATCA AAGTGGGTGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 

TGCCAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCW CCATCAAGCC 600 

RCACACTATG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACGACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGTG TTGAGATGGT 750 

TATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCGATCG 800 

CGATGGACGA CGGTCTGCGT TTCGCAA 827 

18 
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2) INFORMATION FOR SEQ ID NO: 28 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

10 (.D.) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: CitroJbacter freundii 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

20 CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACTCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGAAGC AGAGTGGGAA 250 

25 GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCAGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCTATCGAA GACGTATTCT 3 50 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTGCTAA 4 50 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

30 GTGCTGGTGA GAACGTTGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCTCTATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGACGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

35 GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCGA 797 



2) INFORMATION FOR SEQ ID NO: 29 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter sedlakii 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

CCGCAGACCC 50 

CATCATCGTG 100 

TGGAACTGGT 150 

CCGGGCGACG 200 

AGGCGACGCA 250 

ATTCTTACAT 300 



CGGCGCGATC CTGGTTGTTG 
GTGAGCACAT CCTGCTGGGT 
TTCCTGAACA AATGCGACAT 
AGAGATGGAA GTTCGTGAAC 
ACACGCCGAT CGTTCGTGGT 
GAGTGGGAAG CGAAAATCAT 



CCGCGACTGA CGGCCCGATG 
CGTCAGGTAG GCGTTCCGTA 
GGTTGATGAC GAAGAGCTGC 
TGCTGTCTCA GTACGATTTC 
TCAGCTCTGA AAGCGCTGGA 
CGAACTGGCT GGCTTCCTGG 
19 
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TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

5 ACGAAGGCCG TGCGGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCGAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA TATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

10 GGTAATGCCG GGCGACAACA TCAAAATGGT TGT TAC CCTG A TCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTC 826 



15 2) INFORMATION FOR SEQ ID NO : 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 30 



GCGATCCTGG TTGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GATTTCCCGG GCGACGACAC 200 

35 TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GAAGCAGAGT 250 

GGGAAGCGAA AATCATCGAA CTGGCTGGCT TTCTGGATTC TTACATCCCG 300 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTRCCTA TCGAAGACGT 350 

ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 4 00 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGACACC 450 

40 GCTAAGTCTA CCTGTACCGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCT GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCTC TATCAAGCCG 600 

CACACCAAGT TCGAATCTGA AGTGTACATC CTGTCCAAAG ACGAAGGCGG 650 

CCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATYAA AATGGTTGTT ACYCTGATCC ACCCGATCGC 800 

GATGGACGAC GGTCTGCGTT TCG 823 



2) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 2 9935 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

10 _ GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 200 

TACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG "2T0" 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCGGAACCAG AACGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACTGGT CGTGTAGAAC 400 

15 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCCAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

20 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACGGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCGATGGAT GACGGTCTGC GTTTCG 826 



25 



2) INFORMATION FOR SEQ ID NO : 32 



( i ) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32 



CGGAGCTATA TTAGTTTGTT CAGCAGCTGA TGGTCCAATG CCTCAAACAA 50 

GAGAGCACAT CTTATTATCA TCAAGAGTTG GAGTTGACCA CATCGTAGTA 100 

TTCTTAAACA AAGCAGATAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTAGAGAGT TATTAAGCGA GTACAACTTC CCAGGAGACG 200 

AYATTCCAGT AATCAARGGA TCAGCTTTAG TAGCATTAGA AAACCCAACT 250 

GACGAAGCTG CAACAGCTTG TATCAGAGAG TTAATGGATG CTGTAGATAG 3 00 

CTACATCCCA ACACCAGAAA GAGCAACAGA TAAGCCATTC TTAATGCCAG 350 

TAGAGGACGT ATTCACAATC ACTGGTAGAG GAACAGTTGC AACAGGAAGA 400 

GTTGAAAGAG GAGTTCTACA TGTAGGAGAC GAAGTAGAAG TAATCGGATT 450 

AACTGAAGAA AGAAGAAAAA CTGTTGTAAC AGGAATCGAA ATGTTCAGAA 500 

AGTTATTAGA TGAAGCACAA GCTGGAGATA ACATCGGAGC ATTATTAAGA 550 

GGTATCCAAA GAACTGAYAT CGAAAGAGGT CAAGTTTTAG CTCAAGTTGG 600 

AACAATCAAC CCACACAAAA AATTCGTAGG TCAAGTATAC GTACTTAAAA 650 

AAGAAGAAGG TGGAAGACAT ACTCCATTCT TCGATGGATA CAGACCACAA 700 

TTCTACTTCA GAACAACAGA CGTTACAGGA TCAATCAAAT TACCAGAAGG 7 50 

AATGGAAATG GTTATGCCTG GAGACCACAT CGACATGGAA GTTGAATTAA 800 

TCACAGAAAT CGCTATGGAY GAAGGATTAA GATTCGCTAT C 841 



21 
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2 ) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 
15 (B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CGGCGCCATC CTGGTGTGCT CGGCCGCTGA CGGCCCCATG CCCCAGACCC 50 

20 GCGAGCACAT CCTGCTGGCC CGTCAGGTGG GCGTGCCCTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTGGACGAC GAAGAGCTGC TGGAACTGGT 150 

CGAAATGGAA GTGCGCGAGC TGCTTGCCAA GTACGACTTC CCCGGCGACG 200 

ACACCCCCAT CATCCGCGGC TCGGCCAAGC TGGCCCTGGA AGGCGACCAG 250 

TCCGACAAGG GCGAACCTGC CATCCTGCGC CTGGCTGAAG CACTGGACTC 300 

25 CTACATCCCC ACGCCCGAGC GCGCTGTGGA CGGCGCCTTT GCAATGCCCG 350 

TGGAAGACGT GTTCTCGATC TCTGGCCGTG GCACCGTGGT GACTGGCCGT 4 00 

ATCGAGCGCG GCATCATCAA GGTCGGCGAA GAAATCGAAA TCGTCGGTAT 450 

CCGCGACACC CAGAAGACCA TCGTCACCGG CGTGGAAATG TTCCGCAAGC 500 

TGCTGGACCA AGGTCAAGCT GGCGACAACG TGGGTCTGCT GCTGCGCGGC 550 

30 ACCAAGCGTG AAGACGTGGA ACGCGGCCAA GTGCTGTGCA AGCCCGGCTC 600 

CATCAAGCCC CACACCCACT TCACGGCTGA GGTGTACGTG CTGTCCAAGG 650 

ACGAAGGTGG TCGCCACACT CCGTTCTTCA ACAACTACCG TCCCCAGTTC 700 

TATTTCCGTA CGACCGACGT GACCGGCTCC ATCGAGCTGC CCGCCGACAA 750 

GGAAATGGTG ATG'CCTGGCG ACAACGTGTC GATCACCGTC AAGCTGATCG 800 

35 CCCCCATCGC CATGGAAGAA GG 822 



2 ) INFORMATION FOR SEQ ID NO: 34 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) ~ LENGTH™ 7 02 £>a se S 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium bovis 

(B) STRAIN: ATCC 7715 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

GCCGCAGACC CGTGAGCACG TCCTCCTGGC CCGTCAGGTC GGTGTGCCCT 50 

ACATCCTCGT CGCCCTCAAC AAGTGCGACA TGGTCGACGA CGAGGACCTC 100 

ATCGAGCTCG TCGAGATGGA GGTCCGTGAG CTCCTCGCCG AGCAGGACTA 150 

CGACGAGGAC GCCCCGATCA TCCACATCTC CGCCCTCAAG GCCCTCGAGG 200 

GTGACCCGGA GTGGACGCAG CGCATCGTCG ACCTCATGAA GGCCTGCGAC 250 

GACGCCATCC CGGATCCGGA GCGCGAGACG GACAAGCCGT TCCTCATGCC 300 

22 
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GATCGAGGAC ATCTTCACGA TCACCGGCCG 
GTGTCGAGCG TGGCATCCTC AACGTCAACG 
ATCTGCGAGA ACTCCCAGAA GACGACCGTC 
CAAGTTCCTC GACACGGCCG AGGCCGGCGA 
5 GTGGCCTGAA GCGCGAGGAC GTCGAGCGTG 
GGCGCCTACA CGCCGCACAC CGAGTTCGAG 
CAAGGACGAG GGTGGCCGCC ACACGCCGTT 
AGTTCTACTT CCGGACGACC GACGTCACCG 
GG 



CGGCACCGTC GTCACGGGCC 350 

AGGAGGTCGA GATCCTGGGT 400 

ACCTCCATCG AGATGTTCAA 450 

CAACGCCGCC CTGCTGCTCC 500 

GCCAGATCGT GGCCAAGCCG 550 

GGCTCCGTGT ACATCCTCTC 600 

CTTCGACAAC TACCGTCCGC 650 

GCGTCGTCAA GCTGCCGGAG 700 

702 



2) INFORMATION FOR SEQ ID NO: 35 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium cervicis 
25 (B) STRAIN: NCTC 10604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

GGCTCAGACC CGCGAGCACG TTCTGCTTGC TCGCCAGGTT GGCGTTCCGA 50 

30 CGATCCTGGT TGCCCTCAAC AAGGCCGATA TGGTCGACGA TGAGGAAATG 100 

CTGGAGCTCG TTGAGGAAGA GTGCCGCGAC CTGCTCGAGT CCCAGGACTT 150 

CGATCGTGAC GCCCCGATCA TCCAGGTTTC CGCGCTGAAG GCTCTCGAAG 200 

GTGATCCGCA GTGGGTTGCT AAGGTCGAGG AGCTCATGGA GGCAGTCGAC 250 

ACCTTCGTGC CGACTCCTGA GCGCGACATG GACAAGCCGT TCCTCATGCC 3 00 

35 GATCGAAGAC GTCTTCACCA TCACCGGCCG TGGCACCGTT GTTACCGGTC 350 

GTGTTGAGCG TGGCAAGCTC CCGATCAACT CTGAGGTTGA AATCCTCGGT 400 

ATCCGCGAAC CGCAGAAGAC CACCGTTACC GGTATCGAGA TGTTCCACAA 4 50 

GTCCATGGAT GAAGCATGGG CAGGCGAGAA CTGTGGTCTC CTCCTGCGTG 500 

GCACCAAGCG CGATGAGGTT GAGCGCGGTC AGGTCGTTGC CGTTCCCGGT 550 

40 TCGATCACCC CGCACACCAA CTTCACCGGA CAGGTCTACA TCCTCAAGAA 600 

GGAAGAAGGC GGTCGTCACA ACCCGTTCTT CTCGAACTAC CGTCCGCAGT 650 

TCTACTTCCCf CACCAC~GGAC GTGACCGGCcT TCATCACCCf " 6 8 V 



45 



55 



2) INFORMATION FOR SEQ ID NO: 36 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 804 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium flaveacena 

(B) STRAIN: ATCC 10340 

60 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

23 
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.LO- 



IS 



GGTTGTTGCT 
TTCTGGCTCG 
TGCGACATGG 
CCGCGAACTG 
ACATCTCCGC 
ATCGTCGACC 
CGAGACCGAC 
CCGGCCGCGG 
GTCAACGAGG 
CACCGTTACC 
CTGGCGACAA 
GAGCGCGGCC 
GTTCGAGGGT 
CCCCGTTCAT 
GTGACCGGCG 
CGACAACGTT 
GGGC 



GCAACCGATG 
CCAGGTTGGC 
TTGATGATGA 
CTCGCTGAGC 
TCTCAAGGCT 
TCATGCAGGC 
AAGCCCTTCC 
TACCGTTGTT 
ATGTTGAGAT 
GGTATCGAAA 
CTGTGGTCTG 
AGGTTGTTAT 
TCCGTCTACG 
GGACAACTAC 
TTGTTCACCT 
GATATGACCG 



GTCCTATGCC 

GTTCCTTACA 

GGAAATCATC 

AGGACTACGA 

CTTGAGGGTG 

CTGCGATGAC 

TCATGCCTAT 

ACCGGCCGTG 

_CATCGG_CAT_C_ 

TGTTCCGCAA 

CTTCTGCGTG 

CAAGCCGGGC 

TCCTCAAGAA 

CGTCCGCAGT 

GCCTGAGGGC 

TTGAGCTCAT 



GCAGACCCGC 
TCCTCGTTGC 
GAGCTCGTTG 
CGAGGATGCC 
ACGAGAAGTG 
TCCATTCCGG 
CGAGGACATC 
TTGAGCGTGG 
AAGGAGAAGT 
GATGATGGAC 
GTACCAAGCG 
GCCTACACCC 
GGAAGAGGGC 
TCTACTTCCG 
ACCGAGATGG 
CCAGCCCGTC 



GAGCACGTTC 

TCTTAACAAG 

AGATGGAAAT 

CCCATCATCC 

GGTACAGGCC 

ATCCGGAGCG 

TTCACCATCA 

CGTTTTGAAG 

_CCAT_CT_CCAC_ 

TACACCGAGG 

TGAAGAGGTC 

CCCACACCAA 

GGCCGCCACA 

TACCACTGAC 

TCATGCCTGG 

GCTAGGATGA 



50 
100 
150 
200 
250 
300 
350 
400 
_45_0 
500 
550 
600 
650 
700 
750 
800 
8 04 



20 



25 



30 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium kutscheri 

(B) STRAIN: ATCC 15677 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

TGCCTCAGAC CCGTGAGCAC GTTCTTCTTG CTCGCCAGGT TGGCGTTCCT 
TACATCCTCG TTGCTCTTAA CAAGTGCGAC ATGGTTGACG ATGAGGAAAT 
CATCGAGCTC GTTGAGATGG AAGTTCGCGA GCTTCTTGCT GAGCAGGAGT 
ACGATGAAGA GGCTCCAATC ATCCACATCT CTGCTTTGAA GGCTCTTGAG 
GGCGACGAGA AGTC^ACTCA GGCCA^GATC" GACCTCATGC* AGGCTTGTCA 
TGACTCCATC CCAGATCCAG AGCGTGAGAC CGACAAGCCA TTCCTCATGC 
CTATCGAGGA TATCTTCACC ATCACCGGTC GTGGCACCGT TGTTACCGGT 
CGTGTTGAGC GCGGTTCCTT GAAGGTGAAT GAGGACGTCG AGATCATCGG 
CATCAAGGAG AAGTCCACCA CTACTACCGT TACCGGTATC GAAATGTTCC 
GTAAGCTTCT TGATTACACC GAAGCTGGCG ATAACTGTGG TCTGCTTCTT 
CGTGGTATCA AGCGCGAAGA CGTTGAGCGT GGTCAGGTTG TTGTTAAGCC 
AGGCGCTTAC ACACCTCACA CCGAGTTCGA GGGCTCTGTT TACGTTCTTT 
CCAAGGACGA GGGCGGCCGC CACACCCCAT TCTTCGACAA CTACCGTCCA 
CAGTTCTACT TCCGCACCAC TGACGTTACC GGTGTTGTGA AG 



50 
100 
150 
200 



250 
300 
350 
400 
450 
500 
550 
600 
650 
692 



55 2} INFORMATION FOR SEQ ID NO: 38 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

24 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium minutissimum 

(B) STRAIN: ATCC 23348 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

10 

CCTGGTTGTT GCTGCAACCG ATGGCCCGAT GCCGCAGACC CGCGAGCACG 50 

TTCTTCTGGC CCGCCAGGTT GGCGTTCCGT ACATCCTCGT TGCACTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAAATC ATCGAGCTCG TTGAGATGGA 150 

GATCCGTGAG CTGCTCGCTG AGCAGGACTA CGACGAGGAA GCTCCGATCG 200 

15 TTCACATCTC CGCTCTGAAG GCTCTTGAGG GCGACGAGAA GTGGGCACAG 250 

TCCATCGTTG ACCTGATGCA GGCTTGCGAT GACTCCATCC CGGATCCGGA 3 00 

GCGCGAGCTG GACAAGCCGT TCCTGATGCC GATCGAGGAC ATCTTCACCA 3 50 

TTACCGGCCG CGGTACCGTT GTTACCGGCC GTGTTGAGCG TGGCTCCCTG 400 

AACGTTAACG AGGACATCGA GATCATCGGT ATCAAGGACA AGTCCATGTC 450 

20 CACCACCGTT ACCGGTATCG AGATGTTCCG CAAGATGATG GACTACACCG 500 

AGGCTGGCGA CAACTGTGGT CTGCTTCTGC GTGGTACCAA GCGTGAAGAG 550 

GTTGAGCGTG GCCAGGTTTG CATCAAGCCG GGCGCTTACA CCCCGCACAC 600 

CAAGTTCGAG GGTTCCGTCT ACGTCCTGAA GAAGGAAGAG GGCGGCCGCC 650 

ACACCCCGTT CATGGACAAC TACCGTCCGC AGTTCTACTT CCGCACCACC 700 

25 GACGTCACCG GTGTCATCAA GCTGCCGGAG GGCACCGAGA TGGTCATGCC 750 

GGGCGACAAC GTTGAGATGT CCGTAGAGCT GATCCAGCCG GTCGCTA 797 



30 2) INFORMATION FOR SEQ ID NO: 39 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium mycetoides 
(B) ' STRAIN V ATCC 2 1134 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 



GCCGCAGACC CGCGAGCACG 
ACATCCTCGT TGCGCTGAAC 
ATCGAGCTCG TGGAGATGGA 
CGACGAGGAC GCCCCCATCA 
GCGACGAGAA GTGGGTTCAG 
GACTCCATCC CGGATCCGGT 
GATCGAGGAC ATCTTCACCA 
GTGTGGAGCG CGGCGTGCTC 
ATCCGCGACA AGTCCCAGAA 
CAAGCTGCTC GATACCGCTG 
GCGGTCTGAA GCGCGAGGAC 
GGCGCCTACA CCCCGCACAC 
CAAGGACGAG GGCGGCCGCC 
AGTTCTACTT CCGCACCACC 
GG 



TTCTTCTGGC CCGCCAGGTC 
AAGTGCGACA TGGTTGATGA 
GGTCCGTGAG CTGCTCGGCG 
TCCACATCTC CGCTCTGAAG 
TCCGTGCTCG ACCTCATGCA 
CCGCGAGACC GACCGCGACT 
TCTCCGGCCG CGGCACCGTG 
AACCTCAACG ACGAGGTCGA 
GACCACCGTC ACCTCCATCG 
AGGCAGGCGA CAACGCGGCT 
GTCGAGCGTG GCCAGGTTGT 
CAAGTTCGAG GGTTCCGTCT 
ACACCCCGTT CTTCGACAAC 
GACGTGACCG GTGTTGTGAA 

25 



GGCGTCCCCT 50 

TGAGGAGATC 100 

AGCAGGACTA 150 

GCTCTCGAGG 200 

GGCGTGCGAC 250 

TCCTGATGCC 3 00 

GTTACCGGTC 350 

GATCATCGGC 400 

AGATGTTCAA 450 

CTGCTGCTCC 500 

CATCAAGCCG 550 

ACGTCCTGTC 600 

TACCGTCCGC 650 

GCTGCCGGAG 700 
702 
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2) INFORMATION FOR SEQ ID NO: 40 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
_1 0 ( P ) TOPOL OGY: Linear 



35 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Corynebacterium pseudogeni'talium 

(B) STRAIN: ATCC 33038 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

20 GCTCGCCAGG TTGGCGTTCC TTACATCCTC GTTGCGCTGA ACAAGTGCGA 50 

CATGGTTGAT GATGAGGAAA TCATCGAGCT CGTTGAGATG GAGATCCGTG 100 

AGCTGCTCGC AGAGCAGGAT TACGATGAGG AAGCTCCTAT CGTTCACATC 150 

TCCGCTCTGA AGGCCCTCGA GGGCGATGAC AAGTGGGTAC AGTCCGTCGT 200 

TGATCTGATG GAAGCCTGCG ACAACTCCAT CCCGGATCCG GAGCGCGCTA 250 

25 CCGACCAGCC GTTCCTGATG CCTATCGAGG ACATCTTCAC CATTACCGGC 300 

CGCGGTACCG TTGTTACCGG CCGTGTTGAG CGTGGCCGTC TGAACGTCAA 350 

CGAGGACGTT GAGATCATCG GTATCCAGGA GAAGTCCCAG ACCACCACCG 4 00 

TTACCGGTAT CGAGATGTTC CGCAAGATGA TGGACTACAC CGAGGCTGGC 450 

GACAACTGTG GTCTGCTTCT GCGTGGTACC AAGCGTGAGG ACGTTGAGCG 500 

30 TGGCCAGGTT GTTATCAAGC CGGGCGCTTA CACCCCGCAC ACCAAGTTCG 550 

AGGGCTCCGT CTACGTCCTG AAGAAGGAAG AGGGCGGCCG CCACACCCCG 600 

TTCATGAACA ACTACCGTCC GCAGTTCTAC TTCCGTACCA CGGACGTTAC 650 

CGGTGTTGTT CACCTGCCAG AGGG 674 



2) INFORMATION FOR SEQ ID NO: 41 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 694 bases 

(B) TYPE: Nucleic acid 

(C) S TRANDEDNE SS~: Do'uble" 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CoryneJbacteriujn renale 

(B) STRAIN: ATCC 19412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



TGCCTCAGAC CCGTGAGCAC GTTCTGCTTG CTCGTCAGGT CGG CGTTCCT 50 

TACATCCTCG TTGCACTGAA CAAGTGCGAC ATGGTCGACG ACGAAGAAAT 100 

55 CATCGAGCTC GTCGAGATGG AAATCCGTGA ACTGCTCGCA GAGCAGGACT 150 

ACGATGAGGA AGCTCCTATC GTTCACATCT CCGCTCTGGG CGCCCTGAAC 200 

GGCGAGCAGA AGTGGGTTGA CTCCATCGTC GAACTGATGG AAGCTTGCGA 250 

CAACTCCATC CCAGACCCAG TTCGCGACAT CGACCACCCA TTCCTGATGC 300 

CTATCGAGGA CATCTTCACC ATTACCGGTC GCGGTACCGT TGTTACCGGC 350 

60 CGTGTCGAGC GTGGCCGTCT CAACGTCAAC GAAGAAGTTG AGATCATCGG 400 

26 
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TATCAAGGAC AAGTCCCAGA AGACCACCGT 
GCAAGATGCT GGACTACACC GAAGCTGGCG 
CGCGGCATCG GCCGTGAGGA TGTCGAGCGT 
AGGCGCTTAC ACCCCTCACT CTGAGTTCGA 
5 CCAAGGACGA GGGTGGCCGC CACACCCCAT 
CAGTTCTACT TCCGCACCAC CGACGTGACC 



CACCGGTATC GAGATGTTCC 450 

ACAACTGTGG TCTGCTGCTC 500 

GGCCAGGTTA TCATCAAGCC 550 

GGGCTCTGTC TACGTCCTGT 600 

TCTTCGACAA CTACCGTCCA 650 

GGCGTTGTGC ACCT 694 



_10 2 )_INFORMAT.ION_F_OR__SEQ_.ID„NO_: _42 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



GCCGCAGACC CGCGAGCACG TTCTGCTGGC TCGCCAGGTT GGCGTTCCKT 50 

ACATCCTSGT TGCACTGAAC AAGTGCGACA TGGTTGACGA TGAGGARCTC 100 

CTSGAGCTCG TCGAGATGGA GGTCCGCGAG CTGCTGGCTG AGCAGGACTA 150 

CGACGAGGAA GCTCCGRTCG TTCACATCTC CGCWCTGAAC GCCCTGGACG 200 

30 GCGACSAGAA GTGGGCTVAC TCCATCCTCG AGCTGATGCA GGCTTGCGAC 250 

GAGTCCATCC CGGATCCGGA GCGCGAGACC GACAAGCCGT TCCTGATGCC 300 

GATTGAGGAC ATCTTCACCA TTACCGGTCG CGGYACCGTT GTTACCGGCC 350 

GTGTTGAGCG TGGCDTCCTG AACGTSAACG ACGASGTTGA GATCATGGGY 40 0 

ATCCGGGAGA AGTCCCAGAA GACCACCGTY ACCKSCATCG AGATGTTCAA 450 

35 CAAGMTGMTG GACWCCGCAG AGGCTGGCGA CAACGCTGSW CTGCTGCTGC 500 

GTGGTMTSAA GCGTGAGGAC GTTGAGCGTG GCCAGATCAT CGYTAAGCCG 550 

GGCGCKTACA CCCCGCACAC CGAGTTCGAG GGCTCCGTCT ACGTCCTGTC 600 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGCACCACC GACGTSACCG GTGTTGT 687 

40 



2) INFORMATION FOR SEQ ID NO: 43 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CoryneJbacteriuJT? urealyticum 
55 (B) STRAIN: ATCC 43042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC GTGAGCACGT 50 
60 TCTGCTGGCT CGCCAGGTTG GCGTTCCGTA CATCCTCGTT GCACTGAACA 100 

27 
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AGTGCGACAT GGTTGACGAT GAGGAGCTCC TCGAGCTCGT CGAGATGGAG 150 

GTCCGCGAGC TTCTGGCTGA GCAGGACTAC GACGAGGAGG CTCCGGTCGT 200 

CCCGATCTCC GCACTGGGCG CCCTGGACGG CGATCAGAAG TGGGTCGACT 250 

CCATCCTCGA GCTCATGAAG GCTTGCGACG AGTCCATCCC GGACCCGGAG 300 

5 CGCGAGACCG ACAAGCCGTT CCTGATGCCG GTTGAGGACA TCTTCACCAT 350 

TACCGGTCGC GGCACCGTCG TTACCGGCCG TGTTGAGCGT GGCGTCCTGA 4 00 

ACCTGAACGA CGAGGTCGAG ATCCTGGGCA TCCGCGAGAA GTCCACCAAG 450 

ACCACCGTCA CCTCCATCGA GATGTTCAAC AAGCTGCTGG ACACCGCAGA 500 

GGCTGGCGAC AACGCTGCAC TGCTGCTGCG TGGTCTGAAG CGTGAGGACG 550 

-10 TCGAGCGAGG. JTCAGATjCATC_GCTAAGC£^ 600_ 

GAGTTCGAGG GCTCCGTCTA CGTCCTGTCC AAGGACGAGG GCGGCCGTCA 650 

CACCCCGTTC TTCGACAACT ACCGTCCGCA GTTCTACTTC CGTACCACCG 700 

ACGTCACCGG TGTCGTTACC CTGCCAGAGG GCACCGACAT GGTCATGCCG 750 

GGCGACAACG TTGAGATGAG CGTCAAGC 778 

15 



2) INFORMATION FOR SEQ ID NO: 44 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 703 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium xerosis 
3 0 (B) STRAIN: ATCC 373 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 



CGCAGACCCG TGAGCACGTC CTCCTGGCCC GCCAGGTCGG CGTCCCCTAC 50 

35 ATCCTCGTCG CCCTGAACAA GTGCGACATG GTCGACGATG AGGAGATCAT 100 

CGAGCTCGTG GAGATGGAGG TGCGTGAGCT TCTCGCCGAG CAGGACTACG 150 

ACGAGGAGGC CCCGATCGTG CACATCTCCG CCCTGGGCGC CCTCAATGGC 200 

GAAGAGAAGT GGGTCGACTC CATCGTCGAG CTCATGAACG CCGTCGACGA 250 

GAACGTTCCG GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCCG 3 00 

40 TCGAGGACAT CTTCACCATC ACCGGCCGCG GCACCGTCGC CACCGGTCGC 350 

GTGGAGCGCG GCACCCTGAA GGTCAACGAC GAGGTCGAGA TCCTGGGCAT 400 

CCAGGAGAAG TCCCAGACCA CCACCGTCAC CGGCATCGAG ATGTTCCGCA 450 

AGCTGCTGGA CTCCGCCGAG GCCGGCGACA ACTGTGGCCT GCTGCTCCGC 50 0 

GGCATCAAGC GCGAGGACAT CGAGCGCGGC CAGATCATCG CGAAGCCGGG 550 

CGCCTACACC CCGCACACCG AGTTCGAGGG CTCCGTCTAC ATCCTGGCCA 600 

AGGACGAGGG CGGCCGCCAC ACCCCGTTCT TCGACAACTA CCGTCCGCAG 650 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCGTGAAGC TGCCGGAGGG 700 

CAC 703 



2) INFORMATION FOR SEQ ID NO: 45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

28 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 



GGAGCGATAT 
GGAACACATT 

_1.0 ACTTGAACAA 

GAAATGGAAG 
GACGCCGATA 
GTGAGGTTGG 
TACTTCCCGC 

15 CGAAGATGTG 
TAGAGCGAGG 
AAGGACACGA 
ATTGGATGAA 
CGAAACGCGA 

20 ATCACGCCAC 
AGAAGGGGGA 
ATTTCCGCAC 
ATAGAGATGG 
TGCGCCGGTA 

25 



TGGTGGTGAG 
GTATTGGCGA 
AGCGGACATG 
TGAGGGATTT 
ATAGTGGGGT 
GGAGCCATCG 
AGCCGGAGCG 
TTTTCGATAT 
GATCATCAAA 
CGAAGACGAC 
GGTCAAGCGG 
AGAAGTGGAG 
ACAAGAAATT 
CGCCACACAC 
GACGGACGTG 
TGATGCCGGG 
GCGATGGATG 



CGCAGCGGAC 
AGCAAGTGGG 
GTGGATGACA 
ATTGAACAGT 
CAGCGTTAAA 
ATAATCAAAT 
AGCGATAGAC 
CGGGCCGAGG 
GTGGGCGACG 
GTGCACGGGC 
GTGACAACGT 
CGTGGTCAAG 
TGAGGCGGAG 
CGTTTTTACA 
ACGGGCCAGT 
AGATAACGTG 
AAGGGCTACG 



GGCCCGATGC 
TGTTCCGAAC 
AAGAGCTGTT 
TATGATTTCC 
GGCGTTAGAA 
TAGTGGAAAC 
AAACCGTTTT 
GACGGTGGTG 
AGATAGAGAT 
GTTGAGATGT 
AGGAATTTTA 
TATTGGCGAA 
ATTTATGTGT 
AGGCTATCGA 
TATTGAGTTT 
AAAGTGACGG 
AT 



CGCAAACGCG 

ATAGTGGTTT 

GGAATTAGTG_ 

CTGGGGATGA 

GGTGACAAGA 

GATGGACACG 

TAATGCCGAT 

ACGGGACGCG 

TGTGGGGATC 

TTCGCAAATT 

TTGAGAGGGA 

ACCGGGATCG 

TGTCGAAGGA 

CCGCAATTTT 

ACCGGAGGGG 

TTGAATTGAT 



50 
100 
150 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
832 



2 ) INFORMATION FOR SEQ ID NO: 4 6 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

(B) STRAIN: ATCC 33379 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 



GGCGCTATCC 

45 TGAGCACATC 
TCCTGAACAA 
GAGATGGAAG 
TACGCCGGTA 
AGTGGGAAGC 

50 CCGG AACCTG 
CGTATTCTCA 
GCGGTATCAT 
ACCACCAAGA 
CGAAGGCCGT 

55 GTGACGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACCACTGA 
GTAATGCCGG 

60 CGCCATGGAC 



TGGTTGTTGC 
CTGCTGGGTC 
GTGCGACATG 
TTCGCGAACT 
ATCCGCGGTT 
GAAGATCATC 
AGCGTGACAT 
ATCTCTGGTC 
CAAGGTAGGC 
CTACCTGTAC 
GCTGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 
CGTGACTGGC 
GCGACAACAT 
GATGGT 



TGCGACTGAC 
GCCAGGTAGG 
GTTGATGACG 
GCTGTCTCAG 
CTGCGCTGAA 
GAACTGGCTG 
CGACAAGCCG 
GTGGTACCGT 
GACGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAGGTACTGG 
AGAAGTGTAC 
TCAAAGGTTA 
ACCATCGAAC 
CAAGATGGTT 



GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGATTTCC 
AGCGCTGGAA 
AAACGCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTAGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATCCTGAGCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 



CGCAGACCCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGAAGCCG 
CTCCTACATT 
CGATCGAAGA 
CGTGTAGAGC 
TATCAAGCCG 
AACTGCTGGA 
GGTACCAAGC 
CACCATCACT 
AGGATGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCCACCCGAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
816 
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2) INFORMATION FOR SEQ ID NO: 47 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

1.0- - (D) TOP-OLGGY-:- .Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Edwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 



20 GGCGCGATCC 
TGAGCACATC 
TCCTGAACAA 
GAGATGGAAG 
CACGCCGGTA 

25 AGTGGGAAGC 
CCGGAACCTG 
CGTATTCTCT 
GCGGTATCAT 
ACCACCAAGA 

30 CGAAGGCCGT 
GTGACGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACTACTGA 

35 GTAATGCCGG 
CGCCATGGAC 



TGGTTGTTGC 
CTGTTGGGTC 
GTGCGACATG 
TTCGCGAACT 
ATCCGCGGTT 
GAAGATCATC 
AGCGTGACAT 
ATCTCTGGCC 
CAAGGTAGGC 
CCACCTGTAC 
GCTGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 
CGTGACTGGT 
GCGACAACAT 
GATGGTCTGC 



TGCGACTGAC 
GCCAGGTAGG 
GTTGATGACG 
GCTGTCTCAG 
CTGCGCTGAA 
GAACTGGCTG 
CGACAAGCCG 
GTGGTACCGT 
GACGAAGTTG 
TGGCGTTGAA 
ACGTTGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
CAAGATGGTT 
G 



GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
AAACTCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATCCTGAGCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 



CGCAGACCCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGAAGCCG 
CTCCTACATC 
CGATCGAAGA 
CGTGTAGAGC 
TATCAAGCCG 
AACTGCTGGA 
GGTACTAAGC 
CACCATCACT 
AGGATGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCCACCCGAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
821 



40 2 ) INFORMATION FOR SEQ ID NO: 48 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 



CGGTGCCATC CTGGTGGTAT 
GCGAACACAT CCTGTTGGCT 
TTCATGAACA AATGCGACAT 
TGAGATGGAA ATCCGCGACC 
ACTGCCCGAT CGTACAAGGT 



CCGCTGCTGA CGGCCCCATG 
CGTCAGGTAG GTGTACCCTA 
GGTAGATGAT GCCGAGCTGC 
TGCTCTCCAG CTATGACTTC 
TCCGCTCTCA AAGCCCTCGA 
30 



CCTCAGACTC 50 

CATCCTCGTA 100 

TTGAGTTGGT 150 

CCTGGTGACG 2 00 

AGGCGATGCC 250 
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GGTTACAAAG AAAAAATCTT CGAACTAGCT GCTGCTTTGG ATAGCTACAT 300 

CCCCACTCCT CAACGTGCTG TAGACAAACC CTTCCTGTTG CCGATCGAAG 350 

ACGTATTCTC TATCTCCGGC CGTGGTACCG TAGTAACCGG TCGTGTAGAG 400 

CGCGGCATCA TCAAAGTAGG TGAAGAGATC GAAATCGTTG GTCTGAAGCC 450 

5 CACTCAGAAA ACTACCTGTA CTGGCGTGGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA GGCCGGTGAC AACGTAGGCG TACTGCTGCG CGGTACCAAA 550 

CGTGAAGAAG TTGAGCGTGG TCAAGTATTG GCTAAACCCG GCACCATCAC 600 

TCCGCACACC AAGTTCAAAG CCGAAGTATA CGTATTGAGC AAAGAAGAAG 650 

GTGGTCGTCA CACCCCGTTC TTTGCCAACT ACCGTCCACA GTTCTACTTC 700 

10 CGTAC TACT G ACGT AACCGG TG CTGTAG AG CTGGAGCCTG GTGTAGAAAT _ 1 50 

GGTTATGCCT GGTGAGAACG TAACCATCAC CGTAGAACTG ATTGCTCCGA 800 

TTGCTATGGA AGAAGGTCTG CGCTTTGCGA 830 



15 



2) INFORMATION FOR SEQ ID NO: 49 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 808 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EnteroJbacter aerogenes 

(B) STRAIN: ATCC 13048 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

35 GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 200 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCRGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

40 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAC 4 50 

ACCGCGAAAA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGYGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAGCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 650 

45 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGRTG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGG 808 



50 



2) INFORMATION FOR SEQ ID NO: 50 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 



CGGCGCGATC 
GTGAGCACAT 
10 TTCCTGAACA 



TGAAATGGAA 
ATACTCCGAT 
GAGTGGGAAG 
CCCGGAACCA 

15 ACGTATTCTC 
CGCGGTATCA 
TACYGCGAAA 
ACGAAGGCCG 
CGTGAAGAAA 

20 GCCGCACACC 
GCGGTCGTCA 
CGTACAACTG 
GGTAATGCCG 
TCGCGATGGA 

25 



CTGGTTGTTG 
CCTGCTGGGT 
A ATGTG ACAT 
GTTCGTGAAC 
CGTTCGTGGT 
CGAAAATCAT 
GAGCGTGCGA 
CATCTCCGGT 
TYAAAGTGGG 
TCAACCTGTA 
TGCTGGTGAG 
TCGAACGTGG 
AAGTTCGAAT 
CACTCCGTTC 
ACGTGACTGG 
GGCGACAACA 
CGACGGTCTG 



CTGCGACTGA 
CGTCAGGTAG 
GGTTGATGAC 



CGGCCCGATG 
GCGTTCCGTA 
GAAGAGCTGC 



TTCTGTCTCA 
TCTGCTCTGA 
CGARCTGGCT 
TTGACAAGCC 
CGCGGTACCG 
CGAAGAAGTT 
CCGGCGTTGA 
AACGTTGGTG 
TCAGGTACTG 
CTGAAGTGTA 
TTCAAAGGCT 
CACCATCGAA 
TCAAAATGGT 
CGTTCGCA 



GTACGATTTC 
AAGCGCTGGA 
GGCCACCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAGCCGG 
CATTCTGTCC 
ACCGTCCSCA 
CTGCCGGAAG 
TGTTACCCTG 



CCGCAGACTC 
CATCATCGTG 
TGGAACTGGT 
CCGGGCGACG 
AGGCGAMGCW 
ATACCTATAT 
CCGATCGAAG 
TCGTGTAGAG 
GTATCAAAGA 
AAACTGCTGG 
TGGTATCAAA 
GCACCATCAA 
AAAGATGAAG 
GTTCTACTTC 
GCGTAGAGAT 
ATCCACCCGA 



50 
100 
150^ 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



.2) INFORMATION FOR SEQ ID NO: 51 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 
40 (B) STRAIN: ATCC 33072 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 



TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACGC 50 

45 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200 

ACACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCTTACAT 300 

50 CCCGGAACCA GAACGTGCTA TCGATAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCCGGC CGTGGTACTG TTGTAACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCAG GCTCAATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA TATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG CACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TTCAGATGGT TGTTACCCTG ATCCACCCAA 800 

60 TCGCGATGGA TGACGGTCTG CGTTT 825 

32 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 52 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterobacter asburiae 

(B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

20 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTT CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ATACTCCAAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

30 ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCGAAGCCAG GCTCAATCAA 600 

GCCACACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

35 GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTCTG CG 822 



40 2) INFORMATION FOR SEQ ID NO: 53 

( i ') SEQUENCE "CHA^cYElTfsYlCS": " 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 

(B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCAGGCGACG 200 

ACACTCCAAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAAGCT 250 
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GAGTGGGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGCGGTATCA 
5 TACTGCKAAA 
ACGAAGGCCG 
CGCGAAGAAA 
GCCACACACC 
GCGGCCGTCA 

_10 CG.TACAAC.TG_ 

GGTAATGCCA 
TCGCGATGGA 



CAAAAATCAT CGAACTGGCT GGCTTCCTGG 
GAGCGTGCGA TTGACAAGCC ATTCCTGCTG 
CATCTCCGGT CGTGGTACCG TTGTTACCGG 
TCAAAGTTGG TGAAGAAGTT GAAATCGTTG 
TCTACCTGTA CTGGCGTTGA AATGTTCCGC 
TGCTGGTGAG AACGTTGGTG TTCTGCTGCG 
TCGAACGTGG TCAGGTTCTG GCGAAGCCAG 
AAGTTCGAAT CTGAAGTGTA CATCCTGTCC 
TACTCCGTTC TTCAAAGGCT ACCGTCCACA 
ACGTGACTO CTGCCAGAAG 
GGCGACAACA TCAAGATGGT TGTGACGCTG ATCCACCCAA 
CGACGGTCTG CGTTTC 



ATTCTTACAT 
CCAATCGAAG 
TCGTGTAGAG 
GTATCAAAGA 
AAACTGCTGG 
TGGTATCAAA 
GCTCAATCAA 
AAAGACGAAG 
GTTCTACTTC 
GCGTAGAGAT 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 



800 
826 



15 



2) INFORMATION FOR SEQ ID NO: 54 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 806 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

GATCCTGGTA GTAGCTGCGA CTGACGGCCC AATGCCTCAG ACTCGTGAGC 50 

ACATCCTGCT GGGTCGTCAG GTAGGCGTTC CTTACATCAT CGTGTTCCTG 100 

AACAAATGCG ACATGGTTGA TGACGAAGAG CTGCTGGAAC TGGTAGAGAT 150 

35 GGAAGTTCGT GAACTGCTGT CTCAGTACGA TTTCCCAGGC GACGATACCC 200 

CAATCGTTCG TGGTTCTGCT CTGAAAGCGC TGGAAGGCGA CGCAGAGTGG 250 

GAAGMGAAAA TCATCGAACT GGCTGGCTAC CTGGATTCTT ACATCCCAGA 300 

ACCAGAGCGT GCGATTGAYA AGCCATTCCT GCTGCCAATC GAAGACGTAT 350 

TCTCCATCTC CGGTCGTGGT ACCGTTGTTA CCGGTCGTGT AGAGCGCGGT 400 

40 ATCATCAAAG TGGGTGAAGA AGTTGAAATC GTTGGTATCA AAGAGACTGC 450 

GAAGTCTACC TGTACTGGCG TTGAAATGTT CCGCAAACTG CTGGACGAAG 500 

GCCGTGCTGG TC^GAaVgTT~ WTCTTCTGCf TC^GTC^TAT TtA^CGTGAA " b b~U~ " 

GAAATCGAAC GTGGTCAGGT TCTGGCGAAG CCAGGCTCAA TCAAGCCACA 600 

CACCAAGTTC GAATCTGAAG TGTACATCCT GTCCAAAGAC GAAGGCGGCC 650 

45 GTCATACTCC GTTCTTCAAA GGCTACCGTC CACAGTTCTA CTTCCGTACA 700 

ACTGACGTGA CCGGTACCAT CGAACTGCCA GAAGGCGTAG AGGTGGTAAT 750 

GCCAGGCGAC AACATCAAGA TGGTTGTGAC TCTGATCCAC CCAATCGCGA 800 

TGGACG 806 



50 



2) INFORMATION FOR SEQ ID NO: 55 



(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

34 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

_10 TTCCTGAACA AGT.GCGACM_GGTTGATGAC_ GAAGAGCTGC JTGGAACTGGT 150_ 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACCCCGAT CGTTCGCGGT TCTGCGCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCCACCTGG ATACCTAYAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATTTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAGGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGCGAAA ACCACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTCGGCG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCCATCAA 600 

20 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTC 826 

25 



2) INFORMATION FOR SEQ ID NO: 56 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 



tXl) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

45 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CAGGCGACGA 200 

CACCCCAATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAMGCAG 250 

AGTGGGAAGM GAAAATCATC GARCTGGCTG GCTTCCTGGA TTCTTACATC 300 

50 CCAGAACCAG AGCGTGCGAT TGACAAGCCA TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTWGAGC 400 

GCGGTATCAT CAAAGTAGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

55 GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CGAAGCCAGG CTCAATCAAG 600 

CCACACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACAT CAAGATGGTT GTGACGCTGA TCCACCCAAT 800 

60 CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 

35 
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2 ) INFORMATION FOR SEQ ID NO: 57 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 ( D) _ TOPOLOGY: Linear „ 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterobacter sakazakii 

(B) STRAIN: ATCC 29544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

20 GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TGCGCGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACCCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCTG 250 

25 AGTGGGAAGC GAAAATCATC GAGCTGGCAG GTCACCTGGA TTCCTACATC 300 

CCGGAACCGG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCYGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTTGGT GAAGAAGTTG AAATCGTGGG CATCAAAGAC 450 

ACCGCGAAAT CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

30 CGAAGGCCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACRACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT C 831 



40 2) INFORMATION FOR SEQ ID NO: 58 



45 



50 



55 



60 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 200 

ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCYTGA AGGCGATGCT 250 
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TCATACGAAG AAAAAATCAT GGAATTAATG GCTGCAGTTG ACGAATACGT 3 00 

TCCAACTCCA GAACGTGACA CTGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTGCTGA 450 

5 AGAAACTGCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGTGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACAAT 600 

CACACCTCAT ACAAAATTTA AAGCTGAAGT TTACGTTTTA ACAAAAGAAG 650 

AAGGTGGACG TCACACACCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 
.10_^T.T_CCGTACAA_CT.GACGTAAC_ TGGTGTTGTT GAATTACCAG AAGGAACTGA _ _ _750_ 

AATGGTTATG CCTGGTGATA ACGTAACAAT CGACGTTGAA TTGATCCACC 8 00 

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CAATT 835 



15 



2) INFORMATION FOR SEQ ID NO: 59 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

GGTGCTATCT TAGTAGTATC TGCTGCTGAT GGTCCTATGC CACAAACTCG 50 

TGAACACATT CTTTTATCAC GTAACGTTGG TGTTCCATAC ATCGTTGTTT 100 

TCTTAAACAA AGTTGATATG GTTGACGACG AAGAATTATT AGAATTAGTT 150 

35 GAAATGGAAG TACGTGACTT ATTAACTGAA TACGACTTCC CAGGAGACGA 200 

TGTTCCTGTA ATCGCTGGTT CTGCATTAAA AGCTTTAGAA GGCGACCCAT 250 

CTTACGAAGA AAAAATCTTA GAATTAATGG CTGCAGTTGA CGAATACATC 300 

CCAACTCCAG AACGTGACAA CGATAAACCA TTCATGATGC CAGTCGAAGA 350 

CGTATTTTCA ATCACTGGTC GTGGTACTGT TGCTACAGGT CGTGTTGAAC 400 

40 GTGGACAAGT ACGTGTTGGT GACGAAGTTG AAATAGTTGG TATCCATGAT 450 

GAAATTTCTA AAACAACAGT TACTGGTGTT GAAATGTTCC GTAAATTATT 500 

AGATTACGCT GAAGCTGGAG ACAACATCGG TGCATTATTA CGTGGTGTGG 550 * 

CTCGTGAAGA TATCCAACGT GGTCAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CAAAATTCAC TGCTGAAGTG TACGTTTTAA CTAAAGAAGA 650 

45 AGGTGGACGT CATACTCCAT TCTTCACTAA CTACCGTCCA CAATTCTACT 700 

TCCGTACAAC TGACGTTACA GGTGTAGTTA ACTTACCAGA AGGTACTGAA 750 

ATGGTTATGC CTGGTGATAA CGTAACTATG GAAGTTGAAT TAATCCACCC 800 

AATCGCTATC GAAGACGGAA CTCGTT 82 6 



50 



2) INFORMATION FOR SEQ ID NO: 60 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

37 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

CGGCGCGATC TTGGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCA CGTAACGTTG GTGTTCCTTA CATCGTCGTT 100 

TTCTTGAACA _AAATJ3GACAT_GGTTGATGAC GAAGAAT TAT TAQAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATACGACTTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 250 

TCATATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATCGCTGA 450 

AGAAACTGCT AAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TGGATTACGC TGAAGCTGGC GACAACATTG GTGCATTATT ACGTGGTGTG 550 

GCTCGTGAAG ATATCCAACG TGGTCAAGTA TTATCAAAAC CAGGTTCAAT 600 

CACTCCACAT ACAAAATTTG CGGCAGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTTACTAT GGACGTTGAA TTAATCCACC 800 

CAATCGCGAT CGAAGACGGT ACTCGTTTCT CAATC 835 



2) INFORMATION FOR SEQ ID NO: 61 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
40 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50 

45 GTGAACATAT CCTATTATCT CGTCAAGTTG GTGTTCCTTA CATCGTYGTA 100 

TTCTTGAACA AAGTAGATAT GGTCGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AATCGCTGGT TCAGCTTTGA AAGCTTTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATCCT TGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

50 CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 350 

ATGTATTCTC RATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACAG ' TTACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGRCTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 550 

55 GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACKCCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

60 CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 835 

38 
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2) INFORMATION FOR SEQ ID NO: 62 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

10 — —(D)- TOPOLOGY_:_Linear__ _ 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

20 AGTAGTTTCT GCTGCTGATG GTCCTATGCC TCAAACACGT GAACATATCT 50 

TATTATCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 10 0 

ATGGATATGG TTGATGACGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTATCAGAAT ACGATTTCCC AGGCGATGAT GTTCCAGTTA 20 0 

TCGCAGGTTC TGCTTTGAAA GCTTTAGAAG GCGACGAGTC TTATGAAGAA 250 

25 AAAATCTTAG AATTAATGGC TGCAGTTGAC GAATATATCC CAACTCCAGA 30 0 

ACGTGATACT GACAAACCAT TCATGATGCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGACG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTGAAGTT 4 00 

CGCGTTGGTG ACGAAGTTGA AATCGTTGGT ATTAAAGACG AAACATCTAA 450 

AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

30 AAGCAGGCGA CAACATCGGT GCTTTATTAC GTGGTGTAGC ACGTGAAGAT 550 

ATCGAACGTG GACAAGTATT AGCTAAACCA GCTACAATCA CTCCACACAC 600 

AAAATTCAAA GCTGAAGTAT ACGTATTATC AAAAGAAGAA GGCGGACGTC 650 

ACACTCCATT CTTCACTAAC TACCGTCCTC 680 

35 



2) INFORMATION FOR SEQ ID NO: 63 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double" 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R487 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



AGTAGTTTCT 
TATTATCACG 

55 ATGGATATGG 
TCGTGACTTA 
TCGCAGGTTC 
AAAATCTTAG 
ACGTGATACT 

60 TCACTGGACG 



GCTGCTGATG 
TAACGTTGGT 
TTGATGACGA 
TTATCAGAAT 
TGCTTTGAAA 
AATTAATGGC 
GACAAACCAT 
TGGTACTGTT 



GTCCTATGCC 
GTACCATACA 
AGAATTATTA 
ACGATTTCCC 
GCTTTAGAAG 
TGCAGTTGAC 
TCATGATGCC 
GCTACAGGCC 
3 



TCAAACACGT 
TCGTTGTATT 
GAATTAGTAG 
AGGCGATGAT 
GCGACGAGTC 
GAATATATCC 
AGTCGAAGAC 
GTGTTGAACG 
9 



GAACATATCT 
CTTAAACAAA 
AAATGGAAGT 
GTTCCAGTTA 
TTATGAAGAA 
CAACTCCAGA 
GTATTCTCAA 
TGGTGAAGTT 



50 
100 
150 
200 
250 
300 
350 
400 
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CGCGTTGGTG ACGAAGTTGA AATCGTTGGT 
AACAACTGTT ACAGGTGTTG AAATGTTCCG 
AAGCAGGCGA CAACATCGGT GCTTTATTAC 
ATCGAACGTG GACAAGTATT AGCTAAACCA 
5 AAAATTCAAA GCTGAAGTAT ACGTATTATC 
ACACTCCATT CTTCACTAAC TACCGTCCTC 



ATTAAAGACG AAACATCTAA 450 
TAAATTATTA GACTACGCTG 500 
GTGGTGTAGC ACGTGAAGAT 550 
GCTACAATCA CTCCACACAC 600 
AAAAGAAGAA GGCGGACGTC 650 

680 



aO_2.)_INEHORMATJON_F_QR_SEQ_ID _N0_:_ 64 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid , 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R482 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



AGTAGTTTCT GCTGCTGACG GCCCAATGCC TCAAACTCGT GAACACATCC 50 

TATTGTCTCG TCAAGTTGGT GTTCCTTACA TCGTTGTATT CTTGAACAAA 100 

GTAGACATGG TTGATGACGA AGAATTACTA GAATTAGTTG AAATGGAAGT 150 

TCGTGACCTA TTAACAGAAT ACGAATTCCC TGGTGACGAT GTTCCTGTAG 200 

30 TTGCTGGATC AGCTTTGAAA GCTCTAGAAG GCGACGCTTC ATACGAAGAA 250 
AAAATTCTTG AATTAATGGC TGCAGTTGAC GAATACATCC CAACTCCAGA . 300 

ACGTGACAAC GACAAACCAT TCATGATGCC AGTTGAAGAC GTGTTCTCAA 350 

TTACTGGACG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGACAAGTT 400 

CGCGTTGGTG ACGAAGTTGA AGTTGTTGGT ATTGCTGAAG AAACTTCAAA 450 

35 AACAACAGTT ACTGGTGTTG AAATGTTCCG TAAATTGTTA GACTACGCTG 500 

AAGCTGGAGA CAACATTGGT GCTTTACTAC GTGGTGTTGC ACGTGAAGAC 550 

ATCCAACGTG GACAAGTTTT AGCTAAACCA GGTACAATCA CACCTCATAC 600 

AAAATTCTCT GCAGAAGTAT ACGTGTTGAC AAAAGAAGAA GGTGGACGTC 650 

ATACTCCATT CTTCACTAAC TACCGTCCTC AATTT 685 

40 



2) INFORMATION FOR SEQ ID NO: 65 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
55 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTAYG CCTCAAACAC 50 
60 GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100 

40 
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TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 200 

ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCTTGA AGGCGATGCT 250 

TCATACGAAG AAAAAATCAT GGAATTAATG GCTGCAGTTG ACGAATACGT 300 

5 TCCAACTCCA GAACGTGACA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTGCTGA 450 

AGAAACTGCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550 

10 GCTCGTGAAG ACATCCAACG TGGACAAGTA TTAGCTAAAG CTGGTACAAT 600 



CACACCTCAT ACAAAATTTA AAGCTGAAGT TTACGTTTTA ACAAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGTGATA AMGTAACAAT CGACGTTGAA TTGATCCACC 800 

15 CAATCGCTAT CGAAGACGGA ACTCG 825 



2) INFORMATION FOR SEQ ID NO: 66 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

35 TCCTATGCCT CAAACTCGTG AACACATCTT GTTATCACGT AACGTTGGCG 50 

TACCATACAT CGTTGTTTTC TTGAACAAAA TGGATATGGT TGATGACGAA 100 

GAATTGCTAG AATTAGTTGA AATGGAAGTT CGTGACCTAT TGTCTGAGTA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCCGGTTCT GCTTTGAAAG 20 0 

CTCTTGAAGG AGATCCTTCA TACGAAGAAA AAATCATGGA ATTGATGGCT 250 

40 GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGATACTG ACAAACCATT 300 

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA TGAAGTAGAA 400 

ATCGTTGGTA TTGCTGACGA AACTGCTAAA ACAACTGTAA CAGGTGTTGA 450 

AATGTTCCGT AAATTGTTAG ACTATGCTGA AGCAGGGGAT AACATTGGTG 500 

CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTG 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTCAAAG CTGAAGTTTA 600 

TGTTTTGACA AAAGAAGAAG GTGGACGTCA CACTCC 636 



2) INFORMATION FOR SEQ ID NO: 67 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

41 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 8 043 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACATAT CCTAYTATCT CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCTTGAACA AAGTAGATAT GGTTGACGAC GAAGAATTAC TAGAATTAGT 150 

10 TGAAATGGAA (STTCGTGACTJATTAACAGA _ATACGAATTC JTCTGGTGACG 2.0.0. 

ATGTTCCTGT AGTTGCTGGT YCAGCTTTGA AAGCTTTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATCCT TGAATTGATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 4 00 

15 CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 4 50 

AGAAACAGCT CAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

20 AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 7 00 

TTCCGTACRA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAACGGT ACTAAATTCT CAATC 83 5 



2) INFORMATION FOR SEQ ID NO: 68 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

CGGAGCAATC TTAGTTGTTT CTGCTGCTGA CGGCCCTATG CCTCAAACTC 50 

GTGAACACAT CCTATTATCT CGTCAAGTTG GTGTACCATA CATCGTTGTA 100 

45 TTCTTGAACA AAGTAGATAT GGTTGATGAC GAAGAATTAC TTGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AATCGCTGGT TCAGCTTTAA GAGCTTTAGA AGGCGACGCT 250 

KCATACGAAG AAAAAATTCT TGAATTGATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GAACGTGATA ACGACAAACC ATTCATGATG CCAGTTGAGG 350 

50 ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG ACGTGTTGAA 400 

CGTGGACAAG YTCGTGTTGG TGACGTTATC GATATCGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GATAACATTG GTGCGTTACT ACGTGGTGTT 550 

TCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTAAAC CAGGTACAAT 600 

55 CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCYGTACGA CTGACGTAAC TRGTGTTGTY GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 835 

60 

42 
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2) INFORMATION FOR SEQ ID NO: 69 



5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
10 . _ 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 
15 (B) STRAIN: ATCC 49372 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 



CGGAGCTATC 

20 GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ACACTCYAGT 
TCATACRAAG 

25 CCCAACACCA 
ACGTATTCTC 
CGTGGACAAG 
AGAAACTGCT 
TAGACTACGC 

30 GCACGTGAAG 
CACTCCACAT 
AAGGCGGGCG 
TTCCGTACAA 
AATGGTAATG 

35 CAATCGCGAT 



TTAGTAGTAT 
CTTGTTATCT 
AAATGGATAT 
GTTCGTGACT 
TATCGCTGGT 
AAAAAATCTT 
GTTCGTGATA 
AATCACTGGT 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGC 
ACATCCAACG 
ACAAAATTCT 
TCACACTCCG 
CTGACGTAAC 
CCTGGTGATA 
CGAAGACGGA 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGATGAC 
TATTGTCAGA 
TCAGCYTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GATAACATCG 
TGGACAAGTA 
CTGCAGAAGT 
TTCTTCACTA 
TGGTGTTGTT 
ACGTAACTAT 
ACTCGTTTCT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGATTTC 
AAGCTTTAGA 
SCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCATTATT 
TTGGCTAAAC 
TTACGTTTTA 
ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 
CTATTC 



CCTCAAACAC 
CATCGYTGTA 
TAGAATTAGT 
CCAGGCGACG 
AGGCGACCCT 
ACGAATACAT 
CCAGTCGAAG 
TCGTGTTGAA 
GTATCGCTGA 
CGTAAATTGT 
ACGTGGTGTT 
CAGCTTCAAT 
ACTAAAGAAG 
TCAGTTCTAC 
AAGGTACTGA 
TTAATCCACC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
836 



2) INFORMATION FOR SEQ ID NO: 70 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH : 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA . 



(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

CGGAGCTRTC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCT CGTAACGTTG GTGTTCCTTA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACTGA ATACGACTTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATCTT AGAATTAATG GCTGCTGTTG ATGAATACAT 300 

43 
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CCCAACACCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGYGGAAG 350 

ACGTAYTCTC AATCACTGGT CGTGGAACTG TTGCAACTGG TCGTGTTGAA 4 00 

■ CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATTGCTGA 4 50 

AGAAACTGCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTGT 500 

5 TGGATTACGC TGAAGCGGGC GACAACATTG GTGCATTATT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAC CAGCTTCAAT 600 

CACTCCACAT ACAAAATTCT CTGCAGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGCGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTAGTT GATCTACCAG AAGGTACTGA 750 

1.0 — _ AAT_GCTAATG_ X.CTGGTGA_TA_ ACGTAACTAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCGAT CGAAGACGGA ACTCGTTTCT CTATT 835 



15 2 ) INFORMATION FOR SEQ ID NO: 71 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

. GTGAACACAT CTTGTTATCT CGTAACGTAG GTGTTCCTTA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 2 00 

35 ACACTCCAGT TATTGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGATCCA 2 50 

GTTTACGAAG AAAAAATCTT CGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGATA CTGAAAAACC ATTCATGATG CCAGTTGAGG 350 

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GAAATCGTTG GTATCGACGA 4 50 

40 AGAAACAGCT CAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTAT 5 00 

TAGACTACGC TGAAGCAGGC GATAACATCG GTGCTTTATT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TT^CTAAAC CAGGAACAAT ~~ 6UU~ ~ 

CACTCCTCAT ACAAAATTCG TAGCTGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTA GAATTACGCG AAGGTACTGA 750 

AATGGTAATG CCTGGTGACA ACGTAACTAT CGACGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CTATT 835 



25 



2) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

44 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 49428 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

GAGCTATCTT GGTAGTTTCT GCAGCTGATG GCCCAATGCC ACAAACTCGT 50 

GAACATATTT TGTTGTCACG TAATGTAGGT GTACCTTACA TCGTTGTGTT 100 

CTTGAACAAA ATGGATATGG TTGATGACGA AGAATTACTT GAGTTAGTTG 150 

10 AAATGGAA G T ACGTGAT CTA TTATCTGAAT ACGACTTCCC AG GAGATGA T 200 

ACTCCAGTTA TTTCCGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA 250 

ATATGAACAA AAAATTATGG ACTTAATGGA TGCAGTTGAT GACTACATTC 300 

CAACTCCTGA ACGTGACCAT GACAAACCAT TCATGATGCC AATTGAAGAT 350 

GTATTTTCAA TTACAGGCCG TGGTACTGTT GCTACAGGAC GTGTTGAACG 400 

15 CGGGACTATC AAAGTCGGCG ATGAAGTTGA CATTATTGGT ATTCATGAAG 450 

ACGTTAAAAA GACAACAGTT ACTGGTGTAG AAATGTTCCG TAAATTGTTG 500 

GACTACGCTG AAGCAGGCGA TAACATTGGT ACTTTGTTAC GTGGTGTTTC 550 

TCGTGATGAT ATCGAACGTG GTCAAGTATT AGCTAAACCA GGTTCAATCA 600 

CACCACATAC AAGATTCTCT GCTGAAGTTT ATGTTTTGAC TAAAGAAGAA 650 

20 GGCGGACGTC ATACTCCATT CTTCTCAAAC TATCGTCCTC AATTCTACTT 700 

CCGTACAACT GATATCACTG GTGTCATTGA ATTGCCAGAA GGTACTGAAA 750 

TGGTAATGCC AGGTGATAAT GTAACAATGG ATGTTGAATT AATCCACCCA 800 

GTCGCTATCG AAGAAGGAAC TCG 823 



25 



40 



2) INFORMATION FOR SEQ ID NO: 73 



(i) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 835 bages 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



CGGTGCAATC TTGGTCGTAT CAGCGACAGA TGGCCCAATG CCACAAACAC 50 

GGGAGCATAT TTTGCTTTCT CGTCAAGTGG GTGTGAAACA TTTGATCGTC 100 

45 TTTTTGAATA AGACGGACCT TGTCGATGAT GACGAGTTGA TCGATTTAGT 150 

TGAAATGGAA GTCAGAGAAT TGCTGACTGA ATATGATTTT CCTGGCGACG 200 

ACATTCCTGT GATCAAGGGC TCTGCGTTAA AAGCCTTGGA AGGGGACCCA 250 

GATGCTGAAG CAGCGATCTT AACGCTGATG GATACAGTAG ATGAATATAT 300 

CCCAACGCCA GAACGTGATA CTGACAAACC ATTGTTGTTA CCGATCGAAG 350 

50 ATGTCTTTTC GATCACAGGA CGGGGGACCG TTGCTTCTGG TCGGATCGAT 400 

CGCGGCATGG TAAAAGTCGG GGATGAAGTA GAAATCGTCG GAATCAAACC 450 

TGAAACACAA AAAGCAGTCG TGACAGGGGT AGAAATGTTC CGCAAAACGA 500 

TGGACTTCGG AGAAGCTGGC GATAACGTAG GGGTATTGTT ACGGGGCATC 550 

ACCCGTGATG AAATTGAACG TGGCCAAGTG TTAGCAAAAC CAGGTTCTAT 600 

55 CACACCGCAT ACGAAATTCC AAGCGGAAGT CTATGTGTTG ACAAAAGAAG 650 

AAGGCGGTCG CCATACCCCA TTCTTTAATA ATTATCGCCC ACAATTTTAC 700 

TTCCGTACAA CGGACGTAAC TGGGAATATC GTTTTACCAG AAGGAACGGA 750 

AATGGTGATG CCTGGTGACA ACGTAACGAT CGATGTGGAA TTGATCCATC 800 

CGATCGCTGT AGAAAATGGA ACGACCTTCT CGATT 835 



60 
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10 



2) INFORMATION FOR SEQ ID NO: 74 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



-{ ii) MOLECULE TYPE :~Genomic _ DNA— 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 
15 (B) STRAIN: ATCC 153 05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50 

20 AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100 

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150 

AGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACTTGCTT 200 

TTTTAAGTAT AGGTATAAAA TACGATTGAT TAAAACAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300 

25 TACAACAACG CTGATTGGCG CGTTACTATT ATATTTGCCA ATCACAGGTA 350 

AACATCCTAT TGATTTTGTG GACGCCCGTT 3 80 



30 2) INFORMATION FOR SEQ ID NO: 75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
(B) STRAIN: ATCC 4 9996 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 



45 



GCAATCTTGG TCGTATCAGC GACAGATGGC CCAATGCCAC AAACACGGGA 50 

GCATATTTTG CTTTCTCGTC AAGTGGGTGT GAAACATTTG ATCGTCTTTT 100 
TGAATAAGAC GGACCTTGTC GATGATGACG AGTTGATCGA TTTAGTTGAA • 150 

ATGGAAGTCA GAGAATTGCT GACTGAATAT GATTTTCCTG GCGACGACAT 200 

50 TCCTGTGATC AAGGGCTCTG CGTTAAAAGC CTTGGAAGGG GACCCAGATG 250 

CTGAAGCAGC GATCTTAACG CTGATGGATA CGGTAGATGA ATATATCCCA 300 

ACGCCAGAAC GTGATACTGA CAAACCATTG TTGTTACCGA TCGAAGATGT 350 

CTTTTCGATC ACAGGACGGG GGACCGTTGC TTCTGGTCGG ATCGATCGCG 400 

GCATGGTAAA AGTCGGGGAT GAAGTAGAAA TCGTCGGAAT CAAACCTGAA 450 

55 ACACAAAAAG CAGTCGTGAC AGGGGTAGAA ATGTTCCGCA AAACGATGGA 500 

CTTCGGAGAA GCTGGCGATA ACGTAGGGGT ATTGTTACGG GGCATCACCC 550 

GTGATGAAAT TGAACGTGGC CAAGTGTTAG CAAAACCAGG TTCTATCACA 60 0 

CCGCATACGA AATTCCAAGC GGAAGTCTAT GTGTTGACAA AAGAAGAAGG 650 

CGGTCGCCAT ACCCCA 665 



60 
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2) INFORMATION FOR SEQ ID NO: 76 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
0 

( i l ) MOLECULE TYPE": Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
15 (B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

TGGTGCGATT TTAGTTGTAT CCGCAACAGA TGGTCCAATG CCTCAAACCC 50 

20 GGG AACATAT CTTGCTTTCG AGACAAGTTG GTGTGAAACA TCTGATTGTT 100 

TTCTTGAACA AAATCGATTT AGTCGATGAC GAAGAATTGA TTGATTTAGT 150 

AGAAATGGAA GTAAGAGAAC TGCTATCTGA ATATAATTTT CCAGGGGATG 2 00 

ACATTCCTGT TATCAAAGGT TCGGCATTAA AAGCGTTGGA AGGAGACCCT 2 50 

GATGCAGAAG CTGCCATCAT GGAATTAATG GATACAGTAG ACAGCTATAT 3 00 

25 CCCAACACCT GAGCGTGATA CAGACAAACC ATTACTCTTG CCAGTTGAAG 3 50 

ATGTCTTTTC GATTACTGGA CGAGGAACAG TTGCTTCCGG ACGGATCGAT 4 00 

CGGGGAACAG TTCGGGTAGG CGATGAAGTA GAAATCGTCG GTATCAAACC 450 

TGAAACCCAA AAAGCTGTAG TGACAGGCGT CGAAATGTTC CGCAAGACGA 500 

TGGACTTTGG GGAAGCCGGT GACAATGTAG GTGTCTTGCT GAGAGGGATC 550 

30 ACTCGTGACG AAATTGAACG AGGACAAGTG TTGGCTAAAC CAGGTTCGAT 600 

CACACCACAT ACAAAATTCC AAGCAGAAGT TTATGTATTG ACGAAAGAAG 650 

AAGGTGGTCG TCATACACCA TTCTTCAACA ACTATCGTCC ACAATTTTAT 700 

TTCCGTACAA CGGATGTGAC AGGGAACATT ACATTGCCTG AAGGAACAGA 750 

A 751 

35 



2) INFORMATION FOR SEQ ID NO: 77 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia canis 
50 (B) STRAIN: Florida 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 



TGCAGCAATA TTAGTAGTGT 
55 GAGAACATAT ATTATTAGCA 
TGGATGAATA AGTGTGATGT 
TGAAATGGAA ATAAGGGAAT 
ATATAGATGT AGTTAGAGGA 
GGCTCAGGTG TGTGGAGTGA 
60 AAAAATAAGT TTACCAGTAA 



CTGCAACTGA TGGAGCAATG 
AAGCAAGTAG GTGTAAAAGA 
TGTAGATGAT GAAGAAATGT 
TGTTATCAAA ATATGGGTAT 
TCTGCAGTTA AAGCATTAGA 
AAAAATAATG GAATTGATGA 
GAGAAAAAGA TAAGCCATTT 
47 



CCACAAACAA 50 

TATAGTAGTG 100 

TGTCATTAGT 150 

CCTGGGGATG 200 

AGAAGAAACA 2 50 

ATGCTTTAGA 300 

TTAATGTCAA 350 
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TAGAAGATGT GTTTTCAATA CCTGGAAGAG GTACAGTAGT AACAGGAAGA 400 

ATAGAAAGAG GAGTAATTAG AGTAGGGGAT AAAATAGAGA TAGTAGGATT 450 

GCGTGAGATA CAAAGTACAG TATGTACAGG TGTTGAAATG TTTCATAAAG 500 

CATTAGATGC AGGAGAAGCA GGGGATAATG CTGGAATATT GTTAAGAGGG 550 

5 ATAAAAAAAG AAGATGTAGA AAGAGGGCAA GTATTGAGTG CACCTGGACA 600 

GATACATTCA TATAAGAGAT TTAAGGCAGA GGTATATATA TTGAAAAAAG 650 

AAGAAGGAGG AAGACATACT CCATTTTTCT CAAATTACCA GCCGCAATTT 700 

TATGTTAGAA CAACAGATGT AACAGGGAAT ATAAAGTTAC CAGAAGGAGT 750 

AGAAATGGTA ATGCCAGGGG ATAATATAAA TATCGAAGTG AGTTTGGATA 800 

10 AGCCTGTTGC TATTGATCAA GGATTGAGAT TTGC 834 



2) INFORMATION FOR SEQ ID NO: 78 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ,817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

30 CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

35 GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTAYAT 300 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

40 ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

45 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGT 817 



50 2) INFORMATION FOR SEQ ID NO: 79 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

48 
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(A) ORGANISM: Escherichia fergusonii* 

(B) STRAIN: ATCC 35469 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

5 

CGATCCTGGT AGTTGCTGCG ACTGACGGCC CGATGCCGCA GACTCGTGAG 50 

CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCCT 100 

GAACAAGTGC GACATGGTTG ATGACGAAGA GCTGCTGGAA CTGGTTGAAA 150 

TGGAAGTTCG TGAACTTCTG TCTCAGTACG ACTTCCCGGG CGACGACACT 200 

10 CCGATCGTTC GTGGTTCTGC TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG 250 

GGAAGCGAAA ATCCTGGAAC TGGCTGGCTT CCTGGATTCT TACATTCCGG 300 

AACCAGAGCG TGCGATTGAC AAGCCGTTCC TGCTGCCGAT CGAAGACGTG 350 

TTCTCCATCT CCGGTCGTGG TACCGTTGTT ACCGGTCGTG TAGAACGCGG 400 

TATCATCAAA GTTGGTGAAG AAGTTGAAAT CGTTGGTATC AAAGAGACTC 450 

15 AGAAGTCTAC CTGTACTGGC GTTGAAATGT TCCGCAAACT GCTGGACGAA 500 

GGCCGTGCTG GTGAGAACGT AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 550 

AGAAATCGAA CGTGGTCAGG TACTGGCTAA GCCGGGCACC ATCAAGCCGC 600 

ACACCAAGTT CGAATCTGAA GTGTACATTC TGTCCAAAGA TGAAGGCGGT 650 

CGTCATACTC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 700 

20 TACTGACGTG ACTGGTACCA TCGAACTGCC GGAAGGCGTA GAGATGGTAA 750 

TGCCGGGCGA CAACATCAAA ATGGTTGTTA CCCTGATCCA CCCGATCGCG 800 

ATGGACGACG GTCTGCGTTT CGCAA 825 



25 



2) INFORMATION FOR SEQ ID NO: 80 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 80 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

4 5 GAGATGGAAG TTCGCGAACT GCTGTCCCAG TACGATTTCC CGGGCGACGA 200 

CACCCCGATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 

AGTGGGAAGA GAAAATCATC GAACTGGCTG GCTACCTGGA TTCCTATATC 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

50 GCGGTATCAT CAAAGTGGGT GAAGAAGTTG AAATCGTGGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 50 0 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG TTCCATCAAG 600 

CCKCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

55 CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTT CGCAA 829 



60 
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2) INFORMATION FOR SEQ ID NO: 81 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

20 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGTGAAC TTCTGTCCCA GTACGACTTC CCGGGCGACG 200 

ACACCCCGAT CATTCGTGGT TCTGCGCTGA AAGCGCTGGA AGGCGAAGCT 250 

GAGTGGGAAG AGAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCCTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

25 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAARGTKGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

TACTGCGAAA TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCG TGCAGGCGAG AACTGCGGCG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGAGA TCCAGCGTGG CCAGGTTCTG GCTAAGCCGG GCTCAATCAA 600 

30 GCCGCACACC AAGTTCGAAT CCGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCATCCGA 800 

TCGCGATGGA CGACGG 816 

35 



2) INFORMATION FOR SEQ ID NO: 82 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: EuJbacterium lentum 
50 (B) STRAIN: ATCC 43055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

CGGCGCCTCC TCGTTATCGC CGCCACCGAC GGCCCGATGG CCCAGACCCG 50 

55 CGAGCACATC CTGCTCGCCC GTCAGGTCGG CGTGCCCTAC ATCGTGGTCT 100 

TCCTGAACAA GTGCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAAG TTCGCGAGCT GCTCGACTCT TACGAGTTCC CGGGCGACGA 2 00 

CACCCCGATC ATCCGCGGCT CCGCTTTGAA GGCCCTCGAG GGCGACAAAG 250 

AGTGGCAGGA GAAGGTCTGG GAGCTCATGG ACGCCGTCGA CTCCTACATC 300 

60 CCGACGCCGG AGCGCATGGT CGACAAGCCG TTCCTGATGG CCGTCGAGGA 350 

50 
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CACGATGACC ATCACCGGCC GCGGCACCGT 
GTGGTACGCT GCATGTCAAC GACCCGCTGG 
ACCCAGAACA CGGTCTGCAC CGGTATCGAG 
CGAGGCTCAG GCCGGCGACA ACATCGGCTG 
5 GCGAGGAGAT CGTTCGCGGC CAGGTTCTCT 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGCCAC ACGCCGTTCT TCGACGGCTA 
GCACGACGGA CGTGACGGGT GTTGCCCACC 
GTCATGCCGG GCGACAACGT GGAGATCAAG 
10 CGCCAGGAAG AGGGCTGCGT TCGCTAAC 



TGCCACCGGT CGTGTGGAGC 400 

AGATCGTCGG TATCAAGGAG 450 

ATGTTCCGCA AGCTGCTCGA 500 

CCTGCTCCGC GGTGTCAAGC 550 

GCAAGCCCGG TAGCGTGACC 600 

ATCCTGACGA AGGAAGAGGG 650 

CCGTCCGCAG TTCTACTTCC 700 

TTCCCGAGGG CACCGAGATG 750 

GGCGAGCTCA TTCACCCGAT 800 

828 



2) INFORMATION FOR SEQ ID NO: 83 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Eubacteriuflt nodatum 

(B) STRAIN: ATCC 33099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 



30 GGAGCAATTC TGGTTTGTGC AGCAACKGAC GGACCAATGC CTCAGACAAG 50 

AGAACATATC CTTTTGTCAA GGCAGGTAGG AGTGCCATAT ATCATCGTAT 100 

TCCTGAATAA ATGTGACATG GTGGATGAYG AAGAGCTTCT GGACTTGGTA 150 

GAGATGGAAG TAAGAGAACT TCTCAGTGAG TATGAATTCC CGGGAGATGA 200 

TACCCCGATA GTAAGAGGTT CAGCCCTGAA GGCACTGGAA GAACCCAATG 250 

35 GAGAATGGGC AGACAAGATT GTAGAGCTGA TGGAGGAAGT AGATAAATAC 300 

ATTCCTGAAC CAAAGAGAGA TAACGACAAA CCGTTCCTGA TGCCTGTAGA 350 

GGACGTATTC TCAATAACAG GAAGAGGAAC AGTAGCGACA GGAAGRGTTG 400 

AAAGAGGAAT CCTGAAGGTC GGTGATGAAG TAGAAATCGT GGGAATGAGC 4 50 

GAAGAGAGAA GAAAGGTAGT AGTAACGGGA GTTGAAATGT TCAGAAAGCT 500 

40 TCTGGATGAA GCAGAGACAG GAGACAACAT CGGAGCACTG CTGAGAGGAG 550 

TTCAGAGAAC RGAGATCCAG AGAGGTCAGG TATTGGCRGC ACCTGGAACG 600 

ATCAACCCAC ATACl^G^YlZJ^GGGT^G GTATiSGTAC TGAA~GAAGGA~ bbb 

AGAAGGAGGA AGGCATACGC CGTTCTTCAA YGGATACAGW CCACAGTTCT 700 

ACTTCAGAAC AACAGACGTA ACAGGAGATT TGCAGCTGCC GGAAGGARCA 750 

45 GAGATGTGCA TGCCGGGAGA TAATGTGGTA ATGAACRTCA GCCTGATCAC 800 

TCCGATTGCT ATAGAAGAGG GWCTGAGATT TGCCA 835 



50 2) INFORMATION FOR SEQ ID NO: 84 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

51 
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(A) ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

5 

GCGATCCTGG TTGTTGCTGC AACTGATGGC CCTATGCCAC AGACTCGTGA 50 

GCACATCCTG TTGGGTCGYC AGGTTGGCGT TCCATTCATG ATCGTATTCA 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

ATGGAAGTTC GYGAACTTCT GTCTGCTTAC GATTTCCCAG GCGACGACAT 200 

10 CCCAGTTGTT AAAGGTTCAG CGCTGAAAGC ACTGGAAGGC GAAGCAGAGT 250 

GGGAAGCTAA - GATCATCGAG-CTGGCTGGCC^ACCTGGATAG ~CTA'CATCCCA~~~ "300" 

GAACCAGAGC GTGCGATTGA CAAGCCATTC CTGCTGCCAA TCGAAGACGT 350 

ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 400 

GTATCATCAA AGTTGGCGAA GAAGTTGAAA TCGTTGGTAT CAAGGACACT 450 

15 GTTAAGTCTA CTTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCY GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAGCGTG 550 

AAGACATCGA ACGTGGTCAG GTTCTGGCTA AACCAGGTTC AATCAAACCA 600 

CACACCACWT TYGACTCAGA AGTTTATATC CTGAGCAAAG ATGAAGGCGG 650 

CCGTCATACT CCGTTCTTCA AAGGCTACCG TCCACAGTTC TACTTCCGTA 700 

20 CAACTGACGT GACCGGTACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 750 

ATGCCAGGTG ACAACATCAA CATGRTAGTT ACCCTGATCC ACCCAATCGC 800 

GATGGATGAC GGTCTGCGTT TCGCAA 826 



25 



2 ) INFORMATION FOR SEQ ID NO : 85 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Franc i sella tularensis 

(B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 



TGGTGCTATT 
GTGAGCACAT 
TTCTTAAACA 

45 TGAGATGGAA 
ACACTCCAGT 
GCTTACGTTG 
TCCTGCTCCT 
ATGTATTCTC 

50 CGCGGTGTAG 
AACTCAAAAA 
ATAGAGGGGA 
AGAGATGATG 
GCCACATACT 

55 GTGGTAGACA 
CGTACTACAG 
GGTTATGCCT 
TCGCTAGGAT 



CTAGTATGTT 
TCTGCTTTCT 
AGTGTGACAT 
GTTCGTGAGC 
TATTATGGGT 
AGAAAATTGT 
GAGCGTGATA 
AATTTCAGGT 
TTAACGTTGG 
ACTACAGTAA 
AGCTGGTGAT 
TTGAGCGTGG 
AAGTTTGAAG 
TACTCCATTC 
ACATTACTGG 
GGTGATAACG 
GAAGGGTTAC 



CTGCTGCGGA 
CGTCAAGTTG 
GGTTGATGAT 
TTTTAGATCA 
TCAGCTCTTA 
TGAGCTAGTT 
CTGAGAAGCC 
CGTGGTACTG 
TGATGAAGTT 
CTGGTGTGGA 
AACGTTGGTA 
ACAAGTATTA 
CTGAGGTTTA 
TTCAAGGGAT 
AGCTGTTGAG 
TTAAGATGAC 
GTTTTGCA 



TGGTCCTATG 
GTGTACCAAA 
GAAGAGTTAT 
GTATGAGTTC 
GAGCTATTGA 
CAAGCTATGG 
ATTTATTCTT 
TTGTAACTGG 
GAAGTTGTTG 
AATGTTCCGT 
TCCTAGTTCG 
TGTAAGCCAG 
TGTATTATCT 
ATAGACCACA 
CTTCCAGAGG 
TATCACTCTA 



CCACAAACTC 
AATCGTTGTT 
TAGAGCTAGT 
CCAGGTGATG 
AGGTGACGAA 
ATGACTATAT 
CCGATCGAAG 
TCGTATTGAG 
GTATTCGTCC 
AAGCTTTTAG 
TGGACTTAAG 
GTTCAATTAA 
AAAGAAGAGG 
ATTCTACTTC 
GTGTAGAAAT 
ATTAACCCAA 



bU 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



60 
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. 2) INFORMATION FOR SEQ ID NO: 86 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (iilMQLECULE TYPE: Gen omic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphuw 

(B) STRAIN: ATCC 10953 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 



CGGTGCTATC ATCGTAGKTG CTGCTACTGA TGGTCCGATG CCTCARACTC 50 

GTGAGCAYAT CYTGCTGGCT CGTCAGGTAA ACGTWCCKAG ACTGGTTGTA 100 

20 TTCATGAACA AGTGYGACAT GGTAGACGAC GCTGAAATGY TGGAACTCGT 150 

TGAAATGGAA ATGCGTGAAC TGCTTTCAGC YTACGAATTC GAYGGYGACA 200 

ACACTCCKTT CATTCAGGGT TCTGCTCTTG GTGCRTTGAA YGGCGTTGAA 250 

AAGTGGGAAG AGAAGGTTAT GGANCTGATG GATGCTTGCG ACACTTGGAT 300 

TCCTTTGCCT CCGCGTGATA TTGAYAAACC GTTCTTGATG CCGGTTGAAG 350 

25 ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCTACTGG TCGTATCGAA 400 

GCTGGTGTTA TCCATGTAGG TGACGAAGTT GAAATCCTCG GTTTGGGTGA 450 

AGACAAGAAG TCTGTTGTAA CTGGTGTTGA AATGTTCCGC AAGTTGCTGG 500 

ATCAAGGTGA AGCTGGTGAC AACGTAGGTY TGTTGCTCCG TGGTATCGAC 550 

AAGAACGAAA TCAAACGTGG TATGGTTCTT TGTAAGCCCG GTCAGATTAA 600 

30 ACCTCACTCT AAGTTCAAAG CTTCTATCTA CGTTTTGAAG AAAGAAGAAG 650 

GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA GTTCTATCTG 700 

CGTACTATGG ACTGTACAGG TGAAATCWCT CTTCCGGAAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TAGAAATCAC TGTAGAACTG ATCTACCCGG 800 

TAGCATTGAA CGTAGGTTTG CGTTTCGCT 82 9 

35 



2) INFORMATION FOR SEQ ID NO: 87 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

CTATCTTAGT AATCGCTGCT ACAGATGGAC CAATGGCTCA AACTCGTGAG 50 

CACATCCTAT TATCTCGTAA CGTTGGAGTA CCAAAAATCG TTGTATTCTT 100 

AAACAAATGT GATATGGTTG ATGACGAAGA GTTATTAGAA TTAGTTGAAA 150 

TGGAAGTTCG TGAACTATTA TCTGAATACG GATTCGACGG AGATGAACTA 200 

CCAGTAATCA AAGGTTCTGC TCTTAAAGCT CTTGAAGGAG ATGCAGATGC 250 

AGAAAAAGCT ATCATCGAAT TAATGGAAAC AGTTGACGAA TACATCCCAA 300 

CTCCAGAACG TGATAACGCT AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

53 
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TTCTCAATCA CAGGTCGTGG TACAGTTGCT 
ACAAGTTAAA GTTGGAGACG TAGTAGAAAT 
CAGCTTCAAC .TACTGTAACA GGTGTTGAAA 
TACGCTGAAG CAGGAGATAA CATCGGTGCA 
5 TGAAGACATC GAACGTGGAC AAGTTTTAGC 
CACACACTCA ATTCGTAGCT GACGTGTACG 
GGACGTCACA CTCCATTCTT CACAAACTAC 
TACTACTGAC GTAACTGGTG TAGTTACTTT 
TAATGCCTGG GGATAACGTA TCAATCAACG 
1 0 GCGATCGAAG AAGGAACTCG TTTC TCAA 



ACTGGACGTG TTGAACGTGG 400 

CGTTGGATTA ACTGAAGAAC 450 

TGTTCCGTAA ATTATTAGAT 500 

TTATTACGTG GTGTTGCTCG 550 

AGCTCCTAAA ACAATCACTC 600 

TATTATCTAA AGAAGAAGGT 650 

CGTCCTCAAT TCTACTTCCG 700 

ACCAGAAGGT ACTGAAATGG 750 

TAGAACTTAT TTCTCCAATC 800 

826 



15 



2) INFORMATION FOR SEQ ID NO: 88 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH ; t 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 



3 0 TCTTAGTAAT CGCTGCTACA GATGGTCCTA TGGCTCAAAC TCGTGAACAC 50 

ATCCTATTAT CTCGTAACGT TGGAGTACCT AAAATTGTTG TATTCTTAAA 100 

CAAATGTGAT ATGGTTGATG ACGAAGAGTT ATTAGAATTA GTAGAAATGG 150 

AAGTTCGTGA ACTATTATCT GAATACGGAT TTGATGGAGA TGAACTACCA 200 

GTAATCAAAG GTTCAGCTCT TAAAGCTCTT GAAGGAGATG CAGATGCTGA 250 

3 5 AAAAGCTATC ATCGAATTAA TGGAAACAGT TGACGAGTAC ATCCCAACTC 300 

CAGAACGTGA TAACGCTAAA CCATTTATGA TGCCAGTTGA GGACGTGTTC 350 

TCAATCACAG GTCGTGGTAC AGTTGCTACT GGACGTGTTG AACGTGGACA 4 00 

AGTTAAAGTT GGTGACGTAG TAGAAATCGT TGGATTAACT GAAGAACCAG 450 

CTTCAACTAC TGTAACAGGT GTTGAAATGT TCCGTAAATT ATTAGATTAC 500 

4 0 GCTGAAGCAG GAGATAACAT CGGTGCATTA TTACGTGGTG TTGCTCGTGA 550 

AGATATCGAA CGTGGACAAG TTTTAGCAGC TCCTAAAACA ATCACTCCAC 600 

ATACTCAATT CGTAGCTGAT GTGTACGTAT TATCTAAAGA AGAAGGTGGA 650 

CGTCACACTC CATTCTTCAC AAACTACCGT CCACAATTCT ACTTCCGTAC 700 

TACTGACGTA ACTGGTGTAG TTACTTTACC AGAAGGTACT GAAATGGTAA 750 

4 5 TGCCTGGGGA CAACGTATCA ATCAACGTAG AACTTATTTC TCCAATCGCT 800 

ATCGAAGAAG GAACTCGTTT CTC 823 



50 2) INFORMATION FOR SEQ ID NO: 89 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

54 
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(A) ORGANISM: Haemophilus act inomycetemcomi tana 

(B) STRAIN: ATCC 33384 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 89 

5 

GCTATCTTAG TAGTAGCAGC AACCGACGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCGG GCGATGACAC 2 00 

10 CCCAATCGTA CGCGGTTCTG CATTAAAAGC GCTTGAAGGC GATGCCGCAT 250 

GGGAAGAAAA- AATCGTTGAA- T-TAGGAAAGG-A-TT-TAGATAG- TTACATCCCG "300" 

GAACCTGAGC GTGCTATCGA CCAACCGTTC CTTCTTCCAA TTGAAGATGT 350 

GTACCGTAGT AACGGGTCGT GTTGAGCGCG 400 

GAAGTTGAAA TCGTGGGTAT CAAACCGACT 450 

15 GCAAAAACCA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500 

AGGTCGTGCG GGTGAAAACA TCGGTGCATT ATTGCGTGGT ACTAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTATTGGCGA AACCGGGGTC AATCACCCCG 600 

CACACTGACT TCGAATCTGA AGTGTACGTA TTGTCCAAAG AAGAAGGTGG 650 

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 700 

20 CAACTGACGT AACCGGTACT ATCGAGTTAC CTGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGACCGTA TCCTTAATTC ACCCAATTGC 800 

GATGGACCAA GGTTTACGTT TCGCTATCG 829 



GTTCTCTATC TCCGGTCGTG 
GTATCATCCG TACCGGTGAT 



25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 90 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus aphrophilus 

(B) STRAIN: ATCC 33389 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 



45 



50 



55 



TGGTGCTATC 
GTGAGCACAT 
TTCTTAAACA 
TGAAATGGAA 
ATACACCAAT 
GAATGGGAAG 
TCCTGAGCCA 
ACGTGTTCTC 
CGTGGTATCA 
GACTGCGAAA 
ACGAAGGTCG 
CGTGAAGAAA 
TCCGCACACT 
GTGGTCGTCA 
CGTACAACTG 
GGTTATGCCT 
TCGCGATGGA 



TTAGTAGTAG 
CTTATTAGGT 
AATGCGACAT 
GTTCGTGAAC 
CGTACGTGGT 
AAAAAATCCT 
CAACGTGCTA 
TATCTCCGGT 
TCCGTACCGG 
ACTACCGTAA 
TGCAGGTGAA 
TCGAACGTGG 
GATTTCGAAT 
TACTCCATTC 
ACGTAACCGG 
GGCGATAACA 
CCAAGGTTTA 



CAGCAACTGA 
CGCCAAGTAG 
GGTAGATGAC 
TTCTTTCTCA 
TCTGCATTAC 
TGAATTAGCA 
TCGACCAACC 
CGTGGTACTG 
TGATGAAGTT 
CCGGTGTTGA 
AACATCGGTG 
TCAAGTATTG 
CTGAAGTGTA 
TTCAAAGGTT 
TACTATCGAG 
TCAAAATGAC 
CGTTTCGCTA 



TGGTCCTATG 
GTGTTCCTTA 
GAAGAGTTAT 
ATATGACTTC 
AAGCGTTAAA 
AACCACTTAG 
GTTCCTTCTT 
TAGTAACAGG 
GAAATCGTAG 
AATGTTCCGT 
CATTATTACG 
GCTAAACCGG 
CGTATTATCC 
ACCGTCCACA 
TTACCGGAAG 
TGTATCCTTA 
TCG 



CCACAAACTC 
CATCATCGTA 
TAGAATTAGT 
CCGGGTGATG 
CGGCGTTGCA 
ATACTTACAT 
CCAATTGAAG 
TCGTGTTGAG 
GTATCAAACC 
AAATTACTTG 
TGGCACTAAA 
GCTCAATCAC 
AAAGAAGAAG 
ATTCTATTTC 
GCGTGGAAAT 
ATCCACCCAA 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
833 
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2) INFORMATION FOR SEQ ID NO: 91 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 



CGGCGCTATC TTAGTTGTAG CAGCAACTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTATTAGGC CGCCAAGTTG GTGTTCCTTA CATCATCGTA 100 

20 TTCTTAAATA AATGCGATAT GGTAGATGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGATTTC CCAGGTGACG 200 

ATACTCCTAT CGTTCGTGGT TCAGCATTAC AAGCATTAAA TGGTGTGCCT 250 

GAGTGGGAAG AAAAAATCAT TGAATTAGCA CAACACTTAG ATTCTTATAT 300 

CCCTGAGCCT GAGCGTGCGA TTGATAAACC TTTCTTATTA CCAATCGAAG 3 50 

25 ACGTATTCTC AATTTCAGGT CGTGGTACAG TAGTAACCGG TCGTGTTGAG 4 00 

CGTGGTATCA TCAAATCAGG TGAAGAAGTT GAAATCGTAG GGATTAAAGA 450 

AACGACAAAA ACAACAGTAA CCGGTGTTGA GATGTTCCGT AAACTATTAG 500 

ACGAAGGTCG TGCGGGTGAA AACGTAGGTG CCTTATTACG TGGTACTAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTA GCGAAACCAG GTACAATTAC 600 

30. ACCACACACT GATTTTGAAT CAGAAGTTTA TGTATTATCA AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ATCGTCCTCA GTTCTACTTC 700 

CGYACAACGG ACGTAACAGG AACGATTGAA TTACCTGAAG ATGTTGAGAT 750 

GGTAATGCCT GGTGATAATA TCAAGATGAC AGTAAGCTTA ATTCACCCTA 800 

TCGCGATGGA CGAAG 815 

35 



2) INFORMATION FOR SEQ ID NO: 92 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 
50 (B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCAATG CCACAAACTC 50 

55 GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCATA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCAGGTGACG 200 

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCATTAAA TGGCGTAGCA 250 

GAATGGGAAG AAAAAATCCT TGAGTTAGCA AACCACTTAG ATACTTACAT 300 

60 CCCAGAACCA GAGCGTGCAA TTGACCAACC GTTCCTTCTT CCAATCGAAG 350 

56 
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ATGTGTTCTC AATCTCAGGT 
CGTGGTATCA TCCGTACTGG 
TACAGCAAAA ACTACTGTAA 
ACGAAGGTCG TGCAGGTGAA 
5 CGTGAAGAAA TCGAACGTGG 
GCCACACACT GACTTCGAAT 
GTGGTCGTCA TACTCCATTC 
CGTACAACTG ACGTAACTGG 
GGTAATGCCA GGCGATAACA 
10 TCGCGATGGA CCAAGGTTTA 



CGTGGTACAG TAGTAACTGG 
TGATGAAGTA GAAATCGTAG 
CGGGTGTTGA AATGTTCCGT 
AACATCGGTG CATTATTACG 
TCAAGTATTA GCGAAACCAG 
CAGAAGTTTA TGTATTATCA 
TTCAAAGGTT ACCGTCCACA 
TACTATCGAG TTACCAGAAG 
TCAAGATGAC AGTAAGCTTA 
CGTTTCGCAA 



TCGTGTAGAA 400 

GTATCAAAGA 450 

AAATTACTTG 500 

TGGTACCAAA 550 

GTTCAATCAC 600 

AAAGAAGAAG 650 

ATTCTATTTC 700 

GCGTAGAAAT 750 

ATCCACCCAA 800 
830 



15 



20 



2) INFORMATION FOR SEQ ID NO: 93 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN: ATCC 10014 



30 



35 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 



TCTTAGTAGT 
ATCTTATTAG 
CAAATGCGAT 
AAGTGCGTGA 
GTAGTACGTG 
AGAAAAAATT 
CAGAGCGTGC 
TCAATCTCAG 
CATCAAAGCG 
AAACAACAGT 
CGTGCGGGTG 
AATCGAACGT 



AGCAGCAACA 
GTCGCCAAGT 
ATGGTTGACG 
ACTTCTTTCA 
GTTCAGCGTT 
CTTGAATTAG 
GATTGATAAA 
GTCGTGGTAC 
GGTGAAGAAG 
AACTGGCGTG 
AAAACGTTGG 
GGTCAAGTGT 



GACGGTCCAA 
AGGTGTTCCA 
ATGAAGAATT 
CAATATGACT 
ACAAGCGTTA 
CAAACCACTT 
CCATTCTTAT 
AGTAGTAACA 
TTGAAATCGT 
GAAATGTTCC 
TGCATTATTA 
TAGCGAAACC 



TGCCACAAAC 
TACATCATCG 
ATTAGAATTA 
TCCCAGGTGA 
AACGGCGTAG 
AGATACATAC 
TACCAATCGA 
GGTCGTGTTG 
AGGTATCAAA 
GTAAATTATT 
CGTGGTACAA 
AGGTACAATT 



TCGTGAGCAC 
TATTCTTAAA 
GTTGAAATGG 
TGACACGCCA 
CAGAGTGGGA 
ATCCCAGAGC 
AGACGTATTC 
AGCGTGGTAT 
GACACTGCGA 
AGACGAAGGT 
AACGTGAAGA 
ACACCACACA 



CAGACTTCGA ATCAGAAGTG TACGTATTAT CAAAAGAAGA AGGTGGTCGT 
CACACTCCAT TCTTCAAAGG TTACCGTCCA CAATTCTACT TCCGTACAAC 
TGACGTAACT GGTACTATTG AATTACCAGA AGGCGTAGAA ATGGTAATGC 
45 CAGGCGATAA CATCAAAATG ACAGTATCAT TAATCCACCC AATCGCGATG 
GACGAAGGTT TACGTTTTGC GATT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 



650 
700 
750 
800 
824 



50 2 ) INFORMATION FOR SEQ ID NO: 94 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

57 
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(A) ORGANISM: Haemophilus parainfluenzae 

(B) STRAIN: ATCC 7901 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 



TGGTGCTATC TTAGTAGTAG 
GTGAGCACAT CTTATTAGGT 



CAGCAACTGA TGGTCCTATG 
CGCCAAGTAG GTGTTCCTTA 



TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT 
TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC 
10 ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA 

- — GAATGGGAAG AAAAAATCCT~TGAATTAGCT _ AGCCACTTAG 

TCCTGAGCCT CAACGTGCTA TCGACCAACC GTTCCTTCTT 



CGTGGTACAG TAGTAACAGG 
TGATGAAGTT GAAATCGTAG 



ACGTGTTCTC TATCTCCGGT 
CGTGGTATCA TCCGTACCGG 
GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT 
ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG 
CGTGAAGAAA TCGAACGTGG 
TCCACACACT GATTTCGAAT 
GTGGTCGTCA TACTCCATTC 



TCAAGTATTG GCTAAACCGG 
CTGAAGTGTA CGTATTATCC 
TTCAAAGGTT ACCGTCCACA 
CGTACAACTG ACGTAACCGG AACTATCGAA TTACCGGAAG 
GGTTATGCCT GGTGATAACA TCAAAATGAC TGTATCCTTA 
TCGCGATGGA CCAAGGTTTA CGTTTCGCTA TCG 



CCACAAACTC 

CATCATCGTG 

TAGAATTAGT 

CCGGGTGACG 

CGGCGTWGCA 

-ATTCTTACAT" 

CCAATCGAAG 

TCGTGTTGAG 

GTATCAAACC 

AAATTACTTG 

TGGTACYAAA 

GTTCAATCAC 

AAAGAAGAAG 

ATTCTATTTC 

GCGTGGAAAT 

ATCCACCCAA 



50 
100 
150 
200 
250 
300- 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
833 



25 



2) INFORMATION FOR SEQ ID NO: 95 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus paraphrophilus 

(B) STRAIN: ATCC 29241 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

45 TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 200 

ATACGCCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 250 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTGG ATACTTACAT 300 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 3 50 

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 4 00 

50 CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 4 50 

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

55 GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GTGTGGAAAT 750 

GGTAATGCCT GGCGATAACA TCAAAATGAC CGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTT 824 



60 
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2) INFORMATION FOR SEQ ID NO: 96 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus segnis 

(B) STRAIN: ATCC 33393 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 



GCTATCTTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

20 TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCAG GTGATGATAC 200 

TCCAATCATT CGTGGTTCTG CATTACAAGC GTTAAACGGC GTAGCAGAAT 250 

GGGAAGAAAA AATCCTTGAA TTAGCTCAAG CATTAGATAC TTACATTCCT 300 

GAACCTGAGC GTGCAATCGA CCAACCGTTC CTTCTTCCAA TTGAAGACGT 350 

25 GTTCTCAATC TCTGGTCGTG GTACTGTAGT AACAGGTCGT GTAGAGCGTG 4 00 

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTTGGTAT CAAACCAACT 4 50 

GCGAAAACAA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500 

AGGTCGTGCA GGTGAAAACA TCGGTGCATT ATTACGTGGT ACTAAACGTG 550 

AAGAAATCGA ACGTGGTCAA GTATTAGCGA AACCGGGTTC AATCACTCCA 600 

30 CACACTGACT TCGAATCTGA AGTGTACGTA TTATCTAAAG AAGAAGGTGG 650 

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 700 

CAACTGACGT AACCGGTACT ATCGAGTTAC CGGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGACCGTA TCCTTAATCC ACCCAATCGC 800 

GATGGACCAA GGTTTACG 818 

35 



2) INFORMATION FOR SEQ ID NO: 97 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

CGGCGCTATC CTGGTTGTTG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTW CATCATCGTR 100 

TTCCTGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCVJGGYGATG 2 00 

ATACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGYGMACCT 2 50 

GAGTGGGAAG CTAAGATCGT AGAACTGGCT GAAACTCTGG ATTCTTACAT 300 

YCCACARCCA GAACGTGCTA TCGAYAAGCC ATTCCTGCTG CCAATCGAAG 350 

59 
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ACGTATTCTC TATCTCTGGC CGTGGTACWG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCG TTAAAGTTGG TGAAGAAGTT GARATCGTTG GTATCAAAGA 4 50 

TACCGTTAAA TCAACTTGTA CCGGCGTTGA AATGTTCCGT AAACTGCTGG 500 

ACGAAGGTCG TGCAGGCGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAG 550 

5 CGTGAAGACA TCGAACGTGG TCAGGTTCTG GCTAAACCAG GYTCYATCAA 600 

GCCACACACC AAGTTCGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACCGG TACCATCGAA TTGCCAGAAG GCGTGGAAAT 750 

GGTAATGCCA GGC 763 

10 



2) INFORMATION FOR SEQ ID NO: 98 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 
25 (B) STRAIN: ATCC 23330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

CGGCGCAATC TTGGTATGTT CAGCAGCTGA CGGTCCTATG CCACAAACTC 50 

30 GCGAACACAT CTTGTTGGCT CGCCAAGTAG GTGTACCTTA TATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTCGATGAT GCTGAGTTGT TGGAATTGGT 150 

TGAAATGGAA ATCCGTGACT TGTTGTCTAG CTACGATTTT CCAGGCGACG 200 

ATTGCCCAAT CGTTCAAGGT TCTGCATTGC GYGCATTGGA AGGCGACGCT 250 

GCATACAAAG AAAAAATCTT TGAATTGGCT GCTGCTTTGG ATAGCTACAT 300 

35 TCCTACTCCA GAACGTGCTG TTGATAAACC ATTCTTGTTG CCAATCGAAG 350 

ATGTATTCTC TATCTCTGGT CGTGGTACAG TAGTTACTGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTAGG CGAAGAGATT GAAATCGTTG GTTTGAAAGA 450 

CACGCAAAAA ACCACTTGTA CTGGCGTGGA AATGTTCCGC AAATTGTTGG 500 

ACGAAGGTCA AGCTGGTGAT AACGTTGGTG TATTATTGCG TGGTACGAAG 550 

40 CGTGAAGACG TTGAACGTGG TCAGGTATTG GCTAAACCAG GTTCTATCAC 600 

TCCGCACACT AAATTTGAAG CTGAAGTGTA TGTGTTGAGC AAAGAAGAAG 650 

GTGGC CGT CA ~T ACGC CATTC" TtIiGCTAALT" ACCGCC^ / U U 

CGTACGACTG ACGTAACTGG TGCAGTTACT TTGTCTGAGG GTGTGGAAAT 750 

GGTTATGCCA GGCGAAAACG TGAAAATCAC TGTTGAGTTG ATTGCACCTA 800 

45 TCGCTAGGAA AACGGTTTGC GTTTTGCG 828 



50 



2) INFORMATION FOR SEQ ID NO: 99 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Klebsiella ornithinolytica 

60 
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(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

5 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GG'CCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAGCTGGTT 150 

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGATC GTTCGTGGTT CCGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

10 ACTGGGAAGC GAAAATCATC GAACTGGCTG GCTACCTGGA TTCTTACATC 3 00 

_ CCGGAACCAG_AGCGT.GC.GAT_. .TGACAAGCCG -TTCCTGCTGC-CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

15 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

20 GTCATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TTCACCCGAT 800 

CGCGATGGAC GATGGTCTGC GTTTCGCA 828 



25 2) INFORMATION FOR SEQ ID NO: 100 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 9 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 



GATGCCGCAG ACTCGTGAGC ACATCCTGCT GGGTCGTCAG GTAGGCGTTC 50 

CGTACATCAT CGTGTTCCTG MCAAGTGCG ACATGGTTGA TGACGAAGAG " luV * 

CTGCTGGAAC TGGTTGAAAT GGAAGTTCGT GAACTTCTGT CTCAGTACGA 150 

TTTCCCGGGC GACGACACTC CGATCGTTCG TGGTTCTGCT CTGAAAGCGC 2 00 

45 TGGAAGGCGA CGCWGAGTGG GAAKCKAAAA TCATCGAACT GGCTGGCTTC 250 

CTGGATTCTT ACATTCCGGA ACCAGAGCGT GCGATTGACA AGCCGTTCCT 300 

GCTGCCGATC GAAGACGTAT TCTCCATCTC CGGTCGTGGT ACCGTTGTTA 350 

CCGGTCGTGT AGAGCGCGGT ATCATCAAAG TTGGCGAAGA AGTTGAAATY 4 00 

GTTGGTATYA AAGACACTGC TAAGTCTACC TGTACTGGCG TTGAAATGTT 450 

50 CCGCAAACTG CTGGACGAAG GCCGYGCTGG TGAGAACGTT GGTGTTCTGC 500 

TGCGTGGTAT CAAACGTGAA GAAATCGAAC GTGGTCAGGT ACTGGCTAAG 550 

CCGGGCTCTA TCAAGCCGCA CACCAAGTTC GAATCTGAAG TTTATATCCT 600 

GTCCAAAGAC GAAGGCGGCC GTCACACTCC GTTCTTCAAA GGCTACCGTC 650 

CGCAGTTCTA CTTCCGTACA ACTGACGTGA CTGGCACCAT CGAACTGCCG 7 00 

55 GAAGGCGTAG AGATGGTTAT GCCGGGCGAC AACATCAAAA TGGTTGTTA 749 



60 



2) INFORMATION FOR SEQ ID NO: 101 

61 
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10 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 
(B) STRAIN:- -ATCC— 3-3 5 31 — 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 



15 TGGCGCGATC 
GTGAGCACAT 
TTCCTGAACA 
TGAAATGGAA 
ACACTCCGAT 

20 GACTGGGAAG 
CCCGGAACCA 
ACGTATTCTC 
CGCGGTATCA 
TACTGCTAAR 

25 ACGAAGGCCG 
CGTGAAGAAA 
GCCGCACACC 
GCGGCCGTCA 
CGTACAACTG 

30 GGTAATGCCG 
TCGCGATGGA 



CTGGTTGTTG 
CCTGCTGGGT 
AATGCGACAT 
GTTCGTGAGC 
CGTTCGTGGT 
CGAAAATCAT 
GAGCGTGCGA 
CATCTCCGGT 
TCAAAGTGGG 
TCWACCTGTA 
TGCTGGTGAG 
TCGAACGTGG 
AAGTTCGAAT 
TACTCCGTTC 
ACGTGACTGG 
GGCGACAACA 
CGACGGTCTG 



CTGCGACTGA 
CGTCAGGTAG 
GGTTGATGAC 
TGCTGTCTCA 
TCCGCTCTGA 
CGAACTGGCT 
TTGACAAGCC 
CGTGGTACCG 
CGAAGAAGTT 
CYGGCGTTGA 
AACGTTGGTG 
TCAGGTACTG 
CTGAAGTGTA 
TTCAAAGGCT 
CACCATCGAA 
TCAAAATGGT 
CGTTTCGCAA 



CGGCCCGATG 
GCGTTCCGTA 
GAAGAGCTGC 
GTACGACTTC 
AAGCGCTGGA 
GGCTACCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAGCCGG 
CATTCTGTCC 
ACCGTCCGCA 
CTGCCGGAAG 
TGTTACCCTG 



CCGCAGACTC 
CATCATCGTG 
TGGAACTGGT 
CCGGGCGACG 
AGGCGAAGCA 
ATTCTTACAT 
CCTATCGAAG 
TCGTGTAGAG 
GTATCAAAGA 
AAACTGCTGG 
TGGTATCAAA 
GCTCTATCAA 
AAAGACGAAG 
GTTCTACTTC 
GCGTAGAGAT 
ATTCACCCRA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
830 



35 2) INFORMATION FOR SEQ ID NO: 102 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



CCTGGTTGTT 
TCCTGCTGGG 
AAATGCGACA 
AGTTCGTGAA 
TCGTTCGTGG 
GCGAAAATCA 
AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTAG 
AACCACCTGT 



GCTGCGACTG 
TCGTCAGGTA 
TGGTTGATGA 
CTGCTGTCTC 
TTCTGCTCTG 
TCGAACTGGC 
ATTGACAAGC 
TCGTGGTACC 
GTGAAGAAGT 
ACTGGCGTTG 



ACGGCCCGAT GCCGCAGACT 
GGCGTTCCGT ACATCATCGT 
CGAAGAGCTG CTGGAACTGG 
AGTACGATTT CCCGGGCGAC 
AAAGCGCTGG AAGGCGACGC 
TGGCCACCTG GATACCTATA 
CGTTCCTGCT GCCGATCGAA 
GTTGTTACCG GTCGTGTAGA 
TGAAATCGTT GGTATCAAAG 
AAATGTTCCG CAAACTGCTG 
62 



CGTGAGCACA 
GTTCCTGAAC 
TTGAGATGGA 
GACACCCCGA 
AGAGTGGGAA 
TCCCGGAACC 
GACGTATTCT 
GCGCGGTATC 
AAACCGCGAA 
GACGAAGGCC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 
ATCGAACGTG GTCAGGTACT GGCTAAGCCG 
CAAGTTCGAA TCTGAAGTGT ACATCCTGTC 
ATACTCCGTT CTTCAAAGGC TACCGTCCGC 
5 GACGTGACTG GCACCATCGA ACTGCCGGAA 
GGGCGACAAC ATCAAAATGG TTGTTACCCT 
ACGACG 



GTGGTATCAA ACGTGAAGAA 550 

GGCACCATCA ACCCGCACAC 600 

CAAAGACGAA GGCGGCCGTC 650 

AGTTCTACTT CCGTACTACT 7 00 

GGCGTAGAGA TGGTAATGCC 750 

GATCCACCCG ATCGCGATGG 8 00 

806 



10 



20 



-2-)-I-NF0RMAT-I0N -FOR-SEQ-I-D^NG: - 1-0-3 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 743 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 
'(B) STRAIN: ATCC 13883 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

GCAGACTCGT GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA 50 

TCATCGTGTT CCTGAACAAA TGCGACATGG TTGATGACGA AGAGCTGCTG 100 

GAACTGGTTG AGATGGAAGT TCGTGAACTG CTGTCTCAGT ACGATTTCCC 150 

30 GGGCGACGAC ACTCCGATCG TTCGTGGTTC TGCTCTGAAA GCGCTGGAAG 2 00 

GCGACGCAGA GTGGGAAGCG AAAATCATCG AACTGGCTGG CCACCTGGAT 2 50 

ACCTATATCC CGGAACCAGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC 3 00 

GATCGAAGAC GTATTCTCCA TCTCCGGTCG TGGTACCGTT GTTACCGGTC 350 

GTGTAGAGCG CGGTATCATC AAAGTAGGTG AAGAAGTTGA AATCGTTGGT 400 

35 ATCAAAGAAA CCGCGAAAAC CACCTGTACT GGCGTTGAAA TGTTCCGCAA 450 

ACTGCTGGAC GAAGGCCGTG CTGGTGAGAA CGTAGGTGTT CTGCTGCGTG 500 

GTATCAAACG TGAAGAAATC GAACGTGGTC AGGTACTGGC TAAGCCGGGC 550 

ACCATCAACC CGCACACCAA GTTCGAATCT GAAGTGTACA TCCTGTCCAA 600 

AGACGAAGGC GGCCGTCACA CTCCGTTCTT CAAAGGCTAC CGTCCGCAGT 650 

40 TCTACTTCCG TACTACTGAC GTGACTGGCA CCATCGAACT GCCGGAAGGC 700 

GTAGAGATGG TAATGCCGGG CGACAACATC AAAATGGTTG TTA 743 



45 2) INFORMATION FOR SEQ ID NO: 104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscl eroma ti s 

(B) STRAIN: ATCC 13884 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 

63 
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TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAGATGGAAG 150 

5 TTCGTGAACT RCTGTCTCAG TACGATTTCC CGGGCGACGA CACCCCGATC 200 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250 

GAAAATCATC GAACTGGCTG GCCACCTGGA TACCTATATC CCGGAACCAG 300 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC GCGGTATCAT 400 

10 CAAAGTAGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAA ACCGCGAAAA 450 

— GGAGGTGIAG— TGGGGT-TGAA -A-TGT-T-GGGGA-AAGTGGTGGA CGAAGGGCGT 50 0 

GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG CGGCCGTCAC 650 

15 ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCGAT CGCGATGGAC 800 

GACGGTCTGC GTTTCGCAA 819 



2 ) INFORMATION FOR SEQ ID NO : 105 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA TGGCCCTATG CCACAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

4 0 TTCCTGAACA AATGYGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 200 

ATACTCCAAT C AT CC"g"tGGT" ~T CTGlzT CTG A~ 'aAGCGCTGGA" AGGCGAl^V 2 b U 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GAACGTGCTA TCGATAAGCC GTTCCTGCTG CCAATCGAAG 350 

45 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCTATCAA 600 

50 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA CC 832 

55 



2) INFORMATION FOR SEQ ID NO: 106 
60 (i) SEQUENCE CHARACTERISTICS: 

64 
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(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 
10 (B) STRAIN: ATCC 33435 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACTC 50 

15 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

ACACTCCTAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGACGCT 2 50 

GAGTGGGAAG CAAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

20 CCCAGAACCA GAGCGTGCGA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 4 50 

CACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

25 CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCCATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCAA 8 00 

30 TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830 



35 



2 ) INFORMATION FOR SEQ ID NO: 107 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:" Genomic' DNA~ 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCTGAG 2 50 

TGGGAAGCGA AAATCATCGA ACTGGCGGGC TTCCTGGATT CTTACATCCC 3 00 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAACGC 4 00 

GGTATCATCA AAGTTGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGACAC 4 50 

CGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 
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GAAGAAATCG AACGTGGTCA GGTACTGGCT 
GCACACCAAG TTCGAATCTG AAGTGTACAT 
GCCGTCATAC TCCGTTCTTC AAAGGCTACC 
ACTACTGACG TGACTGGCAC CATCGAACTG 
5 AATGCCGGGC GACAACATCA AAATGGTTGT 
CGAAGGACGA AGGTCTGCGT TTCGCA 



AAGCCGGGTT CTATCAAGCC 600 
TCTGTCCAAA GACGAAGGCG 650 
GTCCGCAGTT CTACTTCCGT 700 
CCGGAAGGCG TTGAGATGGT 750 
TACCCTGATC CACCCGATCG 800 

826 



10 2) INFORMATION FOR SEQ ID NO: 108 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus casei subsp. casei 

(B) STRAIN: ATCC 3 93 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 



GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACATATCT TACTTTCACG 50 

TCAAGTTGGT GTTCCATACA TCGTTGTATT CATGAACAAA TGTGACATGG 100 

TTGACGATGA AGAATTACTA GAATTAGTTG AAATGGAAAT TCGTGATCTA 150 

TTAACTGAAT ATGAATTCCC TGGCGATGAC ATTCCTGTAA TCAAAGGTTC 200 

30 AGCTCTTAAA GCACTTCAAG GTGAAGCTGA CTGGGAAGCT AAAATTGACG 250 

AGTTAATGGA AGCTGTAGAT TCTTACATTC CAACTCCAGA ACGTGATACT 300 

GACAAACCAT TCATGATGCC AGTTGAGGAT GTATTCTCAA TCACTGGTCG 350 

TGGAACAGTT GCAACTGGAC GTGTTGAACG TGGACAAGTT AAAGTTGGTG 400 

ACGAAGTAGA AGTTATCGGT ATTGAAGAAG AGAGCAAAAA AGTAGTAGTA 450 

35 ACTGGAGTAG AAATGTTCCG TAAATYACTA GATTACGCTG AAGCTGGCGA 500 

CAACATTGGC GCACTTCTAC GTGGTGTTGC TCGTGAAGAT ATCCAACGTG 550 

GTCAAGTATT AGCTAAACCA GGTTCGATTA CTCCACACAC TAACTTCAAA 600 

GCTGAAACTT ATGTTTTAAC TAAAGAAGAA GGTGGACGTC ACACTCCATT 650 

CTTCAACAAC TACCGCCCAC AATTCTATTT CCGTACTACT GACGTAACTG 700 

40 GTATTGTTAC ACTTCCAGAA GGTACTGAAA TGGTAATGCC TGGTGATAAC 750 

ATTGAGCTTG CAGTTGANCT AATTGCACCA ATCGCTATCG AAGACGGTAC 800 

TAA ' »uY~ 



2) INFORMATION FOR SEQ ID NO: 109 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(B) STRAIN: ATCC 19435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 

66 
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CGGTGCAATC CTCGTTGTTG CTGCAACTGA TGGACCAATG CCACAAACTC 50 
GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC . 100 

TTCCTTAACA AGGCTGACCT TGTTGATGAT GAAGAATTGA TGGAACTCGT 150 

5 TGAAATGGAA GTTCGTGACC TCTTGAGCGA ATACGACTTC CCAGGTGACG 200 

ATATTCCTGT AATCGCTGGT TCAGCACTTG GTGCTTTGAA CGGTGAACCA 250 

CAATGGGTTG CTAAAGTTGA AGAATTGATG GACATCGTTG ATGAATACAT 300 

CCCAACTCCA GAACGCGACA CTGACAAACC ACTCCTTCTT CCAGTCGAAG 3 50 

ACGTATTCTC TATCACTGGT CGTGGTACAG TTGCTTCAGG ACGTATCGAA 4 00 

10 CGTGGTACTG TTAAAGTTGG TGACGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

AGAAACTAAA _AAAG.CT.GTT.G_ .TTACT.GGTAT-CGAAATGT-TC-GGTAAAAGAG 50 0 

TTACTGAAGG TCTTGCTGGT GATAACGTCG GTGCACTTCT CCGTGGTATC 550 

CAACGTGACG AAATCGAACG TGGTCAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CACTCCACAC AAACTTTTCG AAGGTGAAGT TTACGTATTG AGCAAAGAAG 650 

15 AAGGCGGACG TCACACTCCA TTCTTCGACA ACTACCGTCC TCAATTCTAC 7 00 

TTCCACACAA CTGACGTTAC TGGTTCAGTT AAACTTCCAG AAGGAACTGA 750 

AATGGTAATG CCTGGTGACA ACGTGCATAT CGACGTTGAA TTGATCCACC 8 00 

CAGTTGCGAT CGAACAAGGT ACTAC 825 

20 



2) INFORMATION FOR SEQ ID NO: 110 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 



35 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTTC ATCATCGTGT 100 

40 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGTGAACT YCTGTCCCAG TACGACTTCC CGGGCGACGA 200 

CACCCCAATC" GTTG3TGG^ AGCgH'GGAA GGCGAAliCAG " Z blf " 

AGTGGGAAGA GAAAATCATC GARCTGGCTG GCTACCTGGA TTCCTACATC 300 

CCAGAGCCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

4 5 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAARGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAGGAC 450 

ACTGCTAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCYATCAAG 600 

50 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCYA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACKACTGA CGTGACCGGT ACCATCGARC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCAATGGAC GATGGTCTGC GTTC 824 

55 



2) INFORMATION FOR SEQ ID NO: 111 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella micdadei 
10 (B) STRAIN: ATCC 33218 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

CGGAGCGATA TTAGTAGTAT CAGCAGCGGA TGGCCCAATG CCTCAAACGA 50 

15 GAGAGCACAT ACTYTTATCC CGSCAGGTAG GTGTTCCCTA TATAGTAGTG 100 

TTCTTAAACA AAGCTGACAT GGTGGATGAT GCGGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTACGCGAYT TGTTGAGCAG CTATGAATTT CCAGGAGATG 200 

AGATCCCGAT TGTAGTTGGT TCAGCATTAA AAGCATTGGA AGGCGATACG 250 

AGTGATATAG GTGTACCAGC GATTGAGAAG TTAGTTGAGA CGATGGATTC 300 

20 TTATATACCT GAGCCGGTAA GAAACATCGA TAAAAGTTTC TTGTTACCGA 350 

TCGAAGACGT GTTCTCAATA TCTGGACGAG GAACAGTAGT AACAGGACGT 400 

ATCGAAAGCG GGATCATCAA AGTTGGTGAG GAAGTCGAGA TTGTTGGTAT 450 

ACGTGACACT CAAAAGACGA CATGCACAGG CGTTGAAATG TTCCGTAAAT 500 

TACTTGACGA AGGTCGAGCT GGAGACAACG TTGGTATATT GCTACGTGGT 550 

25 ACGAAGCGGG ATGAAGTTGA ACGCGGACAA GTATTAGCTA AGCCGGGAAG 600 

CATTAAACCG CATACTAAAT TTGAAGCTGA AGTGTATGTG TTGTCAAAAG 650 

ATGAAGGTGG ACGTCATACC CCATTCTTTA ACGGATATCG GCCTCAATTT 700 

TACTTCAGGA CCACAGACGT AACTGGTTCT TGTGATTTAC CTGARGGTAT 750 

AGAAATGGTA ATGCCAGGTG ATAACGTCAA GCTGATTGTT AGCTTACACT 800 

30 CACCGATTGC TATGGACGAA GGTTTGCGTT TTGCAATC 838 



2) INFORMATION FOR SEQ ID NO: 112 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

50 CGGAGCGATA CTGGTTGTAT CAGCAGCTGA TGGTCCTATG CCACAAACGA 50 

GGGAACACAT TCTATTGTCT CGCCAGGTAG GTGTTCCATA TATTGTTGT.G ^ 100 

TTCATGAACA AAGCGGATAT GGTTGATGAC CCTGAGTTAT TAGAGTTAGT ' 150 

GGAAATGGAA GTGCGAGATT TATTAAGCAG TTACGATTTC CCAGGGGATG 200 

ACATACCTAT TGTTGTTGGT TCAGCTTTGA AAGCATTGGA AGGTGAAGAC 250 

55 AGTGATATAG GCGTTAAGGC TATTGAGAAA TTGGTTGAAA CAATGGATTC 300 

ATACATTCCT GAGCCAGTTA GAAACATAGA CAAGCCATTT TTGTTGCCGA 350 

TTGAAGACGT ATTTTCAATT TCTGGACGCG GAACAGTGGT AACTGGTCGT 400 

GTAGAGAGTG GAATTGTTAA AGTTGGTGAG GAAGTTGAAA TTGTTGGAAT 450 

AAGAGACACC CAAAAGACGA CTTGTACGGG TGTTGAGATG TTCCGTAAAT 500 

60 TACTTGATGA AGGTCGAGCT GGTGATAACG TTGGTGTGTT ATTACGAGGT 550 

68 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ACGAAGCGAG ATGAAGTGGA 
CATCAAGCCA CACACCAAGT 
AAGAAGGCGG ACGTCACACT 
TATTTCAGAA CCACTGACGT 
5 TGAAATGGTA ATGCCTGGAG 
CTCCGATTGC GATGGATGAA 



GCGTGGACAG GTATTGGCGA 
TTGAAGCAGA AGTGTATGTA 
CCATTCTTTA ATGGATACCG 
GACAGGTACT TGTGACTTGC 
ATAATGTGCA ATTAGTTGTT 
GGTTTAAGAT TCGCAATT 



AGCCAGGAAC 600 
TTATCCAAGG 650 
TCCACAATTC 700 
CATCAGGAGT 750 
AGCTTGCATG 800 
838 



10 2) INFORMATION FOR SEQ ID NO: 113 



15 



20 



25 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid , 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 

GTGCAATCCT GGTAGTAGCA GCGACTGACG GCCCGATGCC TCAGACTCGC 50 

GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA TCATCGTATT 100 

CCTGAACAAG TGCGATATGG TTGATGACGA AGAGCTGCTG GAGCTGGTTG 150 

ARATGGAAGT TCGCGAACTG CTGTCTCAGT ACGACTTCCC GGGCGACGAC 200 

ACTCCGGTAG TCCGCGGTTC AGCGCTGAAA GCGCTGGAAG GCGAAGCCGA 250 

GTGGGAARCG AAAATCATCG AGCTGGCAGG CCMTCTGGAT ACTTATATCC 300 

CAGAACCTGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC KATCGAAGAC 350 

GTATTCTCTA TCTCCGGCCG TGGTACCGTT GTTACCGGTC GTGTAGAGCG 400 

CGGCATCATC AAAGTCGGTG AAGAAGTGGA AATCGTCGGT ATCAAAGATA 450 

CCACCAAGAC CACCTGTACC GGCGTTGAAA TGTTCCGTAA GCTGCTGGAC 500 

GAAGGCCGTG CGGGCGAGAA CGTGGGCGTT CTGCTGCGCG GTACCAAGCG 550 

TGACGAAATC GAACGTGGTC AAGTTCTGGC CAAGCCGGGC ACCATCACTC 600 

CTCACACCCA GTTCGTGTCA GAAGTGTATA TCCTGAGCAA GGATGAAGGC 650 

GGCCGTCATA CTCCGTTCTT CAAAGGCTAC CGTCCTCAGT TCTACTTCCG 700 

TACGACTGAC GTGACAGGCA CCATCGAACT GCCGGAAGGC GTAGAGATGG 750 

TAATGCCAGG CGACAACATT CAGATGACCG TAAGTCTGAT TGCGCCGATC 800 

GCAATGGACG "aAGGTcYgCG* CTTCGCAA " « ^ * 



2) INFORMATION FOR SEQ ID NO: 114 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella richardii 

(B) STRAIN: ATCC 33998 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

69 
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GCTATCCTGG TTGTTGCTGC GACTGACGGC CCAATGCCTC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGCC AGGTAGGCGT TCCTTACATC ATCGTGTTCC 100 

TGAACAAGTG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

5 ATGGAAGTTC GTGAACTTCT GTCTCAATAC GACTTCCCGG GCGACGATAC 200 

GCCGGTTGTT CGCGGTTCAG CGCTGAAAGC GCTGGAAGGT GACGCYGAGT 250 

GGGAARCGAA AATCATTGAA CTGGCGGAAT CCTTRGATAC TTAYATTCCA 3 00 

GAGCCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCTA TCGAAGACGT 350 

TTTCTCTATC TCTGGCCGTG GTACTGTAGT CACCGGTCGT GTAGAGCGCG 400 

10 GCATCATCAA AGTTGGTGAA GAAGTGGAAA TCGTGGGAAT CAAAGACACC 450 
— ACCAAGACCA "CCTGTACTGG~CGTTGAAATG ~TTCCGTAAGC~TGCTGGACGA~ - - 500" 

AGGCCGTGCA GGTGAGAACG TTGGTGTTCT GCTGCGYGGT ACTAAGCGTG 550 

ACGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCAGGCAC CATCACTCCT 600 

CACACAGAAT TCGTGTCAGA AGTGTATATC CTGAGCAAGG ATGAAGGCGG 650 

15 YCGTCATACT CCGTTCTTCA AAGGCTACCG TCCTCAGTTC TACTTCCGTA 700 

CGACTGACGT GACCGGCACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 750 

ATGCCAGGCG ATAACATCCA GATGGTAGTT ACGCTGATTG CCCCAATCGC 800 

GATGGACGAA GGTCTGCGCT TCGCAA 826 



20 



2) INFORMATION FOR SEQ ID NO: 115 



( i ) SEQUENCE CHARACTERI STICS : 
25 (A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE : Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leptospira interrogans 

(B) STRAIN: ATCC 23581 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



TGCGGCGATT CTTGTAGTAT CCGCAACTGA CGGACCTATG CCACAAACAA 50 

AAGAACATAT CCTTCTTGCT CGTCAGGTAG GTGTTCCATA TGTAATTGTA 100 

40 TTCATTAACA AAGCAGATAT GCTTGCTGCT GACGAAAGAG CAGAAATGAT 150 

CGAAATGGTT GAGATGGACG TTCGTGAACT TCTCAATAAG TATAGCTTCC 2 00 

CAGGAGATAC AACTC^TATcT G TfCMXJ&riT ~C TGC^TAAA \{I{SaCTuImI ~ Z Mj" " 

GGCGATGAAT CTGAAATTGG GATGCCTGCA ATTCTCAAAT TGATGGAAGC 300 

TCTGGATACT TTCGTTCCAA ATCCAAAACG TGTAATCGAC AAACCTTTCC 350 

45 TTATGCCAGT AGAAGACGTT TTCTCGATCA CTGGTCGTGG AACTGTTGCA 4 00 

ACTGGAAGAG TGGAACAAGG TGTTTTGAAA GTGAACGACG AAGTTGAAAT 450 

TATCGGTATC CGCCCAACAA CAAAAACTGT TGTTACCGGT ATCGAAATGT 500 

TCAGAAAACT TCTCGATCAA GCGGAAGCTG GCGACAACAT CGGCGCTCTT 550 

CTTCGTGGAA CTAAAAAAGA AGAAATCGAA AGAGGGCAAG TTCTTGCGAA 600 

50 GCCAGGTTCT ATCACTCCTC ACAAAAAGTT TGCCGCTGAG GTGTATGTAT 650 

TAACTAAGGA TGAAGGCGGA CGTCATACTC CGTTTATCAA TAACTACCGT 700 

CCTCAGTTTT ACTTTAGAAC AACTGACGTA ACCGGAGTTT GTAACCTTCC 750 

TAATGGTGTC GAAATGGTTA TGCCTGGTGA TAACGTTTCT TTGACGGTTG 800 

AATTGATTAG CCCGATCGCA ATGGACAAGG GTCTTAAGTT CGC 843 

55 



2) INFORMATION FOR SEQ ID NO: 116 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Megamonas hypermegale 

(B) STRAIN: ATCC 2 5560 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

CGGTGCTATC CTCGTTGTTA GTGCTGCTGA TGGTCCTATG CCTCAGACTC 50 

GTGAACACAT CCTTCTCGCT CGTCAGGTTG GTGTTCCAGC TATCGTTGTA 100 

TTCCTCAACA AAGCTGACCA GGTTGATGAC CCTGAACTTC TCGAACTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCCAG CTATGACTTC CCAGGCGATG 200 

ACGTTCCAGT AATCACTGGT TCCGCTCTTC AGGCTCTCGA AGGCGACGAA 2 50 

GAAGCTAAAA AGAAAATTCT TGAATTAATG GATGCTGTTG ATGATTACAT 300 

CCCAACTCCA ACACGTGACA CTGATAAACC TTTCTTAATG CCAGTTGAAG 350 

ACGTATTCAC AATTACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGCGAAC TTAAACTTGG TGACAGCGTT GAAATCGTTG GTCTTTCCGA 450 

TGAAAAGAAA TCCACTACTG TAACTGGTAT CGAAATGTTC CGCAAAATGC 500 

TTGATAGCGC TGTTGCTGGT GATAACATCG GTGCACTTCT TCGTGGTATT 550 

GACCGTAAAG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CTGGCACAAT 600 

TCATCCACAC AAAAAATTCA AAGCTCAGGT TTACGTATTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCTCCA ACTATCGTCC ACAGTTCTAT 700 

TTCCGTACTA CTGACGTTAC TGGTGTTGTA ACTCTTCCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACATTGAAAT GAGCATCGAA CTCATCACTC 800 

CAATCGCTAT TGAAAAAGGT CTTCGCTTCG CT 832 



2) INFORMATION FOR SEQ ID NO: 117 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE "typeT Genomic* DNA" 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mitsuokella multacida 

(B) STRAIN: ATCC 27723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 



TGGTGCTATC 
GTGAGCACAT 
TTCCTCAACA 
CGAGATGGAA 
ACATCCCTGT 
GAGCAGAAGA 
CCCGACGCCG 
ATGTCTTCAC 
CGTGGTGAGC 
CGAGCCGCGT 
TTGATTTCGC 



CTCGTCGTTT 
CCTGCTCGCT 
AGGTTGACCA 
GTTCGCGAGC 
AATCGCTGGT 
AGAACATCCT 
GTCCGCGACA 
GATCACGGGC 
TCAAGATGAA 
CAGACGGTTG 
TGAGGCTGGC 



CCGCTGCTGA 
CGCCAGGTCG 
GGTTGACGAT 
TGCTCTCCAG 
TCCGCTCTGA 
CAAGCTCATG 
ACGCTAAGCC 
CGTGGTACGG 
CGATACGGTT 
TCACGGGCAT 
GATAACATCG 



TGGCCCGATG CCGCAGACGC 
GTGTTCCGGC AATCGTTGTC 
CCGGAGCTCC TCGAGCTCGT 
CTACGACTTC CCGGGCGATG 
AGGCCCTCGA AGGCGACGAA 
GAAGCTGTCG ATGAGTACAT 
GTTCCTGATG CCGGTCGAGG 
TTGCAACGGG CCGCGTTGAG 
GAGATCGTTG GTCTGCAGGA 
CGAGATGTTC CGCAAGATGC 
GTGCTCTGCT CCGTGGTATC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
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GACCGCAAGG AGATCGAGCG TGGCCAGGTT 
TCATCCGCAC ACGAAGTTCA AGGCTCAGGT 
AAGGCGGCCG TCATACGCCG TTCTTCACGA 
TTCCGCACGA CGGACGTAAC TGGCGTAGTC 
5 GATGGTTATG CCTGGCGATA ACGTCGAGAT 
CGATCGCTAT CGAGAAGGGC 



CTCGCAAAGC CGGGCACGAT 600 
CTATGTCCTG ACGAAAGAAG 650 
ACTATCGCCC GCAGTTCTAC 700 
AAACTGCCGG AAGGCACGGA 750 
GGAAGTTGAG CTCATCACCC 800 

820 



10 2) INFORMATION FOR SEQ ID NO: 118 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: MoJbiluncus curtisii subsp. holmesii 

(B) STRAIN: ATCC 35242 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 

25 

CGGCGCTATC CTCGTGGTGG CTGCTACTGA CGGTCCGATG GCTCAGACCA 50 

AGGAACACAT CCTGTTGGCT AAGCAGGTTG GCGTGCCCTC CATCCTGGTC 100 

GCTCTGAACA AGTGCGATTC TTCCGATGTG GACGAAGACA TGCTCGAAAT 150 

CGTCGAGGAC GAAATCCGCG ATGACCTGGA GAAGCAGGGC TTCGATCGTG 200 

30 ACTGCCCGAT TATCCACGTT TCCGCTCTGA AGGCCCTGGA AGGCGACCCC 2 50 

GAGTGGACCA AGAAGATTGA AGAGCTCATG GAAGCGGTCG ATACCTACAT 300 

TCCTGAGCCT GTTCGTGACC TCGACAAGCC GTTCTTGATG CCTATCGAAG 350 

ACGTCTTCAC CATTACTGGT CGCGGTACCG TAGTGACCGG TCGTGTGGAA 400 

CGCGGCAAGC TACCGTTGAA CGCCGAAGTG GAAATCGTAG GTATTCGTCC 450 

35 TACGCAAAAG ACCACCGTTA CCGGTATCGA AATGTTCCAC AAGTCCATGG 500 

ACGAAGCCTA CGCCGGCGAG AACTGTGGTC TGTTGCTGCG TGGCACCAAG 550 

CGTGAGGACG TTGAGCGCGG TCAGGTTGTC TGCATTCCTG GCTCCGTGAC 600 

CCCGCACACC AAGTTCGAGG GCAAGGTCTA CATCTTGAAG AAGGACGAAG 650 

GTGGACGTCA CAAGTCGTTC TACGACGGCT ACCGCCCGCA GTTCTTCTTC 700 

40 CGCACCACCG ACGTGACCGG TGTTATTCAC CTGCCCGAAG GCACCGAAAT 750 

GGTTATGCCT GGCGACACCA CCGAAATTAG CGTTGAGCTG ATTCAGCCTA 800 

tcgctatgga" gg^ggtctc" ggcttX^ctaT t " a J 1 



45 

2) INFORMATION FOR SEQ ID NO: 119 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 825 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

55 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsenais 

(B) STRAIN: ATCC 35017 



60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 
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GGTGCAATTC 
TGAGCACATC 
TCCTGAACAA 
5 GAAATGGAAG 
CACTCCAGTA 
AGTGGGAAGC 
CCAGAGCCAG 
CGTATTCTCA 

10 GTGGTATCGT 

ACCGTGAAAA 

CGAAGGCCGT 
GTGATGATAT 
CCGCATACAA 

15 TGGCCGTCAT 
GTACAACTGA 
GTAATGCCAG 
TGCAATGGAT 



TGGTTGTTGC 
CTGTTAGGTC 
ATGTGACATG 
TCCGTGAGCT 
ATCCGTGGTT 
TAAAATCATT 
AGCGTGACAT 
ATTTCAGGCC 
TAAAGTCGGT 
_CAACATGTAC- 
GCTGGTGAGA 
CGAACGTGGT 
CTTTCGAATC 
ACTCCATTCT 
CGTAACCGGT 
GTGATAACAT 
GCAGGTCT GC 



TGCAACTGAT 

GTCAGGTTGG 

GTAGACGACG 

GCTGTCTCAG 

CAGCGCTGAA 

GAACTGGCAG 

TGATAAGCCA 

GTGGTACAGT 

GAAGAAGTTG 

_TGG.CGT-T.GAA_ 

ACGTTGGTGT 

CAAGTATTGG 

AGAAGTTTAC 

TCAAAGGTTA 

ACTATCGAAC 

CAAAATGATC 

GTTTT 



GGCCCTATGC 
CGTTCCATAC 
AAGAGCTGTT 
TACGATTTCC 
AGCTCTGGAA 
AAGCACTGGA 
TTCCTGTTAC 
TGTTACTGGT 
AAATCGTTGG 
ATGT-TCCGTA- 
TCTGCTGCGT 
CTAAACCAGG 
ATCCTGAGCA 
CCGTCCACAG 
TGCCAGAAGG 
GTTACTCTGA 



CACAGACTCG 

ATCATCGTTT 

AGAACTGGTT 

CAGGCGATGA 

GGCGAAGCTG 

TTCTTATATC 

CAATCGAAGA 

CGTGTTGAGC 

TATCAAAGAT 

AACTGG-TGGA- 

GGTACTAAAC 

TTCAATCACT 

AAGATGAAGG 

TTCTACTTCC 

CGTTGAGATG 

TCCACCCAAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
-500 
550 
600 
650 
700 
750 
800 
825 



20 



2) INFORMATION FOR SEQ ID NO: 120 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN: ATCC 43628 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 



TGGTGCTATC TTGGTTGTTT CTGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCATAT CCTACTATCT CGTCAGGTTG GTGTACCATA CATCATGGTA 100 

40 TTCATGAACA AGTGCGATAT GGTTGATGAT GAAGAGCTAC TAGAATTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTATCTGA CTATGATTTC CCTGGTGATG 200 

ATACCCCAAT ^ AT C AAAG G T~ ~T CA^CACTAG TGGl'lxFi'UAl 7 Z5U 

GGTAAATATG GCGAGCCTGC AGTTCTAGAA CTGCTAGACA CACTAGACAG 300 

CTATATCCCA GAGCCTGAGC GTGATATCGA TAAGTCATTC TTGATGCCAA 350 

45 TTGAAGATGT CTTCTCGATC TCAGGTCGTG GTACAGTTGT GACTGGTCGT 400 

GTTGAATCAG GTATTATTAA AGTTGGTGAT GAAATTGAAA TCATCGGTAT 450 

CAAACCAACT GCTAAAACCA CCTGTACTGG TGTTGAAATG TTCCGTAAAC 500 

TGTTAGACGA AGGTCGTGCA GGTGAGAACT GTGGTATCTT GTTGCGTGGT 550 

ACTAAGCGTG AAGAAGTTCA ACGCGGTCAA GTACTTGCAA AACCAGGTTC 600 

50 AATCACCCCA CATACTAAGT TTGATGCTGA AGTTTATGTA CTGTCAAAAG 650 

AAGAAGGTGG TCGTCACACC CCATTCTTAA ATGGCTATCG CCCACAGTTC 700 

TACTTCCGTA CCACAGATGT GACTGGTGCC ATCACTCTAC AAGAAGGTAC 750 

CGAAATGGTT ATGCCTGGTG ACAATGTTGA GATGAGTGTT GAGCTTATCC 800 

ACCCAATCGC CAGGATAAAG GTCTACG 827 

55 



2) INFORMATION FOR SEQ ID NO: 121 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

CGGCGCTATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACCC 50 

GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTA 100 

TTCCTGAACA AATGTGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCTGGCGACG 2 00 

ACACGCCAAT CGTTCGCGGT TCAGCGCTGA AAGCACTGGA AGGCGAGCCA 250 

GAGTGGGAAG CTAARATCGT TGAACTGGCA GGTTTCCTGG ATTCTTACAT 300 

CCCTGAGCCA GAGCGTGCAA TTGACAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC AATCTCCGGC CGTGGTACCG TTGTTACCGG TCGTGTTGAG 400 

CGCGGTATCA TCAAGGTTGG TGAGGAAGTT GAAATCGTGG GTATCAAAGA 4 50 

TACTGCGAAA ACCACCTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCMGGTGAG AACGTCGGTG TTCTGCTGCG TGGTACCAAG 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAACCAG GTTCAATCAA 600 

ACCACAYACC AAATTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 700 

CGTACCACAG ACGTAACAGG TACTATCGAA CTGCCGGAAG GCGTTGAAAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGAT CGTCACCCTG ATCCACCCAA 800 

TCGCAA 806 



2) INFORMATION FOR SEQ ID NO: 122 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: TB 2 99 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 



TGGTGGTCGC 
CTGCTGGCGC 
GGCCGACGCA 
TCCGCGAGCT 
CGGGTCTCGG 
TGTCGAGGAA 
GCGAGACCGA 
ACCGGCCGCG 
CGTGAACGAG 
CCACCGTCAC 
GCGGGCGACA 



CGCCACCGAC GGCCCGATGC 
GTCAAGTGGG TGTGCCCTAC 
GTGGACGACG AGGAGCTGCT 
GCTGGCTGCC CAGGAATTCG 
CGCTCAAGGC GCTCGAGGGT 
CTGATGAACG CGGTCGACGA 
CAAGCCGTTC CTGATGCCGG 
GAACCGTGGT CACCGGACGT 
GAAGTTGAGA TCGTCGGCAT 
CGGTGTGGAG ATGTTCCGCA 
ACGTTGGTTT GCTGCTGCGG 



CCCAGACCCG 
ATCCTGGTAG 
CGAACTCGTC 
ACGAGGACGC 
GACGCGAAGT 
GTCGATTCCG 
TCGAGGACGT 
GTGGAGCGCG 
TCGCCCATCG 
AGCTGCTCGA 
GGCGTCAAGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 



74 



SUBSTITUTE SHEET (RULE 26) 
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GCGAGGACGT CGAGCGTGGC CAGGTTGTCA 
CCGCACACCG AGTTCGAAGG CCAGGTCTAC 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA 
GCACCACCGA CGTGACCGGT GTGGTGACAC 
5 GTGATGCCCG GTGACAACAC CAACATCTCG 
CGCCATGGAC GAAGGTCTGC GTTTC 



CCAAGCCCGG CACCACCACG 600 
ATCCTGTCCA AGGACGAGGG 650 
CCGTCCGCAG TTCTACTTCC 700 
TGCCGGAGGG CACCGAGATG 750 
GTGAAGTTGA TCCAGCCCGT 800 

825 



10 2) INFORMATION FOR SEQ ID NO: 123 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 



CGGTGCGATC TTGGTATGTT CCGCAGCTGA CGGTCCTATG CCGCAAACTC 50 

GCGAACACAT CCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAGCTGT TGGAGCTGGT 150 

TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCAGGTGACG 200 

30 ACTGCCCGAT CGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGACGCA 250 

GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 300 

CCCAACACCT GAGCGTGCAG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350 

ACGTATTCTC TATTTCCGGT CGCGGTACAG TAGTAACCGG TCGTGTAGAG 400 

CGCGGTATCA TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 450 

35 AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 50 0 

ACGAAGGTCA AGCTGGTGAC AACGTAGGTG TATTGCTGCG TGGTACTAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACTATCAC 600 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 650 

GTGGTCGTCA CACTCCGTTC TTCGCTAACT ACCGTCCACA ATTCTACTTC 700 

40 CGTACTACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750 

GGTAATGCCG GGTGAGAACG TAACCATTAC TGTAGAACTG ATTGCGCCTA 800 

tcgcta" auV " 



45 



55 



2) INFORMATION FOR SEQ ID NO: 124 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 25295 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 

75 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



CGGCGCAATC 
GCGAACACAT 
TTCATGAATA 
5 TGAAATGGAA 
ACTGCCCGAT 
GCTTACGAAG 
CCCGACACCT 
ACGTATTCTC 
10 CGCGGTATCA 
AAGGGAAAAA- 



ACGAAGGTCA 
CGTGAAGAAG 
TCCTCACACC 
15 GTGGTCGTCA 
CGTACTACCG 
GGTTATGCCT 
TCGCTATGGA 



TTGGTATGTT 
CCTGTTGGCC 
AATGCGACAT 
ATCCGTGACT 
CGTACAAGGT 
AAAAAATCTT 
GAGCGTGCCG 
TATCTCCGGC 
TCCACGTCGG 
-AGGAGT-TGTA- 
AGCAGGTGAC 
TGGAACGCGG 
AAATTCAAAG 
TACTCCGTTC 
ACGTAACCGG 
GGTGAGAACG 
AGAAGGTCTG 



CCGCTGCYGA 
CGCCAAGTAG 
GGTTGAYGAT 
TGCTGTCAAG 
TCCGCACTGA 
CGAACTGGCT 
TGGACAAACC 
CGTGGTACAG 
TGACGAGATC 
-GGGGTGT-TGA- 
AACGTAGGCG 
TCAAGTATTG 
CAGAAGTTTA 
TTCGCTAACT 
TGCGGTTACT 
TGGCCATCAC 
CG 



CGGTCCTATG 
GCGTACCTTA 
GCCGAACTGC 
CTACGACTTC 
AAGCCTTGGA 
GCTGCATTGG 
GTTCCTGTTG 
TAGTAACCGG 
GAAATCGTAG 
-AA-TGT-TGGGC- 
TATTGCTGCG 
GCTAAACCGG 
CGTATTGAGC 
ACCGTCCACA 
TTGGAAGAAG 
TGTAGAACTG 



CCGCAAACTC 
CATCATCGTG 
TGGAACTGGT 
CCAGGCGACG 
AGGCGACGCA 
ACAGGTACAT 
CCTATCGAAG 
TCGTGTAGAG 
GTCTGAAAGA 
-AAAGTGGTGG- 
CGGTACCAAA 
GTACCATCAC 
AAAGAAGAGG 
ATTCTACTTC 
GTGTAGAAAT 
ATTGCACCTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
-500- 
550 
600 
650 
700 
750 
800 
822 



20 



2) INFORMATION FOR SEQ ID NO: 125 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 



CGGCGCGACT TGGTATGTTC CGCAGCTGAC GGTCCTATGC CGCAAACCCG 50 

CGAACACATC CTGTTGGCTC GCCAAGTAGG TGTACCTTAC ATCATCGTAT 100 

40 TCATGAACAA ATGCGACATG GTAGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TTCGTGACTT GTTGTCAAGC TACGACTTCC CAGGCGACGA 200 

CTGCCCAATCf GTA^f AAGGTT~"c Tb^ACTCAA AGClTT^TuC GGl^l^TlT ~ Z Slf " 

CTTACGAAGA AAAAATCTTC GAATTGGCTG CTGCCTTGGA CAGCTACATC 3 00 

CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATCGAAGA 350 

45 CGTATTCTCT ATCTCTGGTC GTGGTACAGT AGTAACCGGT CGCGTAGAGC 400 

GCGGTATCAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450 

ACTCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAA GCAGGTGACA ACGTAGGCGT ATTGCTGCGT GGTACTAAAC 550 

GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAACCAGG TACCATCACT 600 

50 CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 650 

TGGTCGTCAC ACTCCATTTT TCGCTAACTA CCGTCCACAA TTCTACTTCC 700 

GTACTACCGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG CGTAGAAATG 750 

GTAATGCCAG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCAAT 800 

CGCTATGGAA GAAGTCTGCG 820 

55 



2) INFORMATION FOR SEQ ID NO: 126 
60 (i) SEQUENCE CHARACTERISTICS : 

76 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(A) LENGTH: 830 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 
10 (B) STRAIN: ATCC 49226 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 

GGTGCAATCC TGGTATGTTC TGCTGCCGAC GGCCCTATGC CGCAAACCCG 50 

15 CGAACACATC CTGCTGGCCC GTCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

TCATGAACAA ATGCGACATG GTCGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TCCGCGACCT GCTGTCCAGC TACGACTTCC CCGGCGACGA 200 

CTGCCCGATC GTACAAGGTT CCGCACTGAA AGCCTTGGAA GGCGATGCCG 2 50 

CTTACGAAGA AAAAATCTTC GAACTGGCTA CCGCATTGGA CAGCTACATC 300 

20 CCGACTCCCG AGCGTGCCGT GGACAAACCA TTCCTGCTGC CTATCGAAGA 350 

CGTGTTCTCC ATTTCCGGCC GCGGTACCGT AGTCACCGGC CGTGTAGAGC 4 00 

GAGGTATCAT CCACGTTGGT GACGAGATTG AAATCGTCGG TCTGAAAGAA 450 

ACCCAAAAAA CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAG GCGGGCGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

25 GTGAAGACGT AGAACGCGGT CAGGTATTGG CCAAACCGGG TACTATCACT 600 

CCTCACACCA AGTTCAAAGC AGAAGTGTAC GTATTGAGCA AAGAAGAGGG 650 

CGGCCGCCAT ACCCCGTTTT TCGCCAACTA CCGTCCCCAA TTCTACTTCC 700 

GTACCACTGA CGTAACCGGC GCGGTTACTT TGGAAAAAGG TGTGGAAATG 750 

GTAATGCCGG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 800 

30 CGCTATGGAA GAAGGTCTGC GCTTTGCGAT 830 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 

50 CGGCGCAATC TTGGTATGTT CCGCCGCCGA CGGCCCTATG 
GCGAACACAT TCTGTTGGCC CGCCAAGTAG GTGTACCTTA 
TTCATGAACA AATGCGATAT GGTCGACGAT GCCGAGCTGT 
TGAAATGGAA ATCCGCGACC TGCTGTCAAG CTACGACTTC 
ACTGCCCAAT CGTACAAGGT TCCGCACTGA AAGCTTTGGA 

55 GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG 
CCCGACTCCC GAGCGTGCCG TGGACAAACC GTTCCTGCTG 
ACGTATTCTC CATCTCCGGC CGCGGTACGG TAGTAACCGG 
CGCGGTGTCA TCCACGTTGG CGACGAGATC GAAATCGTCG 
AACCCAAAAA ACCACCTGTA CCGGTGTCGA GATGTTCCGC 

60 ACGAAGGTCA GGCAGGCGAC AACGTAGGCG TATTGCTGCG 

77 



CCGCAAACCC 
CATCATCGTA 
TGGAACTGGT 
CCAGGCGACG 
AGGCGATGCC 
ACAGCTACAT 
CCTATCGAAG 
CCGTGTAGAG 
GTCTGAAAGA 
AAACTGCTGG 
CGGTACCAAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
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CGTGAAGAAG TGGAACGCGG TCAGGTATTA 
TCCGCACACC AAGTTCAAAG CAGAAGTGTA 
GCGGTCGTCA CACTCCGTTC TTCGCCAACT 
CGTACCACCG ACGTAACCGG CGCGGTTACT 
5 GGTAATGCCC GGTGAGAACG TAACCATTAC 
TCGCTATGGA AGAAGG 



GCCAAACCGG GTACCATCAC 600 
TGTATTGAGC AAAGAAGAGG 650 
ACCGTCCGCA ATTCTACTTC 700 
TTGGAAGAAG GCGTGGAAAT 750 
TGTAGAACTG ATTGCGCCTA 800 

816 



10 2) INFORMATION FOR SEQ ID NO: 128 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: ATCC 13077 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 



CGGTGCAATC CTGGTATGTT CCGCAGCCGA CGGTCCTATG CCGCAAACCC 50 

GCGAACACAT CCTGCTGGCC CGTCAAGTAG GCGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTCGACGAT GCCGAGCTGT TGGAACTGGT 150 

TGAAATGGAA ATCCGCGACC TGCTGTCCAG CTACGACTTC CCCGGCGACG 200 

30 ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCCTTGGA AGGCGATGCC 2 50 

GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 3 00 

CCCGACTCCC GAGCGTGCCG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350 

ACGTATTCTC TATTTCCGGT CGTGGTACAG TAGTAACCGG TCGTGTAGAG 400 

CGCGGTATCA TCCACGTCGG TGACGAGATC GAAATCGTCG GTCTGAAAGA 4 50 

35 AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA AGCAGGCGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACAATCAC 600 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 650 

GCGGCCGCCA TACCCCGTTC TTCGCCAACT ACCGTCCCCA ATTCTACTTC 700 

40 CGTACCACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTGGAAAT 750 

GGTAATGCCG GGCGAGAACG TAACCATCAC CGTAGAACTG ATTGCGCCTA 800 

tcgctatgga" "agaaggtitcT cgcttt^ga" "t ~ a/i * 



2) INFORMATION FOR SEQ ID NO: 129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 

78 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CGGCGCAATC 
GYGAACACAT 
TTCATGAACA 
5 TGAAATGGAA 
ACTGCCCGAT 
GCTTACGAAG 
CCCGACTCCC 
ACGTATTCTC 
10 CGCGGTGTTA 

AACCCAAAAA" 

ACGAAGGTCA 
CGTGAAGAAG 
TCCGCACACC 
15 GTGGTCGTCA 
CGTACTACCG 
GGTTATGCCT 
TYGCTATGGA 



TTGGTATGTT 
CCTGTTGGCC 
AATGCGACAT 
ATCCGTGACT 
TGTACAAGGT 
AAAAAATCTT 
GAGCGTGCCG 
CATCTCCGGT 
TCCACGTTGG 
"ACCACATGTA" 
AGCCGGTGAC 
TGGAACGCGG 
AAATTCAAAG 
TACTCCGTTC 
ACGTAACCGG 
GGTGAGAAYG 
AGAAG 



CTGCTGCYGA 

CGTCAAGTAG 

GGTTGACGAT 

TGCTGTCAAG 

TCTGCACTGA 

CGAACTGGCT 

TAGACAAACC 

CGTGGTACAG 

TGACGAGATC 

CCGGTGTTGA- 

AACGTAGGCG 

TCAAGTATTG 

CAGAAGTGTA 

TTCGCTAACT 

TGCGGTTACT 

TAGCCATYAC 



CGGTCCTATG 

GYGTACCTTA 

GCCGAAYTGT 

CTACGACTTC 

AAGCCTTGGA 

GCCGCATTGG 

GTTCCTGTTG 

TAGTAACCGG 

GAAATCGTAG 

-AATGTTCCGC- 

TATTGCTGCG 

GCTAAACCGG 

CGTATTGAGC 

ACCGTCCTCA 

TTGGAAGAAG 

TGTAGAACTG 



CCGCAAACCC 

CATCATCGTG 

TGGAACTGGT 

CCTGGYGACG 

AGGCGATGCC 

ACAGCTACAT 

CCTATCGAAG 

CCGTGTAGAG 

GTCTGAAAGA 

-AAACTGCTGG- 

CGGTACCAAA 

GTACCATCAC 

AAAGAAGAGG 

ATTCTACTTC 

GTGTAGAAAT 

ATTGCGCCTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
-500- 
550 
600 
650 
700 
750 
800 
815 



20 



2) INFORMATION FOR SEQ ID NO: 130 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double. 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 



GGCGCAATCT TGGTATGTTC CGCTGCTGAC GGTCCTATGC CGCAAACCCG 50 

CGAACACATC CTGTTGGCCC GCCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

40 TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TCCGTGACTT GCTGTCAAGC TACGACTTCC CTGGTGACGA 2 00 

CTGCCCGATC GTACAAGGTT CTGCACTGAA AGCCTTGGAA "gGCGACGCCG" ~ 2bo' 

CTTACGAAGA AAAAATCTTC GAACTGGCTG CTGCATTGGA CAGCTACATC 3 00 

CCGACTCCTG AGCGTGCCGT GGACAAACCG TTCCTGTTGC CTATTGAAGA 3 50 

45 CGTATTCTCC ATCTCCGGTC GCGGTACCGT AGTAACCGGC CGTGTAGAGC 4 00 

GCGGTGTTAT CCACGTTGGT GACGAGATTG AAATCGTAGG TCTGAAAGAA 4 50 

ACCCAAAAAA CCACTTGTAC CGGTGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAA GCCGGTGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

GTGAAGAAGT GGAACGCGGT CAAGTATTGG CTAAACCGGG TACCATCACT 6 00 

50 CCTCACACTA AATTCAAAGC AGAAGTTTAC GTATTGAGTA AAGAAGAGGG 6 50 

TGGTCGTCAT ACTCCGTTCT TCGCTAACTA CCGTCCTCAA TTCTACTTCC 7 00 

GTACTACCGA CGTAACCGGC GCGGTTACTT TGGAAGAAGG TGTAGAAATG 750 

GTTATGCCTG GTGAGAACGT AGC CATC ACT GTAGAACTGA TTGCACCGAT 8 00 

CGCTATGGAA GAAGGTCTGC GCTTTGCGA 829 

55 



2) INFORMATION FOR SEQ ID NO: 131 
60 (i) SEQUENCE CHARACTERISTICS: 

79 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



10 



(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria subflava 

(B) STRAIN: ATCC 14221 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 



CGGCGCGACT 

15 CGAACACATC 
TCATGAACAA 
GAAATGGAAA 
CTGCCCAATC 
GTTACGAAGA 

20 CCAACACCTG 
CGTATTCTCT 
GCGGTATCAT 
ACCCAAAAAA 
CGAAGGTCAA 

25 GTGAAGACGT 
CCTCACACCA 
TGGTCGTCAC 
GTACTACTGA 
GTAATGCCAG 

30 CGCTATGGAA 



TGGTATGTTC 
CTGTTGGCTC 
ATGCGACATG 
TCCGTGACCT 
GTACAAGGTT 
GAAAATCTTC 
AGCGTGCTGT 
ATCTCTGGCC 
CCACGTTGGT 
CCACTTGTAC 
GCTGGTGACA 
AGAGCGTGGT 
AATTCAAAGC 
ACTCCATTCT 
CGTAACTGGT 
GTGAGAACGT 
GAAG 



CGCAGCTGAT 
GCCAAGTAGG 
GTTGACGATG 
GTTGTCAAGC 
CTGCACTGAA 
GAATTGGCTG 
GGACAAACCT 
GTGGTACAGT 
GACGAGATCG 
CGGCGTTGAA 
ACGTAGGCGT 
CAAGTATTGG 
AGAAGTATAC 
TCGCTAACTA 
GCAGTTACTT 
AACCATTACT 



GGTCCTATGC 
TGTACCTTAC 
CCGAGCTGTT 
TACGACTTCC 
AGCTTTGGAA 
CTGCTCTGGA 
TTCTTGTTGC 
AGTAACTGGT 
AAATCGTAGG 
ATGTTCCGCA 
ATTGCTGCGT 
CTAAACCAGG 
GTACTGAGCA 
CCGTCCACAA 
TGGAAGAAGG 
GTAGAACTGA 



CTCAAACTCG 
ATCATCGTAT 
GGAACTGGTT 
CAGGCGACGA 
GGTGACGCTG 
CAGCTACATC 
CTATCGAAGA 
CGTGTAGAGC 
TCTGAAAGAA 
AACTGCTGGA 
GGTACCAAAC 
TACCATTACT 
AAGAAGAGGG 
TTCTACTTCC 
CGTAGAAATG 
TTGCGCCTAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
814 



35 



2) INFORMATION FOR SEQ ID NO: 132 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii ) MOLECULE TYPE:" Genomic' DNA" 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

50 GCCATCTTGG TATGTTCTGC TGCTGACGGT CCTATGCCGC AAACCCGTGA 50 

GCACATCCTG TTGGCTCGTC AAGTAGGTGT ACCCTACATC ATCGTATTCA 100 

TGAACAAATG CGATATGGTT GATGATGCAG AGCTGCTGGA ATTGGTAGAA 150 

ATGGAAATCC GTGATCTGCT GAGCAGCTAC GATTTCCCTG GCGATGATTG 2 00 

YCCAATCGTG CAAGGTTCTG CTTTGAAAGC TTTGGAAGGT GATGCCGCTT 250 

55 ACGAAGAAAA AATCTTTGAA TTAGCTGCTG CATTGGATAG CTATATTCCA 300 

ACWCCTGAGC GYGCTGTTGA TAAACCATTC CTGTTGCCGA TTGAAGATGT 350 

ATTCTCAATT TCAGGTCGTG GTACAGTAGT AACTGGTCGT GTAGAGCGCG 4 00 

GTATTATTCA CGTAGGCGAT GAAATTGAAA TTGTAGGTTT GAAAGARACY 4 50 

CAAAAAACTA CTTGTACCGG CGTTGAAATG TTCCGTAAAT TGCTGGATSA 500 

60 AGGTCAGGCT GGTGATAACG TAGGCGTATT GTTGCGTGGT ACCAAACGTG 550 

80 



SUBSTITUTE SHEET (RULE 26) 
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AAGACGTTGA GCGTGGTCAA 
CAYACCAAAT TCAAAGCAGA 
TCGTCATACT CCGTTCTTCG 
CTACAGACGT TACCGGTGCK 
5 ATGCCTGGTG AGAAYGTTGC 
KATGGAAGAA GGYTGCGT 



GTATTGGCTA AGCCTGGTWC 
RGTKTATGTW TTGAGYAAGG 
CTAACTATCG TCCGCAATTC 
GTRACTTTAG AAGAAGGTGT 
CATTACTGTW GARYTGATYG 



TATTACTCCG 600 
AAGAAGGCGG 650 
TATTTCCGTA 700 
GGAAATGGTA 750 
CTCCGATTGC 800 
818 



10 2) INFORMATION FOR SEQ ID NO: 133 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid , 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ochrobactrum anthropi 

(B) STRAIN: ATCC 49188 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 



CGGCGCAATT CTGGTTGTTT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTCGCT CGTCAGGTTG GCGTTCCGGC AATCGTCGTG 100 

TTCCTGAACA AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 

TGAACTGGAA GTTCGCGAAC TTCTGTCGAA ATACGATTTC CCGGGCGACG 200 

30 AAGTTCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCG 250 

AAGGAACTGG GCGAAGACGC CGTTCGTTCG CTGATGGCCG CTGTTGACGA 3 00 

CTACATTCCG ACCCCGGAAC GTCCGATCGA CCAGCCGTTC CTGATGCCGA 350 

TCGAAGACGT TTTCTCGATC TCGGGCCGTG GTACGGTTGT GACGGGTCGC 4 00 

GTTGAGCGCG GTATCGTCAA GGTTGGTGAA GAAGTTGAAA TCGTCGGCAT 450 

35 CAAGGCGACG GCGAAGACGA CGGTAACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTCGAYCA GGGCCAGGCT GGCGACAACA TCGGCGCTCT GATCCGCGGC 550 

GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

TGTGAAGCCG CACACCAAGT TCAAGGCAGA AGCCTACATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTTA CGAACTACCG TCCGCAGTTC 700 

4 0 TACTTCCGCA CGACGGACGT GACCGGTGTT GTCACGCTGC CGGAAGGCAC 750 

GGAAATGGTT ATGCCTGGCG ACAACGTCGC TATGGACGTC ACCCTGATCG 800 

TGCCGATCGC CATGGJ^GAG AAGCTCCGcY TCGCTA b^b 



45 



55 



2) INFORMATION FOR SEQ ID NO: 134 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 805 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 

(B) STRAIN: ATCC 27155 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 

81 
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CCTGGTTGTT GCTGCGACTG ATGGCCCAAT GCCACAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT ACATCATCGT GTTCCTGAAC 100 

AAGTGTGACA TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAGATGGA 150 

5 AGTACGTGAC CTGCTGTCAC AGTACGACTT CCCAGGCGAT GACACCCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGTTCC TGAGTGGGAA 250 

GCAAAAATCG TTGAGCTGGC TGAACACCTG GACAACTACA TCCCGGATCC 300 

AGTCCGTGCG ATCGACATGC CGTTCCTGCT GCCAATCGAA GACGTATTCT 350 

CAATCTCTGG CCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGCATC 400 

10 GTTAAAGTCG GCGACGAAGT TGAAATCGTG GGTATCAAAG ATACTGCGAA 450 

ATCAACCTGT" ACCGGTGTTG— AGATGTTCCG- TAAGGTGCTG-GACCAGGGTC 500 

AGGCAGGCGA AAACTGTGGT GTTCTGCTGC GCGGTATCAA GCGTGAAGAC 550 

ATCCAGCGTG GCCAGGTTCT GGCTAAGCCA GGCTCAATCA AGCCGCACAC 600 

CCAGTTCGAG TCAGAAGTTT ACGTTCTGTC TAAAGACGAA GGTGGCCGCC 650 

15 ATACTCCGTT CTTCAAAGGC TATCGTCCAC AGTTCTACTT CCGTACAACT 700 

GATGTAACCG GTTCAGTAGA GCTGCCAGAA GGCGTTGAGA TGGTCATGCC 750 

AGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCAATGG 800 

ACGAA 805 

20 



2) INFORMATION FOR SEQ ID NO: 135 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea dispersa 

(B) STRAIN: ATCC 1^589 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 



CGCGATCCTG GTTGTTGCTG CGACTGATGG CCCAATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGCCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

40 CTGAACAAGT GTGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

GATGGAAGTT CGCGATCTGC TGTCTCAGTA CGACTTCCCA GGCGACGATA 200 

ccccaat^^^ lxii?M3i£M&r ugal'gUi^LT 

TGGGAAGCGA AAGTCGTTGA GCTGGCTGGT CACCTGGATA CTTACATTCC 300 

AGATCCAGTA CGTGCTATCG ATCTGCCGTT CCTGCTGCCA ATCGAAGACG 350 

45 TATTCTCAAT CTCTGGCCGT GGTACCGTTG TTACCGGTCG TGTTGAGCGC 400 

GGCATCGTGA AAGTGGGCGA CGAAGTAGAA ATCGTTGGTA TCAAAGCGAC 450 

TGCCAAGTCT ACCTGTACCG GTGTTGAAAT GTTCCGCAAA CTGCTGGACC 500 

AGGGTCAGGC AGGCGAGAAC TGTGGTGTTC TGCTGCGCGG TATCAAGCGT 550 

GAAGAGATCC AGCGTGGTCA GGTTCTGGCT AAGCCAGGCA CCATCAAGCC 600 

50 ACACACCAAG TTCGTATCAG AAGTGTACGT ACTGTCTAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT 700 

ACYACTGATG TGACCGGCAM CATMGAACTG CCAGAAGGCG TTGAGATGGT 750 

AATGCCAGGC GACAACATCA AAATGRCCGT TGAGCTGATC CACCCAATCG 800 

CGATGGACCA GGGTCTGCGT TTCGC 825 

55 



2) INFORMATION FOR SEQ ID NO: 136 
60 (i) SEQUENCE CHARACTERISTICS: 

82 
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(A) LENGTH: 762 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 
10 (B) STRAIN: NCTC 10322 



(xi')"SEQUENCE~DESCRIPTrON: SEQ ID NO: 136 



CACAAACACG TGAGCACATC CTTTTAGGTC GCCAAGTAGG CGTTCCTTAC 50 

15 ATCATCGTAT TCTTAAACAA ATGCGACATG GTGGATGATG AAGAATTATT 100 

AGAATTAGTT GAAATGGAAG TGCGTGAACT TCTTTCTCAA TATGATTTCC 150 

CAGGTGATGA TACACCAATC GTACGTGGTT CAGCGTTACA AGCGTTAAAC 2 00 

GGYGTAGCTG AGTGGGAAGA GAAAATTCTT GAGTTAGCCA ACCACTTAGA 250 

TACTTACATT CCAGAGCCAC AACGTGCAAT CGACCAACCG TTCCTTCTTC 300 

20 CGATTGAAGA CGTGTTCTCA ATTTCTGGTC GTGGTACAGT AGTAACAGGT 350 

CGTGTTGAGC GTGGTATCAT CCGTACAGGT GAAGAGGTTG AAATTGTTGG 400 

TATTAAAGCG ACAACGAAGA CCACAGTAAC AGGTGTTGAG ATGTTCCGTA 4 50 

AATTATTAGA CGAAGGTCGT GCGGGTGAGA ACGTTGGTGC TTTATTACGT 500 

GGTACTAARC GTGAAGAAAT CGAACGTGGT CAAGTGTTAG CGAAACCGGG 550 

25 TTCAATYACG CCACACACTG ATTTTGAATC AGAAGTTTAC GTGTTATCAA 600 

AAGAAGAAGG TGGTCGTCAT ACACCATTCT TCAAAGGTTA CCGTCCACAG 650 

TTCTACTTCC GTACAACGGA CGTAACAGGT ACAATCGAAT TACCGGAAGG 700 

TGTTGAGATG GTGATGCCTG GTGATAACAT CAAGATGACT GTAAGTTTGA 750 

TTCACCCAAT CG 762 

30 



2) INFORMATION FOR SEQ ID NO: 137 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 
45 (B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 

TGGAGCTATC TTAGTTGTAT CAGCAGCGGA TGGACCAATG CCACAAACAA 50 

50 GAGAACACAT CTTATTATCA AGACAAGTAG GAGTACCATA TATCGTAGTA 100 

TATTTGAATA AAGCAGATAT GGTAGAAGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTAAGAGAAT TACTATCTGA ATATGGATTC CCAGGAGATG 200 

AAATTCCAAT CATAACAGGA TCATCCTTAG GAGCATTAAA TGGAGAACAA 250 

AAATGGATAG ATCAAATCAT GGCATTGATG AAAGCCGTAG ATGAATATAT 300 

55 TCCAACACCG GAAAGAGCAG TAGATCAACC ATTCTTGATG CCAATCGAAG 350 

ACGTATTTAC AATTACAGGA AGAGGAACTG TAGTAACAGG AAGAGTTGAA 4 00 

AGAGGAGTTG TAAAAGTWGG AGAAGAAGTT GAAATCGTAG GAATCAAAGC 4 50 

GACAACAAAG ACAACTTGTA CYGGAGTAGA AATGTTCCGA AAATTATTGG 5 00 

ATCAAGGACA AGCAGGAGAT AACATCGGAG CTTTATTRAG AGGAACCAAG 5 50 

60 AAAGAAGATG TAGAAAGAGG ACAAGTATTG GCAAAACCAG GAACAATTCA 600 

83 
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TCCTCATACA AACTTCAGTG GAGAAGTATA TGTATTGACA AAAGAAGAAG 650 

GAGGAAGACA TACTCCATTC TTCTCAGGAT ACAGACCACA ATTTTACTTT 700 

AGAACCACAG ATATTACAGG AGCAGTAACA TTACCAGAAG GAGTAGAAAT 750 

GGTAATGCCR GGAGATAATA TCACAATGAC AGTAGAATTG ATTCACCCAA 800 

5 TTGCAATGGA AACAGGATTA CGATTTGCAA TT 832 



2) INFORMATION FOR SEQ ID NO: 138 



10 



(iXS EQUENCE.. CHARACTERISTICS.: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Peptostreptococcus asaccharolyticus 

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 

25 TAGTATGTTC AGCAGCAGAY GGTCCAATGC CACAAACAAG AGAACACATT 50 

CTACTAGCAA GACAAGTTGG TGTACCAAAG ATAGTAGTAT TCCTAAACAA 100 

AGAAGACCAA GTAGACGATC CAGAACTAAT TGAATTAGTA GAGATGGAAA 150 

TCAGAGACCT ACTATCAGAA TATGACTTCG AYGGAGACAA CACACCAATC 200 

GTAGTAGGAT CAGCATTAAA AGCCCTAGAC GATCCAGACG GAGAATGGGG 250 

30 AGACAAAATC GTAAAACTAA TGGAAGMAGT AGACGAATAC ATCCCAACAC 3 00 

CAGTAAGAGA TACAGAACAC CCATTCCTAA TGCCAATCGA AGACRTATTC 350 

TCAATYACAG GAAGAGGAAC AGTAGCAACA GGAAGAGTAG AACAAGGTGT 400 

AGTAAAAGTA GGMGACACAG TAGAACTAGT AGGCTTAACA GACGAAAGCA 450 

GACAAGTAGT AGTAACAGGT GTAGAAATGT TTAGAAAACA ACTAGACCTA 500 

35 GCAGAAGCMG GAGACAACAT TGGAGCCCTA CTAAGAGGAG TACAAAGAGA 550 

AGAAATCCAA AGAGGACAAG TACTAGCAGC ACCAGGAACA ATCAAACCAC 600 

ACACAAAATT TGAAGCAGAA GTATACGTAC TAACAAAAGA AGAAGGTGGA 650 

AGACACACAC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTCAGAAC 700 

AACAGACGTA ACAGGAGACA TCCAACTAGC AGACGGAGTA GAAATGGTAA 750 

40 TGCCAGGAGA CAACTCAACA TTTACAGTAA CACTAATCAC ACCAATCGCA 800 

ATGGACGAAG GACTAAGATT CGC 823 



45 2) INFORMATION FOR SEQ ID NO: 139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN : ATCC 9321 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 

84 
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CTATCATCGT 
CACATCCTAC 
AAACAAAGAA 
TGGAAATCAG 
5 CCAGTAGTAG 
CCCATGGTCA 
TCGACATCCC 
GACGTAATGA 
AAGAGGAACA 

10 AAGATACAAA 

GTAGACCAAG 

AACAAGAGAT 
TAAACCCACA 
GAAGGTGGAC 

15 CTTTAGAACA 
AAATGGTAAT 
CCAATCGCTC 



AGTATCTGCA 
TAGCAAGACA 
GACCAAGTAG 
AGACCTACTT 
TAGGATCTGC 
GACAAAATCC 
AGAAAGAGAC 
CAATCTCAGG 
CTAAAAGTTG 
AGAAACAGTA 
-GAGAATCTGG- 
CAAATCTCAA 
CACAGAATTC 
GTCACACACC 
ACAGACGTAA 
GCCAGGAGAC 
TAGAAGAAGG 



GCAGACGGTC 

AGTAGGCGTT 

ACGATCCAGA 

TCAGAATACG 

TCTTAAATCA 

TTGACCTAAT 

AACGACCAAC 

ACGTGGAACA 

GTGATACAGT 

GTAACTGGAG 

AGATAACGTA- 

GAGGACAAGT 

GAAGGTCAAG 

ATTCTTCAGT 

CAGGAGACAT 

AACGCAACAT 

ACTAAGATTC 



CAATGCCCAA 
CCAAAAATCG 
ACTAATCGAA 
ACTTCGATGG 
CTAGAAGAAG 
GGCACAAGTA 
CATTCCTAAT 
GTAGCAACAG 
AGAAATCGTA 
TAGAAATGTT 
GGAC-TAGTAG- 
ACTAGCAAAA 
TATACGTACT 
GGATATAGAC 
CGAACTAGAA 
TCAAAATCAC 
GC 



ACAGAGAGAA 

CAGTATTCCT 

TTAGTAGAAA 

AGACAACGCT 

GCGGAGAAGG 

GACGAATACT 

GCCAGTAGAA 

GAAGAGTTGA 

GGACTAACAG 

CCACAAATCM 

-TAAGAGGAGT— 

CCAGGWTCAG 

AACAAAAGAA 

CACAATTCTT 

GAAGGCGTAG 

ACTCCAAAAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
-550 
600 
650 
700 
750 
800 
832 



20 

2) INFORMATION FOR SEQ ID NO: 140 

( i ) SEQUENCE CHARACTER I ST I CS : 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

30 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 

CGGTGCTATC ATCGTAGTTG CTGCAACTGA TGGTCCTATG CCTCAGACGC 50 

GTGAGCACAT CCTACTAGCA CGTCAGGTCA ACGTACCTCG TCTAGTTGTC 100 

TTTATGAACA AGTGCGACCT TGTTGATGAC GAGGAGATGC TCGAGCTCGT 150 

40 AGAGATGGAT ATGCGTGAGC TACTAAGCTT CTATGACTTT GACGGCGACA 200 

ACACTCCTGT CATCCGTGGT TCTGCTCTTG GTGCTCTCAA TGGTGAGCCT 250 

AAGTGGGTAG AGAAGgYtAT ^GAGCTCATG" GAGGCTGTAG AGAL'i'TG^A 7 ! 7 ~ iUU " 

CCCACTACCT GAGCGCGACA TCGACAAGCC TTTCCTAATG CCTGTAGAGG 350 

ACGTATTCTC TATCACAGGT CGTGGTACTG TCGCTACTGG TCGTATCGAG 400 

45 ACTGGTGTCG TTAAGGTCAA CGATGAGGTT CAGATCATCG GTCTAGGTGC 450 

TGAGGGTAAG AAGAGCGTCG TAACTGGCGT GGAAATGTTC CGCAAGATCC 500 

TTGATGAGGG TGAAGCTGGT GATAACGTAG GTCTCCTACT CCGTGGTATC 550 

GACAAGGACG AGATCAAGCG CGGTATGGTC CTAGCACACC CAGGTCAGGT 600 

CAAGCCTCAC GATCACTTCA AGGCTGAGGT CTATATCCTG AAGAAGGAAG 650 

50 AGGGTGGTCG TCACACACCA TTCCACAACA AGTACCGTCC TCAGTTCTAC 700 

ATCCGTACGC TAGACGTAAC GGGCGAGATC ACACTCCCAG AGGGTGTAGA 750 

GATGGTTATG CCTGGTGATA ACGTCACCAT CGATGTCAAG CTCATCTCTC 800 

CAGTAGCTTG TAGCGTAGGT CTACGCTTCG C 831 



55 



2) INFORMATION FOR SEQ ID NO: 141 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 818 bases 

85 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: ATCC 33277 

(xi) SEOUE^E^EgCRIPTI ON: SEQ _ID_NOj__141 



CGGTGCTATA ATCGTTGTAG CAGCTACAGA CGGTCCTATG CCTCAGACTC 50 

GCGAGCACAT CCTTTTGGCT CGCCAGGTAA ACGTTCCTCG TCTGGTTGTT 100 

15 TTCATGAACA AATGTGACAT GGTAGACGAT GAAGAGATGC TCGAGCTTGT 150 

TGAAATGGAC ATGCGCGAAC TCCTTTCTTT CTACGATTTC GATGGTGACA 200 

ATACCCCTAT CATCCGTGGT TCTGCTCTGG GCGCTTTGAA TGGAGAGCCT 250 

CAGTGGGAAG ACAAGGTGAT GGAGCTTATG GAAGCTGTTG ACAACTGGGT 300 

TCCCCTGCCT GAGCGCGATA TCGACAAACC GTTCTTGATG CCGGTTGAAG 350 

20 ACGTGTTCTC TATCACGGGT CGTGGTACGG TCGCTACAGG ACGTATCGAA 400 

ACCGGTATTG TGAAGACCGG TGACGAAGTT CAAATCATCG GCCTCGGTGC 450 

AGAAGGAATG AAGTCGGTTG TTACGGGTGT TGAAATGTTC CGTAAGATTC 500 

TTGACGAAGG TCAGGCTGGT GACAACGTTG GTCTCCTCCT GCGTGGTATC 550 

GATAAGGATC AGATCAAGCG TGGTATGGTT ATCTCTCACC CGGGTAAGAT 600 

25 TACTCCTCAC AAGAGATTTA AGGCCGAGGT TTATATCTTG AAGAAAGAAG 650 

AAGGTGGTCG CCACACTCCT TTCCACAACA AATATCGTCC GCAGTTCTAC 700 

ATCCGTACGC TTGACGTGAC CGGTGAAATC ACTCTTCCCG AAGGAACAGA 750 

AATGGTTATG CCCGGTGACA ACGTAACGAT CACTGTAGAA CTCATCTACC 8 00 

CGGTTGCATG TAATGTAG 818 

30 



2) INFORMATION FOR SEQ ID NO: 142 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 
45 (B) STRAIN: ATCC 49100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

CGGCGCTATT CTGGTTGTTG CTGCAACTGA TGGTCCTATG CCTCAAACTC 50 

50 GTGAGCACAT CCTGTTAGGY CGCCAGGTTG GCGTACCATA CATCATTGTG 100 

TTCCTGAACA AGTGTGACAT GGTTGAYGAT GAAGAGCTGT TAGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TTCTGTCTCA GTACGATTTC CCAGGTGATG 2 00 

ATACTCCAGT TGTTCGTGGT TCTGCGCTGA AAGCGTTRGA AGGCGAAGCT 250 

GAGTGGGAAG CTAAAATCAT TGAATTGGCT GACTCCCTGG AYAGCTACAT 300 

55 TCCACAGCCA GAGCGTGCAA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTTTTCTC AATCTCTGGC CGTGGTACAG TAGTAACCGG TCGTGTAGAG 4 00 

CGCGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

TACTGTGAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AARTTACTGG 500 

ATGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAG 550 

60 CGTGATGAAA TCGAACGTGG TCAAGTATTA GCAAAACCAG GTTCAATCAA 6 00 

86 
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CCCGCATACT AACTTCGTAT CAGAAGTTTA 
GTGGTCGTCA TACTCCATTC TTCAAAGGCT 
CGTACAACTG ACGTGACCGG TACCATCGAA 
GGTAATGCCA GGTGATAACA TTCAGATGAC 
5 TCGCGATGGA CGAAGGTTTA CGCTTCGCTA 



TATCCTGAGC AAAGATGAAG 650 
ACCGTCCACA GTTYTACTTC 700 
CTGCCAGAAG GCGTAGAGAT 750 
TGTAACTCTG ATTGCCCCAA 8 00 

830 



2) INFORMATION FOR SEQ ID NO: 143 



10 



_ _(i ) SEQUEN CE CHAR ACT ERISTI CS : 

(A _ ) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

25 TGGTGCTATC TTGGTTGTAG CTGCTACTGA TGGTCCTATG CCTCAGACTC 50 

GTGAGCACGT ATTGCTCGCT CGTCAGGTAA ACGTACCTCG CTTGGTTGTA 100 

TTCTTGAACA AGTGTGATAT GGTTGACGAT GCTGAGATGC TTGACCTCGT 150 

TGAGATGGAG GTTCGTGAGA TCCTCGAGCA GTACGGTTAT GAGGAGGATA 200 

CTCCTATTAT TCGTGGTTCT GCACTCGGTG CTTTGAACGG TGTTGAGAAG 250 

30 TGGGTAGACT CTGTAATGGA GCTCATGGAT ACTGTTGACA CTTGGATTGA 300 

AGAGCCAGAG CGTGAGATTG ACAAGCCATT CTTGATGCCT GTTGAGGACG 350 

TATTCTCTAT CACAGGTCGT GGTACTGTAG CTACTGGTCG TATCGAGACT 4 00 

GGTATCTGTA AGGTAGGTGA TGAGGTTCAG TTGCTCGGTC TCGGTGAGGA 450 

CAAGAAGTCT GTTATCACTG GTGTTGAGAT GTTCCGTAAG AACCTTCCAA 500 

35 CAGGTCAGGC TGGTGACAAC GTAGGTCTCC TCCTTCGTGG TATCGATAAG 550 

GCTGAGGTTA AGCGTGGTAT GGTTGTTGTG CACCCAGGTG CTATTACTCC 600 

TCACGATCAC TTCAAGGCAT CTATCTATGT ATTGAAGAAG GAAGAGGGTG 650 

GTCGTCATAC TCCATTCGGT AACAAGTATC GTCCACAGTT CTACCTCCGT 700 

ACAATGGACT GTACAGGTGA AATCCACCTC CCAGAGGGCG TTGAGATGGT 750 

40 TATGCCAGGT GACAACGTAG AGATTGAAGT TGTATTGATC TATAAGGTTG 800 

CTTTGAACGA GGGTCTTCGT T 821 



45 2) INFORMATION FOR SEQ ID NO: 144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella oralis 

(B) STRAIN: ATCC 33269 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 

87 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TGGTGCTATT CTTGTAGTAG CTGCTACTGA CGGTCCTATG CCTCAAACTC 50 

GTGAACACGT GCTTCTTGCT CGTCAGGTGA ACGTACCTCG TTTGGTCGTT 100 

TTCTTGAACA AGTGCGATAT GGTTGACGAT GAAGAAATGC TTGAGCTCGT - 150 

AGAAATGGAG CTTCATGAAC TTCTCGAGCA GTATGAATAT GAGGAGGATA 200 

5 CTCCTATTGT TCGTGGTTCG GCACTTGGCG CTCTGAATGG AGTAGAGAAG 250 

TGGGTTGACA GCGTGATGAA GTTGATGGAT ACCGTTGATG AATGGATACA 300 

GGAACCACCG CGTGATCTTG ATAAGCCTTT CTTGATGCCG GTAGAGGATG 350 

TATTTTCTAT TACTGGTCGT GGAACGGTTG TTACAGGCCG TATTGAAACT 400 

GGTAAGGTTA AGGTGGGCGA TGAAGTTCAA CTTCTTGGTC TCGGTGAAGA 4 50 

10 T AAGAAGTCC GTTGTGACAG GCGTTGAGAT GTTCCGTAAG ATTCTTGACG 500 

MGGTGAA"GC~TGGTGATAAT"GTAGGCTTGC^ TGCTTCGTGG- TATGGATAAG 55 0- 

ACGGAAGTAA AGCGTGGTAT GGTTGTCGTA CATCCGGGGG CTATTACTCC 600 

TCACGATCAT TTCAAGGCTT CAGTTTACGT ATTGAAGAAA GAAGAAGGCG 650 

GTCGCCATAC TCCGTTTGGT AMCAAGTATC GTCCACAGTT CTATCTTCGT 700 

15 ACCATGGACT GTACTGGTGA AATTACTCTT CCGGAAGGAG TTGAGATGGT 750 

AATGCCGGGT GATAACGTCG AAATTGAAGT TAAGTTGATC TATCCGGTAG 800 

CTTTGAACGA GGGACTTCGT TTCGCTA 827 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 14 5 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Propionibacterium acnes 

(B) STRAIN: ATCC 6919 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 

CGGCGCCATC CTCGTGGTTG CTGCTACCGA CGGCCCGATG CCTCAGACTC 50 

GCGAGCACGT TCTGCTCGCT CGTCAGGTGG GCGTGCCCGC CATCGTCGTC 100 

GCCCTCAACA AGTGCGACAT GGTTGACGAT GAGGAGCTCA TTGAGCTCGT 150 

40 CGAGATGGAG GTCCGCGAGC TGCTGACCTC GCAGGAGTTC GACGGCGACA 200 

ACTGCCCTGT CGTTCGCATC TCCGCCTTCC AGGCCCTCCA GGGTGATGAG 250 

AAGTGGACCC" AGYcGATC^ GACGCCGTGcT ACGA^lACAl" ~ i uV * 

CCCGCAGCCT GAGCGCGATC TCGACAAGCC CTTCCTTATG CCGATCGAGG 350 

ACGTCTTCAC CATCACCGGC CGTGGCACCG TTGTCACCGG TCGTGTCGAG 400 

45 CGCGGCGTCG TCAAGACTGG CGAAGAGGTC GAGATCGTCG GTATCCACGA 450 

GAAGACCCAG AAGACCACCG TTACCGGTGT CGAGATGTTC CGCAAGATCC 500 

TCGACGAGGG CCGCGCTGGT GAGAACGTCG GCGTTCTGCT CCGTGGCACC 550 

AAGAAGGAGG ATGTCGTTCG CGGCATGGTC CTCTCCAAGC CTGGTTCCAC 600 

' CACCCCCCAC ACCGACTTCG AGGGCCAGGT CTACGTCCTC AAGAAGGATG 650 

50 AGGGTGGCCG CCACAAGCCG TTCTTCTCCC ACTACAGCCC CCAGTTCTAC 700 

TTCCGTACCA CGGACGTGAC TGGCACTGTT GAGCTCCCCG AGGGCACCGA 750 

GATGGTCATG CCTGGCGACA ACACCGACAT GACTGTGCAC CTGATTCACC 800 

CGGTTGCCAT GGAGGATCAG CTCAAGTTCG CTA 833 



55 



2) INFORMATION FOR SEQ ID NO: 14 6 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 74 5 bases 

88 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 35659 

10 

(xi) SEQUENCE DESCRTPTIONr SEQ~ID"NO~14-6 

CACAAACTCG TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC 50 

ATCATCGTAT TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGCTGTT 100 

AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAA TACGATTTCC 150 

CAGGTGATGA CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCACTGGAA 200 

GGCGAAGCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA 250 

TTCTTATATC CCAGAGCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC 300 

CAATCGAAGA TGTATTCTCA ATCTCAGGCC GTGGTACAGT AGTTACTGGT 350 

CGTGTAGAGC GTGGTATCAT CAAAGTAGGT GATGAAGTTG AGATTGTTGG 400 

TATCAAAGAA ACCGCCAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450 

AATTACTTGA CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTGCTGCGT 500 

GGTACAAAAC GTGAAGAAAT CGAACGTGGA CAAGTACTGG CRAAACCAGG 550 

CTCAATCAAC CCACACAACA AATTTGAATC AGAAGTTTAT ATTCTGAGCA 600 

AAGATGAAGG TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650 

TTCTACTTCC GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG 700 

CGTAGAAATG GTAATGCCAG GCGACAACGT GAACATGATC GTTGA 745 



2) INFORMATION FOR SEQ ID NO: 14 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE :_ 

(A) ORGANISM: Proteus pennen 

(B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 

GGAGCTATCC TGGTTGTTGC TGCGACAGAT GGCCCAATGC CACAAACTCG 50 

TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC ATCATCGTAT 100 

TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGTTACT GGAATTAGTM 150 

50 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CAGGTGATGA 200 

CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCACTGGAA GGCGAAGCAG 250 

AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA TTCATACATC 300 

CCAGARCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC CAATTGAAGA 350 

CGTATTCTCA ATTTCAGGCC GTGGTACAGT AGTAACAGGT CGTGTTGAGC 4 00 

55 GTGGCGTAAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATTAAACCA 450 

ACAGCGAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA AATTACTTGA 500 

CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTTCTGCGT GGTACTAAAC 550 

GTGAAGAAAT CGAACGTGGA CAAGTACTGG CGAAACCAGG TTCAATCAAC 600 

CCACACACTA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

60 TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

89 
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20 



GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG CGTAGAAATG 750 
GTAATGCCAG GTGACAACAT CAACATGATC GTTGAACTGA TTCACCCAAT 800 
CGCGATGGAC GACGGTTTAC GTTTCGCTA 829 



2) INFORMATION FOR SEQ ID NO: 148 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 824 bases 
(B) TYPEt -Nucleic-acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 



CGGAGCTATT CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGCCAGGTTG GTGTACCTTA CATCATCGTA 100 

25 TTCCTGAACA AATGTGACAT GGTTGATGAT GAAGAACTGC TGGAATTAGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200 

ACACTCCAGT AATCCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 250' 

GAGTGGGAAG CAAAAATTGT TGAATTAGCA GAAGCACTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCTATCGAAG 350 

30 ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTAGAG 400 

CGTGGTGTTG TTAAAGTTGG TGAAGAAGTT GAGATTGTTG GTATTAAAGA 450 

CACAGTTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG TTCTTCTGCG TGGTACTAAA 550 

CGTGAAGAAA TCGAACGTGG ACAAGTACTG GCTAAACCAG GTTCAATCAA 600 

35 GCCACACACT AAATTCGAAT CAGAAGTTTA TATCCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG TACTATCGAA TTACCAGAAG GCGTAGAAAT 750 

GGTAATGCCA GGTGACAACA TCAACATGAT CGTTGAACTG ATTCACCCTA 800 

TCGCGTAGGA CGACGGTTTA CGTT 824 

40 



2) INFORMATION FOR SEQ ID NO: 149 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) ' TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia alcalifaciens 
55 (B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 

CACAAACTCG TGAGCACATC CTGTTAGGTC GCCAAGTAGG TGTTCCTTAC 50 
60 ATCATCGTTT TCCTGAACAA ATGTGACATG GTAGACGACG AAGAACTGTT 100 

90 
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AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC 150 

CAGGCGATGA CACTCCAGTT GTTCGCGGTT CAGCACTGAA AGCGCTGGAA 200 

GGCAACCCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG GTTACCTGGA 250 

TTCTTACATC CCAGAACCAG AGCGTGCAAT TGACAAGCCA TTCCTGCTGC 300 

5 CAATCGAAGA CGTATTCTCA ATCTCTGGTC GTGGTACAGT AGTAACAGGC 350 

CGTGTTGAGC GTGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG 4 00 

TATTCAAGCG ACTGCGAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450 

AACTGCTGGA TGAAGGTCGT GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT 500 

GGTACTAAAC GTGAAGAAAT TCAACGTGGT CAAGTACTGG CTAAACCAGG 550 

10 TTCAATCAAG CCACACACTC AATTCGAATC AGAAGTATAT ATTCTGAGCA 600 

— AAGATGAAGG ~TGGTCGTCAT~~ACTCCATTCT-TCAAAGGGTA— GGGTGGAGAG 65 0- 

TTCTACTTCC GTACAACTGA CGTAACCGGT ACTATCGAAC TGCCAGAAGG 700 

CGTAGAGATG GTAATGCCAG GCGACAACAT CAACATGATC GTGAC 745 



15 



30 



2) INFORMATION FOR SEQ ID NO: 150 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 



CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAAGTAG GTGTWCCTTA CATCATCGTT 100 

35 TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAACTGT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

ACACTCCAGT TGTCCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCAACCCA 250 

GAGTGGGAAG CGAAAATTGT TGAATTAGCA GGTCACTTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCAATCGAAG 350 

40 ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTTGAG 400 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

CACGGTTAAA A CAACT TGTA~ CT GG C GT TGA~ AA TGT TCC G T~ ^LAACTGCTGcf ~ b U~u" " 

ACGAAGGTCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAA 550 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCAAAACCAG GTTCAATCAA 600 

45 GCCACACACT AAATTCGAAT CAGAAGTCTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGATAACA TCAACATGAT CGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTTTA CGTTTCGCAA 830 

50 



2 ) INFORMATION FOR SEQ ID NO: 151 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 
5 (B) STRAIN: ATCC 33673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

1 0 GTGA GCACAT CCTGTTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTT 100 

"TTCCTGAAC/T"^ 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

ACACTCCAGT TGTTCGYGGT TCAGCACTGA AAGCGCTGGA AGGTATCCCT 2 50 

GAGTGGGAAG CGAAAATTGT TGAATTAGCC GGTTACCTGG ACAGCTACAT 3 00 

15 CCCAGAACCA GAGCGCGCAA TTGACCGTCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG SCGTGTTGAG 4 00 

CGTGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 4 50 

CACRGTTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTTG 500 

ACGAAGGTCG TGCTGGTGAG AACGTTGGTG TTTTACTGCG TGGTACTAAG 550 

2 0 CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACT ACTTTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TCAACATGAT CGTGACACTG ATTCACCCAA 800 

25 TCGCGATGGA TGATGGTTTA CGTTTC 826 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 152 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 

(B) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 



45 CGGTGCAATC CTAGTTGTTG CGGCAACAGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGTGATG 200 

ACACTCCAGT TATCCGTGGT TCAGCGCTGA AAGCGTTGGA AGGCAACCCA 250 

50 GAGTGGGAAG CGAAAATCGT TGAACTAGCA GAAGCACTGG ACAGCTACAT 300 

CCCAGAGCCA GAGCGTGCAA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC AATCTCAGGT CGTGGTACAG TAGTCACAGG CCGTGTTGAG 400 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTAG GTATCAAAGA 450 

GACTGCGAAA ACCACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTGG 500 

55 ACGAAGGCCG TGCGGGTGAG AACGTAGGTG TTCTGCTGCG TGGTACTAAG 550 

CGTGAAGAAA TCGAACGTGG TCAAGTTCTG GCGAAACCAG GTTCAATCAA 600 

GCCACACACA ACTTTCGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACGCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

60 GGTAATGCCA GGCGACAACG TGAACATGAA AGTAACTCTG ATTCACCCAA 800 
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TCGCGATGGA CGATGGTTTG CGTTTCGCAA 



20 



5 2 ) INFORMATION FOR SEQ ID NO: 153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 
-1-0 (C) STRAND EDNE SS i Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: ATCC 35554 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 



CCTGGTTTGC TCGGCTGCCG ACGGCCCCAT GCCGCAGACC CGCGAGCACA 50 

TCCTGCTGTC CCGCCAGGTA GGCGTTCCCT ACATCGTCGT GTTCCTGAAC 100 

AAAGCCGACA TGGTCGACGA CGCCGAGCTG CTGGAACTGG TCGAGATGGA 150 

AGTTCGCGAT CTGCTGAACA CCTACGACTT CCCGGGCGAC GACACTCCGA 200 

25 TCATCATCGG TTCCGCGCTG ATGGCGCTGG AAGGCAAGGA TGACAACGGC 250 

ATCGGCGTAA GCGCCGTGCA GAAGCTGGTA GAGACCCTGG ACTCCTACAT 300 

TCCGGAGCCG GTTCGTGCCA TCGACCAGCC GTTCCTCATG CCGATCGAAG . 350 

ACGTGTTCTC GATCTCCGGT CGCGGTACCG TGGTAACCGG TCGTGTAGAG 400 

CGCGGCATCA TCAAGGTCCA GGAAGAAGTG GAAATCGTCG GCATCAAGGC 450 

30 GACCACCAAG ACCACCTGCA CCGGCGTTGA AATGTTCCGC AAGCTGCTCG 500 

ACGAAGGTCG TGCTGGTGAG AACGTTGGTA TCCTGCTGCG TGGCACCAAG 550 

CGTGAAGACG TAGAGCGTGG CCAGGTACTG GCCAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAGT GCGAAGTGTA CGTGCTGTCC AAGGAAGAAG 650 

GTGGTCGTCA CACCCCGTTC TTCAAGGGCT ACCGTCCGCA GTTCTACTTC 700 

35 CGTACCACKG ACGTGACCGG TAMCTGCGAG CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTCACCCTG ATCGCTCCGA 800 

TCGCGATGGA AGATGGCTGC GTTCGCG 827 



40 



50 



2) INFORMATION FOR SEQ ID NO: 154 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 841 bases 
45 (B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas fluorescens 

(B) STRAIN: ATCC 13525 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 

CGGCGCAATC CTGGTTTGCT CGGCCGCTGA TGGTCCGATG CCACAAACCC 50 

GTGAACACAT CCTGCTGTCC CGTCAGGTTG GCGTTCCGTA CATCGTGGTT 100 

TACCTGAACA AGGCTGACCT GGTAGACGAC GCTGAGCTGC TGGAACTGGT 150 

60 TGAGATGGAA GTGCGCGATC TGCTGAGCAC TTACGACTTC CCAGGCGACG 200 
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ACACTCCGAT 
GACAACGAAA 
CAGCTACATC 
CAATCGAAGA 
5 CGTATCGAGC 
TCTGCGTGAC 
AGCTGCTCGA 
GGTACCAAGC 
TTCGGTTAAG 

-10 AAGAAGAAGG. 

TTCTACTTCC 
CGTTGAAATG 
TCAAAACCAT 



CATCATCGGT 
TGGGCACCAC 
CCAGATCCAG 
CGTGTTCTCG 
GCGGTATCGT 
ACTACCGTCA 
CGAAGGTCGT 
GTGACGACGT 
CCGCACACCA 
_CGGTCGTCAC 
GTACTACTGA 
GTTATGCCAG 
CGCAATGGAA 



TCTGCTCGTA 

GTCCGTTCGT 

TTCGTGTTAT 

ATCTCCGGTC 

TAAGGTTCAA 

CCACCTGCAC 

GCTGGCGAGA 

TGAGCGTGGC 

AGTTCGAAGC 

ACTCCGTTCT 

CGTGACTGGT" 

GCGACAACAT 

GACGGTCTGC 



TGGCTCTGGA 
AAACTGGTTG 
CGACAAGCCG 
GCGGTACTGT 
GATCCACTGG 
CGGTGTTGAA 
ACTGCGGCGT 
CAGGTTCTGG 
TGAAGTCTAC 
TCAAAGGCTA 
AACTGCGAGC 
CAAAATGGTT 
GTTTCGCTAT 



AGGCAAAGAC 

AAACTCTGGA 

TTCCTGATGC 

TGTGACTGGT 

AAATCGTTGG 

ATGTTCCGTA 

TCTGCTGCGT 

TTAAGCCAGG 

GTACTGAGCA 

CCGTCCACAG 

TGCCGGAAGG- 

GTTACCCTGA 

T 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
-750- 
800 
841 
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20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas stutzeri 

(B) STRAIN: ATCC 17588 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 



CGGCGCGATC 
GCGAGCACAT 
TTCCTGAACA 

35 CGAGATGGAA 
ACACTCCGAT 
GACAACGAGC 
CAGCTACATT 
CGATCGAAGA 

40 CGCGTAGAGC 
TCTGCGTCCG 



CTGGTCTGCT 
CCTGCTGTCC 
AGGCCGACAT 
GTTCGYGACC 
CATCATCGGC 
TCGGCACCAC 
CCCGAGCCGG 
CGTGTTCTCG 
GCGGCATCGT 
ACCACCAAGA 



CGGCTGCTGA 
CGTCAGGTTG 
GGTTGATGAC 
TGCTGTCGAC 
TCCGCGCTGA 
TGCGGTGAAG 
TTCGTGCCAT 
ATCTCCGGTC 
CAAGGTTCAG 
CTACCTGCAC 



CGGCCCCATG 
GTGTTCCGTA 
GCCGAGCTGC 
CTACGACTTC 
TGGCGCTGAA 
AAGCTGGTCG 
CGACAAGCCG 
GCGGCACSGT 
GAAGAGATCG 
CGGCGTTGAG 



CCGCAGACTC 
CATCGTCGTG 
TCGAGCTGGT 
CCGGGTGAYG 
CGGCGAAGAC 
AGACCCTGGA 
TTCCTGATGC 
GGTAACCGGT 
AGATCGTCGG 
ATGTTCCGCA 



AGCTGCTCGA 
GGCACCAAGC 
CACCATCAAG 
45 AGGAAGAAGG 
TTCTACTTCC 
CGTCGAGATG 
TCAAGCCGAT 



YGARGGTCGT 
GTGACGAAGT 
CCGCACACCA 
TGGTCGTCAC 
GTACCACTGA 
GTAATGCCGG 
CGCCATGGAA 



GCTGGCGAGA 
GGAGCGTGGT 
AGTTCGAAGC 
ACCCCGTTCT 
YGTGACYGGW 
GCGACAACGT 
GACGGCCTGC 



ACTGCGGYGT 
CAGGTTCTGG 
CGAAGTGTAC 
TYAAGGGCTA 
TCGTGCGARC 
GAAGATGGTT 
GCTTCGCGAT 



GLTGCTtiCUT 
CCAAGCCGGG 
GTGCTGTCCA 
CCGTCCKCAG 
TGCCGGAAGG 
GTCACCCTGA 
T 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 



600 
650 
700 
750 
800 
841 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 156 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



94 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 



GCTATTCTAG TAGTATCAGC AACTGACGGT CCAATGCCAC AAACACGTGA 50 

GCACATTCTA TTATCACGTC AGGTTGGTGT ACCATACATC ATCGTATTCA 100 

10 TGAACA AATG TGACATGGTA GATGACGAAG AGTTACTAGA GCTAGTAGAA 150 

ATGGAAGTGC "~GTGAATTACT~TTCAGACTAC GACTTCCCAG-GTGATGAGAG- 200- 

TCCAATCATC AAAGGTTCAG CTTTAGAAGC GTTAAATGGT AACGACGGTA 250 

AGTACGGTGA GCCAGCAGTT ATCGAACTAC TAAACACTCT AGACACTTAC 300 

ATTCCAGAGC CAGAGCGTGA CATCGATAAG CCATTCCTAA TGCCAATCGA 350 

15 AGACGTATTC TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGCCGTGTTG 400 

AATCTGGTAT CATCAAAGTT GGTGACGAAA TCGAAATCGT TGGTATCAAA 450 

GACACAGTTA AAACAACTTG TACTGGTATC GAGATGTTCC GTAAGTTACT 500 

AGACGAAGGT CGTGCTGGTG AGAACTGTGG TGTACTATTA CGTGGTACTA 550 

AGCGTGAAGA CGTACAACGT GGTCAAGTAC TTGCTAAGCC AGGTTCAATC 600 

20 ACTCCACACA CCAACTTCGA CGCAGAAGTA TACGTACTAT CAAAAGAAGA 650 

AGGTGGTCGT CACACTCCAT TCTTAAATGG TTACCGTCCA CAGTTCTACT 700 

TCCGTACTAC TGACGTAACA GGTGCAATCA CGTTACAAGA AGGTACTGAA 750 

ATGGTAATGC CAGGCGATAA CGTTGAGATG AGCGTAGAGC TAATCCACCC 800 

AATCGCTAGG ACAAAGGTTT ACGTTTCGCA ATC 833 

25 



2) INFORMATION FOR SEQ ID NO: 157 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 
40 (B) STRAIN: ATCC 33071 



GGCGCTATCC 

45 TGAGCACATC 
TCATGAACAA 
GAAATGGAAG 
CATCCCGGTC 
CTTGGGAAGC 

50 CCATTGCCAG 
CGTATTCTCC 
GCGGTATCGT 
ACTGTTAAGT 
CGAAGGCCGT 

55 GTGAAGACAT 
CCACACACCA 
TGGTCGTCAC 
GTACAACTGA 
GTTATGCCTG 

60 CGCGATGGAT 



TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACTCG 50 

CTGCTGGGTC GCCAGGTTGG CGTTCCATAC ATGATCGTGT 100 

ATGCGACATG GTAGATGACG AAGAGCTGCT GGAACTGGTA 150 

TTCGCGAACT TCTGTCTGCT TACGAATTCC CAGGCGACGA 200 

ATCAAAGGTT CAGCGCTGAA AGCACTGGAA GGCGATGCTA 250 

GAAAATCATC GAACTGGCAG AAGCACTGGA CAGCTACATT 3 00 

AGCGTGCTAT CGATAAGCCA TTCCTGCTGC CAATCGAAGA 350 

ATCTCCGGTC GTGGTACAGT GGTTACCGGT CGTGTAGAGC 4 00 

TAAAGTGGGC GAAGAAGTTG AAATCGTCGG TATCAAGGAC 4 50 

CTACTTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAGC 550 

CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

AGTTTGATTC CGAAGTGTAC ATCCTGAGCA AAGATGAAGG 650 

ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTGACAACGT GAACATGGTT GTTACCCTGA TCCACCCAAT 800 

GACGGTCTGC GTTTC 825 

95 
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2) INFORMATION FOR SEQ ID NO: 158 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
_10_ (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

20 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGC CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ATACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

3 0 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 

35 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 



40 2) INFORMATION FOR SEQ ID NO: 159 



( 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 

(B) STRAIN: ATCC 7001 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

60 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 
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ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGTGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

5 CGCGGTATCA TCAAAGTGGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGTTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

-10 GCGGGGGT-GA- TAGTCCGTTC-TT.CAAAGGC3L ACCGTCCGCA GTTCTAC7TC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750~ 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



15 



30 



2) INFORMATION FOR SEQ ID NO: 160 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATTATCGT GTTCCTGAAC 100 

35 AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCAA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

40 CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 5 00 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 6 00 

45 CAAGTTCGAA TCTGAAGTGT ATATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 7 00 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 807 

50 



2) INFORMATION FOR SEQ ID NO: 161 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
5 serotype Heidelberg 

(B) STRAIN: ATCC 8326 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 



10 CGGGGGGATC -CTGGT.TGTTG_CTGC_GA CTGA CGGCCCGATG CCGCAGACCC 5 0 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 — 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

15 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

20 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA YACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA TTGCCGGAAG GCGTAGAGAT 750 

25 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



30 2) INFORMATION FOR SEQ ID NO: 162 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houfcenae 

(B) STRAIN: ATCC 43974 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



45 

CCTGGTTGTT 
TCCTGCTGGG 
AAATGCGACA 
AGTTCGTGAA 

50 TCGTGCGTGG 
GCGAAAATCA 
AGAGCGTGCG 
CTATCTCCGG 
ATCAAAGTGG 

55 GTCGACCTGT 
GTGCTGGCGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 

60 GACGTGACTG 



GCTGCGACTG 
TCGTCAGGTA 
TGGTTGATGA 
CTTCTGTCTC 
TTCTGCTCTG 
TCGAACTGGC 
ATTGACAAAC 
TCGTGGTACC 
GCGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GCACCATCGA 



ACGGCCCGAT 
GGCGTTCCGT 
CGAAGAGCTG 
AGTACGATTT 
AAAGCGCTGG 
TGGCTACCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 



GCCGCAGACC 
ACATCATCGT 
CTGGAACTGG 
CCCGGGCGAC 
AAGGCGACGC 
GATTCTTACA 
GCCGATCGAA 
GTCGTGTAGA 
GGTATCAAAG 
CAAACTGCTG 
GTGGTATCAA 
GGCACCATCA 
CAAAGATGAA 
AATTCTACTT 
GGCGTGGAGA 



CGTGAGCACA 
GTTCCTGAAC 
TAGAAATGGA 
GACACGCCGA 
AGAATGGGAA 
TCCCGGAACC 
GACGTATTCT 
GCGCGGTATC 
AGACTCAGAA 
GACGAAGGCC 
ACGTGAAGAA 
AGCCGCACAC 
GGCGGCCGTC 
CCGTACGACT 
TGGTAATGCC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
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GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 
ACGACGG 807 



2) INFORMATION FOR SEQ ID NO: 163 

( i ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 7 bases 
-1-0 — -(B) TYPE :-Nuc lei c_acid_ _ 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



40 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43976 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

25 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 200 

CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 400 

30 GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 

TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GATGAAGGCG 650 

35 GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CATCCGATCG 800 

CRATGGACGA CGGTCTGCGT TTCGCAA 827 



2) INFORMATION FOR SEQ ID NO: 164 



(i) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

60 AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 
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AGTTCGTGAA 
TCGTGCGTGG 
GMGAAAATCA 
AGAGCGTGCG 
5 CCATCTCCGG 
ATCAAAGTGG 
GTCTACCTGT 
GTGCCGGTGA 
ATCGAACGTG 
-1-0- -GAAGT-TGGAA 
ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGACGG 

15 



CTTCTGTCTC 

TTCCGCTCTG 

TCGAACTGGC 

ATTGACAAGC 

TCGTGGTACC 

GCGAAGAAGT 

ACTGGCGTTG 

GAACGTAGGT 

GTCAGGTACT 

-TCTGAAG.TG.T_ 

CTTCAAAGGC 

GCACCATCGA 

ATCAAAATGG 



AGTACGACTT 
AAAGCGCTGG 
TGGCTWCCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 



CCCGGGCGAC 
AAGGCGAMGC 
GATTCTTACA 
GCCGATCGAA 
GTCGTGTAGA 
GGTATCAAAG 
CAAACTGCTG 
GTGGTATCAA 
GGCACCATCA 
CAAAGATGAA 



TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



AGTTCTACTT 
GGCGTGGAGA 
GATCCACCCG 



GACACGGCGA 

TGAGTGGGAA 

TTCCGGAACC 

GACGTATTCT 

GCGCGGTATC 

AGACTCAGAA 

GACGAAGGCC 

ACGTGAAGAA 

AGCCGCACAC 

GGCGGCCGTC 

CCGTACCACT" 

TGGTAATGCC 

ATCGCGATGG 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
"700" 
750 
800 
807 



2) INFORMATION FOR SEQ ID NO: 165 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
30 serotype Typhi 

(B) STRAIN: ATCC 10749 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 

35 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

40 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

45 ACGAAGGCCG TGCNGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACY AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

50 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



55 2) INFORMATION FOR SEQ ID NO: 166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

100 
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(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fonticola 

(B) STRAIN: DSM 4576 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 
CGGCGCTATC CTGGTTGTAG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 



GTGAGCACAT CCTGCTGGGT' CGTCAGGTTG GCGTTCCTTA CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTATGACTTC CCTGGTGATG 200 

15 ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 250 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCC GGTCACCTGG ATTCCTACAT 300 

CCCAGAACCA GAGCGTGCTA TCGATCAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG. TAGTTACCGG TCGTGTTGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

20 CACCGTTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGACA TCGAACGTGG TCAGGTACTG GCTAAACCAG GTTCCATCAA 600 

GCCGCACACT CAGTTCGATT CAGAAGTGTA TATCCTGAGC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

25 CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGATAACG TGAACATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCTATGGA CCAAGGC 817 



30 



40 



2) INFORMATION FOR SEQ ID NO: 167 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 787 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 

GCTGCGACTG ACGGCCCAAT GCCTCAGACC CGTGAGCACA TCCTGCTGGG 50 

TCGTCAGGTT GGCGTTCCTT TCATCATCGT ATTCATGAAC AAATGCGACA 100 

TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAAATGGA AGTTCGTGAA 150 

50 CTTCTGTCTG CTTACGACTT CCCTGGTGAT GACCTGCCGG TTGTTCGTGG 200 

TTCAGCGCTG AAAGCACTGG AAGGCGAAGC TGAGTGGGAA GCTAAAATCA 250 

TCGAGCTGGC CGGTTACCTG GATTCTTACA TCCCAGAACC AGAGCGTGCT 3 00 

ATCGACAAGC CGTTCCTGCT GCCAATCGAA GACGTCTTCT CCATCTCCGG 350 

TCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGTATC GTTAAAGTTG 4 00 

55 GCGAAGAAGT TGAAATCGTT GGTATCAAAG ACACCGTTAA GTCTACCTGT 450 

ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC GTGCTGGTGA 500 

GAACGTTGGT GTTCTGCTGC GTGGTATCAA GCGTGAAGAC ATCGAACGTG 550 

GTCAGGTACT GGCTAAACCA GGTTCAATCA AGCCACACAC CAAGTTCGAC 600 

TCAGAAGTGT ACATCCTGAG CAAAGAAGAA GGTGGTCGTC ATACTCCATT 650 

60 CTTCAAAGGC TACCGTCCAC AGTTCTACTT CCGTACAACT GACGTGACCG 700 

101 
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GTACCATCGA ACTGCCAGAA GGCGTTGAAA TGGTAATGCC 
GTGAACATGG TTGTTACCCT GATCCACCCA ATCGCGA 



AGGTGACAAC 



750 
787 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 168 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 745 bases 
(B) _ JTYPE: JJurleic_acid 

(C) STRANDEDNESS: Double" ~ ~~ " ~ 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA , 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13 8 80 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 50 

TCATCATCGT ATTCATGAAC AAATGCGACA TGGTTGATGA TGAAGAGCTG 100 

YTGGAACTGG TAGAAATGGA AGTTCGCGAA CTGCTGTCCG CTTACGACTT 150 

CCCTGGCGAC GACCTGCCGG TAATCCGCGG TTCCGCGCTG AAAGCGCTGG 200 

AAGGCGAAGC TGAGTGGGAA GCGAAAATCA TCGAACTGGC CGAAGCCCTG 250 

GACAGCTACA TCCCAGAGCC AGAGCGTGCT ATCGACAAGC CGTTCCTGCT 300 

GCCAATCGAA GACGTATTCT CCATCTCCGG TCGTGGTACC GTTGTTACCG 350 

GTCGTGTTGA GCGCGGCATC ATCAAAGTTG GCGAAGAAGT TGAAATCGTT 400 

GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTGGCGTTG AAATGTTCCG 450 

CAAACTGCTG GACGAAGGCC GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 500 

GTGGTATCAA ACGTGAAGAA ATCGAACGTG GTCAGGTACT GGCTAAGCCA 550 

GGCTCCATCA AGCCGCACAC CCAGTTCGAA TCTGAAGTGT ACATCCTGAG 600 

CAAAGATGAA GGTGGTCGTC ACACKCCATT CTTCAAAGGC TACCGTCCAC 650 

AGTTCTACTT CCGTACCACT GACGTGACCG GTACCATCGA ACTGCCAGAA 700 

GGCGTAGAGA TGGTAATGCC AGGCGACAAC GTGAACATGG TTGTA 745 



2) INFORMATION FOR SEQ ID NO: 169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odor if era 

(B) STRAIN: ATCC 33077 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 9 

GGCGCAATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCTTTC ATCATCGTGT 100 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAGCT GCTGTCTGCT TACGATTTCC CTGGCGACGA 2 00 

CCTGCCAGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 
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AGTGGGAAGC TAAGATTGTA GAACTGGCTG AAGCGCTGGA TTCTTACATC 300 

CCAGAACCAG AGCGTGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTTGAGC 400 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

5 ACCGTTAAGT CTACCTGTAC CGGTGTAGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCTATCAAG 600 

CCGCACACCA AATTCGACTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACGCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

1 0__ GTACTACTGA CGTGACCGGT ACCATC GAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGATAACGT GAACATGGTT GTTACCCTGA TTCXCCCAAT 800" 

CGCAATGGAC GACGGTCTGC GTTTCGCAA 829 



15 



2) INFORMATION FOR SEQ ID NO: 170 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 



CGGCGCAATC 
GTGAGCACAT 
TTCATGAACA 

35 AGAAATGGAA 
ACCTGCCGGT 
GAGTGGGAAG 
CCCAGAACCA 
ACGTATTCTC 

40 CGCGGTATCG 
CACCGTTAAG 
ACGAAGGCCG 
CGCGAAGATA 
GCCACACACC 

45 GTGGTCGTCA 
CGTACAACTG 
GGTAATGCCA 
TCGCGATGGA 



CTGGTTGTTG 
CCTGCTGGGT 
AATGCGACAT 
GTTCGTGAAC 
TGTTCGTGGT 
CTAAAATCAT 
GAGCGTGCTA 
CATCTCCGGT 
TTAAAGTTGG 
TCTACCTGTA 
TGCTGGTGAG 
TCGAACGTGG 
AAGTTTGACT 
TACTCCATTC 
ACGTGACCGG 
GGTGACAACG 
CGACGGCCTG 



CTGCGACTGA 
CGTCAGGTTG 
GGTTGATGAT 
TTCTGTCTGC 
TCAGCGCTGA 
CGAGCTGGCT 
TCGACAAGCC 
CGTGGTACCG 
CGAAGAAGTT 
CCGGCGTTGA 
AACGTGGGTG 
TCAGGTCCTG 
CAGAAGTGTA 
TTCAAAGGCT 
TACCATCGAA 
TGAACATGGT 
CGTTTCGCAA 



CGGCCCAATG 
GCGTTCCTTT 
GAAGAGCTGC 
TTACGACTTC 
AAGCACTGGA 
GGTTTCCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTGG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAACCAG 
CATCCTGAGC 
ACCGTCCACA 
CTGCCAGAAG 
TGTAACCCTG 



CCTCAGACCC 
CATCATCGTA 
TGGAACTGGT 
CCTGGTGATG 
AGGCGAACCA 
ATTCTTACAT 
CCAATCGAAG 
TCGTGTTGAG 
GTATCAAAGA 
AAACTGCTGG 
TGGTATCAAG 
GTTCAATCAA 
AAAGAAGAAG 
GTTCTACTTC 
GCGTAGAGAT 
ATCCACCCAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
830 



50 



2) INFORMATION FOR SEQ ID NO: 171 



(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 

GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTACCTTAC ATCATCGTAT 100 

10 TCATGAACAA ATGCGACATG GTAGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGCGAACT GCTGTCTGCT TACGACTTCC CAGGCGACGA 200 

CCTGCCGGTA^ATCCGTGGTT "CCGCGCTGAAT AGCGCTGGAA - GGCGAAGCCG 250- 

AGTGGGAAGC GAAAATCGTT GAGCTGGCAG AAGCGCTGGA CAGCTACATC 300 

CCAGAGCCAG AGCGTGCTGT AGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

15 CGTATTCTCC ATCTCCGGTC GTGGTACCGT' TGTTACCGGT CGTGTAGAGC 400 
GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTAGG TATCAAAGAC * 450 

ACCGTTAAGT CTACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CGAAGCCAGG TTCAATCAAG 600 

20 CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACGT GAACATGAAA GTTACTCTGA TTCACCCAAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCAA 829 

25 



2) INFORMATION FOR SEQ ID NO: 172 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 
40 (B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

4 5 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG , ATTCTTACAT 300 

50 TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

50 TCGCGATGGA CGACGGTCTG CGTTTC 82 6 
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2) INFORMATION FOR SEQ ID NO: 173 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 



20 TGGTAGTTGC 
CTGCTGGGTC 
ATGCGACATG 
TTCGTGAACT 
GTTCGTGGTT 

25 GAAAATCCTG 
AGCGTGCGAT 
ATCTCCGGTC 
CAAAGTTGGT 
CTACCTGTAC 

30 GCTGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 
CGTGACTGGT 

35 GCGACAACAT 
GACGGTCTGC 



TGCGACTGAC 
GTCAGGTAGG 
GTTGATGACG 
TCTGTCTCAG 
CTGCTCTGAA 
GAACTGGCTG 
TGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
CAAAATGGTT 
GTTTCGCA 



GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
GCTTCCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CGAAGCCRGG 
ATTCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 



CGCAGACTCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGACGCAG 
TTCYTAYATT 
CGATCGAAGA 
CGTGTAGAAC 
TATCAAAGAG 
AACTGCTGGA 
GGTATCAAAC 
CACCATCAAG 
AAGATGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCCACCCGAT 



TGAGCACATC 
TCCTGAACAA 
GAAATGGAAG 
CACTCCGATC 
AGTGGGAAGC 
CCGGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACYCAGAAGT 
CGAAGGCCGT 
GTGAAGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACTACTGA 
GTAATGCCGG 
CGCGATGGAC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
818 



40 2) INFORMATION FOR SEQ ID NO: 174 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 



CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 200 

60 TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 
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GCGAAAATCC 
AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTTG 
GTCTACCTGT 
GTGCTGGTGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 



GGGCGACAAC 
ACGACG 



TGGAACTGGC TGGCTTCCTG GATTCTTACA TTCCGGAACC 
ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 
TCGTGGTACC GTTGTTACCG GTCGTGTAGA ACGCGGTATC 
GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 
ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 
GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 
GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 
TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGMCGTC 
CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 
GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 
AT CAAAATGG ~TTGTTACCCT~GATCCACCCG "ATCGCGATGG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



20 



25 



30 



35 



40 



45 



50 
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2 ) INFORMATION FOR SEQ ID NO: 175 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 2 9930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 



TCTGCTGCTG ACGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC 50 

ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC AAAGTTGACA 100 

10 TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC 150 

T TATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG TAATCGCTGG 200 

TTCAGCATTA AAAGCTTTAG~AAGGCGATGC "TCAATACGAA-GAAAAAATGT 2 50 

TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC AGAACGTGAT 300 

TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG 350 

15 TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG 400 

GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA AACAACTGTT 450 

ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG AAGCTGGTGA 500 

CAACATTGGT GCATTATTAO GTGGTGTTGC TCGTGAAGAC GTACAACGTG 550 

GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC TGAATTCAAA 600 

2 0 GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC ACACTCCATT 650 

CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT GACGTAACTG 700 

GTGTTGTTCA CTTACC 716 



25 



35 



2) INFORMATION FOR SEQ ID NO: 177 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 719 bases 
3 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 29247 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 

TTCTTTTATC ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC 50 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

45 TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 200 

GAAAAAATCT TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC 250 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA 400 

50 AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 500 

GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC 550 

TGAATTCAAA GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC 600 

ACACTCCATT CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT 650 

55 GACGTAACTG GTGTTGTTCA CTTACCAGAA GGTMCTGAAA TGGTAATGCC 700 

TGGTGATAAC GTTGAAATG 719 



60 2) INFORMATION FOR SEQ ID NO: 178 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 



15 



30 



45 



(B) STRAIN: ATCC~~3~359T~ 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 



GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

20 CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

25 ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTC 625 



2) INFORMATION FOR SEQ ID NO: 179 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 



GTTGGTGTAC CAGCATTAGT AGTATTCTTA AACAAAGTTG ACATGGTTGA 50 

CGATGAAGAA TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAA 100 

50 GCGAATATGA CTTCCCAGGT GACGATGTAC CTGTAATCGC TGGTTCAGCA 150 

TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA TCTTAGAATT 200 

AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT GATTCTGACA 250 

AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC TGGTCGTGGT 300 

ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG TTGGTGAAGA 350 

55 AGTTGAAATC ATCGGTTTAC ATGACACATC TAAAACAACT GTTACAGGTG 4 00 

TTGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG TGACAACATT 450 

GGTGCATTAT TACGTGGTGT TGCTCGTGAA GACGTACAAC GTGGTCAAGT 500 

ATTAGCTGCT CCTGGTTCAA TTACACCACA TACTGAATTC AAAGCAGAAG 550 

TATACGTATT ATCAAAAGAC GAAGGTGGAC GTCACACTCC ATTCTTCTCA 600 

60 AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA CTGGTGTTGT 650 
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TCACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT 
TGAC 



AACGTTGAAA 



700 
704 



20 



25 



30 



35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 180 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 730 bases 
(B) TYPE: Nucleic acid 

( C ) S TRANDEDNES S f~ DbuEle - 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 6538 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACG.TCA CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC 650 

CGTACTACTG ACGTAACTGG TGTTGTTCAC TTACCAGAAG GTACTGAAAT 700 

GGTAATGCCT GGTGATAACG TTGAAATGAC 730 



2) INFORMATION FOR SEQ ID NO: 181 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN : ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

CGGTGCGATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACCA AGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATACGACTAC CCAGGTGACG 2 00 

ATGTACCTGT AATCTCTGGT TCTGCGTTGA AAGCATTAGA AGGCGACAAA 250 
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GAATACGAAC AAAAAATCTT AGACTTAATG CAACAAGTTG ACGATTAGAT 300 

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGAAAGA 450 

5 CGGTTCACAA AAAACAACAG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTATT 550 

TCACGTGAAG AAGTACAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600 

TACACCACAC ACTAAATTCA CTGCAGAAGT TTACGTATTA TCTAAAGATG 650 

AAGGTGGACG TCACACTCCA TTCTTCTCTA ACTACCGTCC ACAATTCTAT 700 

-1-0 T-TCCGTACTA^CTGACGTAAC AGG TGTTGTT ACTTTACCAG AAGGTACAGA 750 

AATGGTAATG CCTGGCGATA ACGTTAAAAT GGAAGTTGAA~TTAATTTCTC 800" 

CAATCGCTAT CGAAGACGGT ACTCGTTTCT CAAT 834 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 182 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

35 TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGATG 200 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

40 CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATCCACGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATCAC 600 

ACCACACACT AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

45 GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCTA 800 

TCGCTATTGA AGACGGTACT CGTTTCTCAA TCGGA 83 5 



2) INFORMATION FOR SEQ ID NO: 183 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

110 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Macrococcus caseolyticus 

(B) STRAIN: ATCC 13548 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 

GTATCTTAGT AGTATCTGCT GCTGACGGTC CAATGCCACA AACTCGTGAA 50 

CACATCCTTT TATCACGTAA CGTTGGTGTA CCAGCATTAG TAGTATTCTT 100 

10 GAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTTGAAA 150 

TGGAAGTTCG TGACTTATTA TCTGAATATG ACTTCCCTGG TGACGATGTA 200 

CCTGTAATCG CTGGATCTGC "TTTAAAAGCA^TTAGAAGGCG^TTGAAGAATA- — ~ — 250 

CGAAGACAAA ATCATGGAAT TAATGGACGC AGTTGATGAG TACATCCCAA 300 

CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

15 TTCTCAATCA CTGGTCGTGG TACAGTTGCA ACTGGACGTG TTGAGCGTGG 400 

ACAAGTTAAA GTTGGTGAAG AAGTTGAAAT CATTGGTTTA ACTGAAGAAC 450 

CAGCAAAAAC TACAGTTACA GGTGTAGAAA TGTTCCGTAA ATTATTAGAT 500 

TACGCTGAAG CTGGAGATAA CATCGGTGCT TTATTACGTG GTGTTTCTCG 550 

TGAAGACGTA CAACGTGGAC AAGTATTAGC TAAACCAGGT TCAATTACTC 600 

2 0 CACATACTAA ATTCAAAGCT GAAGTTTACG TATTATCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CACTAACTAC CGCCCTCAGT TCTACTTCCG 700 

TACAACTGAC GTAACTGGTG TAGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TAATGCCTGG AGATAACATC GAAATGAACG TTGAATTAAT TTCTCCAATC 800 

GCGA 804 

25 



2) INFORMATION FOR SEQ ID NO: 184 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
40 (B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

45 GTGAACATAT CCTTTTATCA CGTAACGTTG GTGTTCCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCTCTGGT TCAGCATTAA AAGCTCTTGA AGGCGACGCT 250 

GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATGACTTCAT 300 

50 TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG GCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGCAAGA 450 

AGATTCAAGC AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATTG GTGCGTTATT ACGTGGTGTT 550 

55 GCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600 

TACACCACAC ACAAACTTTA AAGCGGAAGT TTACGTTTTA TCAAAAGATG 650 

AAGGTGGCCG TCATACGCCA TTCTTCAGTA ACTATCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC AGGTGTTGTT ACTTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAGAAAT GGAAGTTGAA CTAATTTCTC 800 

50 CAATCGCTAT CGAAGACGGT ACACGTTTCT CT 832 
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2) INFORMATION FOR SEQ ID NO: 185 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 



20 ATCTGCTGCT GACGGTCCAA 
CACGTAACGT TGGTGTACCA 
ATGGTAGACG ACGAAGAATT 
CTTATTAAGC GAATATGACT 
GTTCTGCATT AAAAGCATTA 

25 TTAGACTTAA TGCAAGCAGT 
TTCTGACAAA CCATTCATGA 
GTCGTGGTAC TGTTGCTACA 
GGTGAAGAAG TTGAAATCAT 
TACTGGTGTA GAAATGTTCC 

3 0 ACAACATCGG TGCTTTATTA 
GGTCAAGTAT TAGCTGCTCC 
AGCTGAAGTA TACGTATTAT 
TCTTCACTAA CTATCGCCCA 



TGCCACAAAC TCGTGAACAC 
GCATTAGTTG TATTCTTAAA 
ATTAGAATTA GTTGAAATGG 
TCCCAGGTGA CGATGTACCT 
GAAGGCGATG CTGAATACGA 
TGATGATTAC ATTCCAACTC 
TGCCAGTTGA GGACGTATTC 
GGCCGTGTTG AACGTGGTCA 
CGGTATGCAC GAAACTTCTA 
GTAAATTATT AGACTACGCT 
CGTGGTGTTG CACGTGAAGA 
TGGTTCTATT ACACCACACA 
CTAAAGATGA AGGTGGACGT 
CAATTCTATT TCCGTACTAC 



ATCTTATTAT 50 

CAAAGTTGAC 100 

AAGTTCGTGA 150 

GTAATCGCTG 200 

ACAAAAAATC 250 

CAGAACGTGA 300 

TCAATCACTG 350 

AATCAAAGTT 400 

AAACAACTGT 450 

GAAGCTGGTG 500 

CGTACAACGT 550 

CAAAATTCAA 600 

CACACTCCAT 650 

TGACGTAAC 699 



35 



2) INFORMATION FOR SEQ ID NO: 186 



( i ) SEQUENCE CHARACTERISTICS : 
4 0 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 



CGGCGGTATC TTAGTAGTAT 
GTGAACACAT TCTTTTATCA 
TTCTTAAATA AAGTTGACAT 
TGAAATGGAA GTACGTGACT 
ATGTACCTGT AATCGCTGGT 
CAATACGAAG AAAAAATCTT 
TCCAACTCCA GAACGTGATT 
ACGTATTCTC AATCACTGGT 



CTGCTGCTGA CGGTCCAATG 
CGTAACGTTG GTGTACCAGC 
GGTTGACGAT GAAGAATTAT 
TATTATCTGA ATACGACTTC 
TCAGCATTAA AAGCTTTAGA 
AGAATTAATG CAAGCAGTTG 
CTGACAAACC ATTCATGATG 
CGTGGTACTG TTGCTACAGG 
112 



CCACAAACTC 50 

ATTAGTAGTA 100 

TAGAATTAGT 150 

CCAGGTGACG 200 

AGGCGATGCT 250 

ATGACTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 400 
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CGTGGGCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATTG G¥A , rcCflTeR , 4*) 

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CATTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATCAC 600 

5 ACCTCACACA AAATTTAAAG CAGACGTATA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCACAAACT ATCGTCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTCA 829 



2) INFORMATION FOR SEQ ID NO: 187 

15 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 705 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 
25 (B) STRAIN: CSG 123 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

CACAAACTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT 50 

30 TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT 100 

AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCTGAA TATGACTTCC 150 

CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 200 

GGCGACGAAA AATACGAAGA AAAAATCTTA GAATTAATGC AAGCAGTTGA 250 

TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTCATGATGC 30 0 

35 CAGTTGAGGA CGTATTCTCA ATCACTGGTC GTGGTACTGT TGCTACAGGC 350 

CGTGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGTTG AAATCATCGG 40 0 

TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA ATGTTCCGTA 450 

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC TTTATTACGT 500 

GGTGTTGCTC GTGAAGACGT ACAACGTGGT CAAGTATTAG CTGCTCCTGG 550 

40 TTCAATTACA CCACATACAA AATTCAAAGC GGAAGTTTAC GTTTTATCTA 600 

AAGACGAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGCCCACAA 650 

TTCTATTTCC GTACTACTGA CGTAACTGGC GTTGTTCAAT TACCAGAAGG 700 

TACTG 705 



2) INFORMATION FOR SEQ ID NO: 188 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 678 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 23 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

TTTTATCACG TAACGTTGGT GTACCAGCAT TAGTAGTATT CTTAAACAAA 50 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTTG AAATGGAAGT 100 

5 ACGTGACTTA TTATCTGAAT ACGACTTCCC AGGTGACGAC GTACCTGTAA 150 

TCGCTGGTTC AGCTTTAAAA GCTTTAGAAG GCGATGCTCA ATACGAAGAA 200 

AAAATCTTAG AATTAATGCA AGCAGTTGAT GATTACATTC CAACTCCAGA 250 

ACGTGACTCT GATAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 300 

TCACTGGTCG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGTCAAATC 350 

-1-0 AAAGT-TGGTG- AAGAAGTXGA^^TJ'ATTGGT^ ATCAAAGAAA CTTCTAAAAC 4 00 

AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG 450" 

CTGGTGACAA CATCGGTGCT TTATTACGTG GTGTTGCTCG TGAAGATGTA 500 

CAACGTGGTC AAGTATTAGC TGCTCCAGGT TCAATTACAC CTCACACAAA 550 

ATTCAAAGCA GACGTATACG TTTTATCAAA- AGATGAAGGT GGACGTCATA 600 

15 CTCCATTCTT CACTAACTAT CGTCCACAAT TCTATTTCCG TACTACTGAC 650 

GTAACTGGTG TTGTTAACTT ACCAGAAG 678 



20 2) INFORMATION FOR SEQ ID NO: 189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 33 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 



ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

10 CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 4 00 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

15 ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGG 668 

50 



2) INFORMATION FOR SEQ ID NO: 190 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
5 (B) STRAIN: CSG 8 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 190 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TTGAAATGGA 50 

10 AGTACGTGAC TTATTATCTG AATACGACTT CCCAGGTGAC GATGTACCTG 100 

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 150 

GAAAAAATCT~TAGAATTAAT"GCAAGCAGTT~ GATGATTACA— TTCCAACTCG 200- 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 250 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 300 

15 ATCAAAGTTG GTGAAGAAGT TGAAATCATT GGTATCCATG ACACTTCTAA 350 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 400 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 450 

GTACAACGTG GTCAAGTATT AGCTGCTCCA GGTTCAATCA CACCTCACAC 500 

AAAATTTAAA GCAGACGTAT ACGTTTTATC TAAAGACGAA GGTGGACGTC 550 

20 ACACTCCATT CTTCACAAAC TATCGTCCAC AATTCTATTT CCG 593 



2 ) INFORMATION FOR SEQ ID NO: 191 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

40 CGGCGCTATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 2 00 

ACGTACCTGT AATCGCTGGT TCAGCTTTAA AAGCTTTAGA AGGCGATGCT 250 

4 5 CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTATAT 300 

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATTATTG GTATCAAAGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

50 ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGATG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATTAC 600 

ACCTCACACA AAATTCAAAG CAGACGTATA CGTTTTATCA AAAGATGAAG 650 

GTGGACGTCA TACTCCATTC TTCTCTAACT ATCGTCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

55 GGTAATGCCT GGTGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTC 828 



60 2) INFORMATION FOR SEQ ID NO: 192 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) QRGANI SM : Staphylococcus warneri 

(B) STRAIN: ATCC 35982 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



ATGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC ACGTAACGTT 50 

GGTGTACCAG CTTTAGTTGT ATTCTTAAAC AAAGTTGATA TGGTAGACGA 100 

CGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC TTATTATCTG 150 

AATATGACTT CCCAGGTGAC GACGTACCTG TAATCGCTGG TTCAGCATTA 200 

20 AAAGCTTTAG AAGGCGACGA AAAATACGAA GAAAAAATCT TAGAATTAAT 250 

GCAAGCAGTT GATGACTACA TTCCAACTCC AGAACGTGAT TCTGACAAAC 300 

CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG TCGTGGTACT 350 

GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG GTGAAGAAGT 400 

TGAAATCATC GGTTTACATG ACACTTCTAA AACAACTGTT ACTGGTGTAG 450 

25 AAATGTTCCG TAAGTTATTA GACTACGCTG AAGCTGGTGA CAACATCGGT 500 

GCTTTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG GTCAAGTATT 550 

AGCTGCTCCT GGTTCAATTA CACCACATAC AAAATTCAAA GCGGAAGTTT 600 

ACGTTTTATC TAAAGACGAA 62 0 



2) INFORMATION FOR SEQ ID NO: 193 



(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

ATTATTAGAA TTAGTTGAAA TGGAAGTACG TGACTTATTA TCTGAATACG 100 

50 ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 150 

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 200 

AGTTGATGAT TATATTCCAA CTCCAGAACG TGACTCTGAT AAACCATTCA 250 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 350 

55 TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 450 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 600 

60 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 
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AGAAGGTACT GAAATGGTAA TGCCTGGTGA CAACGTTGAA AT [ 



5 2) INFORMATION FOR SEQ ID NO: 194 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

__(D.) TO.P_QLOQ3Q Li near 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 36 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

20 

CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 50 

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100 

GAAGTACGTG ACTTATTATC TGAATACGAC TTCCCAGGTG ACGACGTACC 150 

TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGCGAT GCTCAATACG 200 

25 AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 250 

CCAGAACGTG ACTCTGATAA ACCATTCATG ATGCCAGTTG AGGACGTATT 300 

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 350 

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 4 00 

AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 450 

30 TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT GCTCGTGAAG 500 

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC 550 

ACAAAATTCA AAGCAGACGT ATACGTTTTA TCAAAAGATG AAGGTGGACG 600 

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 650 

CTGACGTAAC TGGTGTTGTT AACTTACCAG AAGG 684 

35 



2) INFORMATION FOR SEQ ID NO: 195 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 
50 (B) STRAIN: CSG 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 

ACCAGCATTA GTAGTATTCT TAAACAAAGT TGACATGGTT GACGATGAAG 50 

55 AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGACGT ACCTGTAATC GCTGGTTCAG CTTTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 2 00 

CAGTTGATGA TTACATTCCA ACTCCAGAAC GTGACTCTGA TAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

60 TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 
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TTATTGGTAT CAAAGAAACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCTTT 450 

ATTACGTGGT GTTGCTCGTG AAGATGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATTACACCT CACACAAAAT TCAAAGCAGA CGTATACGTT 550 

5 TTATCAAAAG ATGAAGGTGG ACGTCATACT CCATTCTTCA CTAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGGTA ATGCCTGGCG ACAAC 685 



LO _ 

2) INFORMATION FOR SEQ ID "NO : T96 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 611 bases 
L5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 62 

IS (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACGA CTTCCCAGGT GACGACGTAC CTGTAATCGC 50 

TGGTTCAGCT TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA 100 

TCTTAGAATT AATGCAAGCA GTTGATGATT ACATTCCAAC TCCAGAACGT 150 

30 GACTCTGATA AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC 200 

TGGTCGTGGT ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG 250 

TTGGTGAAGA AGTTGAAATT ATTGGTATCA AAGATACTTC TAAAACAACT 300 

GTTACTGGTG TAGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG 350 

TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGTGAA GATGTACAAC 400 

35 GTGGTCAAGT ATTAGCTGCT CCAGGTTCAA TCACACCTCA CACAAAATTC 450 

AAAGCAGACG TATATGTTTT ATCAAAAGAT GAAGGTGGAC GTCATACTCC 500 

ATTCTTCACT AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA 550 

CTGGTGTTGT TAACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGCGAC 600 

AACGTTGAAA T 611 

10 



2) INFORMATION FOR SEQ ID NO: 197 

15 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
55 (B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 50 
50 GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 100 

118 
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TTCTTAAACA AAGTTGACAT 
AGAAATGGAA GTTCGTGATT 
ATGTGCCTGT AATCGCTGGT 
AAATACGAAG CTAAAATCTT 
TCCAACTCCA GAACGTGACT 
ACGTATTCTC AATCACTGGT 
CGTGGTCAAA TCAAAGTCGG 
TACTACTAAA ACAACTGTTA 
ACTACGCTGA AGCTGGTGAC 
CGT GAAGATG TACAACGTGG 
ACCTCACACT AAATTTAAAG 
GTGGACGTCA TACACCATTC 
CGTACTACAG ACGTAACTGG 
GGTTATGCCT GGCGACAACG 
TCGCTATCGA AGACGGAACT 



GGTTGACGAT GAAGAATTAT 
TATTAACTGA ATATGACTTC 
TCAGCATTAA AAGCTTTAGA 
AGAATTAATG GATGCAGTTG 
CTGACAAACC ATTCATGATG 
CGTGGTACTG TTGCTACAGG 
TGAAGAAGTT GAAATTATTG 
CTGGTGTAGA AATGTTCCGT 
AACATCGGTG CGTTATTACG 
ACAAGTATTA GCTGCTCCAG 
CTGACGTATATTGTTTTATCT 
TTCTCAAACT ACCGCCCACA 
TGTTGTTAAC TTACCAGAAG 
TTGAAATGAC AGTTGAATTA 
CGTTTCTC 



TAGAATTAGT 150 

CCAGGTGACG 2 00 

AGGCGACGAA 250 

ATAACTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 4 00 

GTATCCACGA 4 50 

AAATTATTAG 500 

TGGTGTTGCT 550 

GTTCAATTAC 600 

AAAGATGAAG 650 

ATTCTATTTC 700 

GTACAGAAAT 750 

ATCGCTCCAA 800 
828 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 35552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 



35 



40 



45 



AGTAGTATCT 
TTTTATCACG 
GTTGACATGG 
TCGTGACTTA 
TCTCTGGTTC 
AAAATCTTAG 
ACGTGATTCT 
TCACTGGTCG 
AAAGTCGGTG 
AACAACTGTT 
AAGCTGGTGA 
GTACAACGTG 
AAAATTCAAA 
ATACACCATT 



GCTGCTGATG 
TAACGTTGGT 
TTGACGATGA 
TTAAGCGAAT 
TGCATTAAAA 
ACTTAATGCA 
GACAAACCAT 
TGGTACTGTT 
AAGAAATCGA 
ACTGGTGTAG 
CAACATTGGT 
GTCAAGTTTT 
GCGGATGTTT 
CTTCACTAAC 



GCCCAATGCC 
GTTCCAGCAT 
AGAATTATTA 
ATGACTTCCC 
GCTTTAGAAG 
AGCTGTTGAT 
TCATGATGCC 
GCTACAGGCC 
AATCATCGGT 
AAATGTTCCG 
GCATTATTAC 
AGCTGCTCCT 
ACGTTTTATC 
TACCGCCCAC 



ACAAACTCGT 
TAGTTGTATT 
GAATTAGTAG 
AGGTGACGAT 
GCGACGCTGA 
GACTTCATTC 
AGTTGAGGAC 
GTGTTGAACG 
ATGCAAGAAG 
TAAATTATTA 
GTGGTGTTTC 
GGTACTATTA 
TAAAGATGAA 
AATTCTATTT 



GAACACATTC 
CTTAAACAAA 
AAATGGAAGT 
GTACCTGTAA 
CTATGAGCAA 
CAACACCAGA 
GTATTCTCAA 
TGGTCAAATC 
AATCAAGCAA 
GACTACGCTG 
ACGTGATGAC 
CACCACATAC 
GGTGGTCGTC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
690 
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2) INFORMATION FOR SEQ ID NO: 199 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 723 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG 83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 



GCATTAGTTG TATTCTTAAA CAAAGTTGAC ATGGTTGACG ATGAAGAATT 50 

ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTTATTAAGC GAATATGACT 100 

10 TCCCAGGTGA CGATGTACCT GTAATCTCTG GTTCTGCATT AAAAGCTTTA 150 

GAAGGCGACG^CTGACT^TGA_GCAAAAAATC ^TAGACTTAA_ TGCAAGCTGT 200 

TGATGACTTC ATTCCAACAC CAGAACGTGA TTCTGACAAA~"CCATTCATGAT 250" 

TGCCAGTTGA GGACGTATTC TCAATCACTG GTCGTGGTAC TGTTGCTACA 300 

GGCCGTGTTG AACGTGGTCA AATCAAAGTC GGTGAAGAAA TCGAAATCAT 350 

15 CGGTATGCAA GAAGAATCAA GCAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT TGGTGCATTA 450 

TTACGTGGTG TTTCACGTGA TGACGTACAA CGTGGTCAAG TTTTAGCTGC 500 

TCCTGGTACT ATTACACCAC ATACAAAATT CAAAGCGGAT GTTTACGTTT 550 

TATCTAAAGA TGAAGGTGGT CGTCATACAC CATTCTTCAC TAACTACCGC 600 

20 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 

AGAAGGTACT GAAATGGTTA TGCCTGGCGA TAACGTTGAA ATGGATGTTG 700 

AATTAATTTC TCCAATCGCT ATT 723 



25 



35 



2) INFORMATION FOR SEQ ID NO: 200 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 697 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa 18 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 00 

CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT GACATGGTTG 50 

ACGATGAAGA ATTATTAGAA TTAGTAGAAA TGGAAGTTCG TGACTTATTA 100 

AGCGAATATG ACTTCCCAGG TGACGATGTA CCTGTAATCT CTGGTTCTGC 150 

45 ATTAAAAGCT TTAGAAGGCG ACGCTGACTA TGAGCAAAAA ATCTTAGACT 200 

TAATGCAAGC TGTTGATGAC TTCATTCCAA CACCAGAACG TGATTCTGAC 250 

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG 300 

TACTGTTGCT ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTCGGTGAAG 350 

AAATCGAAAT CATCGGTATG CAAGAAGAAT CAAGCAAAAC AACTGTTACT 400 

50 GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG CTGGTGACAA 450 

CATTGGTGCA TTATTACGTG GTGTTTCACG TGATGACGTA CAACGTGGTC 500 

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CACATACAAA ATTCAAAGCG 550 

GATGTTTACG TTTTATCTAA AGATGAAGGT GGTCGTCATA CACCATTCTT 600 

CACTAACTAC CGCCCACAAT TCTATTTCCG TACTACTGAC GTAACTGGTG 650 

55 TTGTTAACTT ACCAGAAGGT ACTGAAATGG TTATGCCTGG CGATAAC 697 



2) INFORMATION FOR SEQ ID NO: 201 

60 

120 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus sciuri subsp. sciuri 
(B) STRAIN: J^TCC_29060 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 



15 CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCTCAAACTC 50 

GTGAGCACAT TCTTTTATCA CGTAACGTAG GTGTTCCTGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGCGACG 200 

ACGTTCCTGT AATTGCTGGT TCAGCATTAA AAGCATTAGA AGGCGACGAA 250 

20 GCTTACGAAG ACAAAATCAT GGAATTAATG GATGCTGTTG ATACATTCAT 300 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCACTGTTGG TGAAGAAGTT GAAATCATCG GTTTAACTGA 450 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

25 TAGACTTCGC TGAAGCTGGA GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCTCGTGAAG ACGTTAACCG TGGTCAAGTA TTAGCTAAAC CAGGTTCAAT 600 

CACACCTCAC ACTAAATTCA AAGCTGAAGT TTATGTATTA TCTAAAGACG 650 

AAGGTGGACG TCATACTCCA TTCTTCACAA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTGTAGTT AACTTACCAG AAGGTACTGA 750 

3 0 AATGGTTATG CCTGGCGACA ACGTTGAAAT GGACGTTGAA TTAATTTCAC 800 

CAATCGCTAT TGAAGACGGT ATCGTTTCTC AATCA 835 



35 2) INFORMATION FOR SEQ ID NO: 202 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 02 



CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

55 ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

60 CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 
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ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
ACCACATACA AAATTCAAAG 
GTGGACGTCA CACTCCATTC 
CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGTGATAACG 
TCGCGATTGA AGACGGTACT 



AACATCGGTG 
TCAAGTATTA 
CGGAAGTTTA 
TTCAGTAACT 
CGTTGTTCAA 
TTGAAATGAC 
CGTTTCTCAA 



CTTTATTACG TGGTGTTGCT" 
GCTGCTCCTG GTTCAATTAC 
CGTTTTATCT AAAGACGAAG 
ACCGCCCACA ATTCTATTTC 
TTACCAGAAG GTACTGAAAT 
AGTAGAATTA ATCGCTCCTA 
C 



5%a 



600 
650 
700 
750 
800 
831 
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20 
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30 
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45 



50 



55 



2 ^INFORMATION FOR SEQ ID N O: 203 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 600 

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTCA 829 



2) INFORMATION FOR SEQ ID NO: 204 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 15707 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 



PCT/CAOO/01150 



TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCCCAGACTC 50 

GCGAGCACGT GCTGCTCGCC CGTCAGGTTG GCGTTCCGAA GATCCTCGTC 100 

5 GCCCTGAACA AGTGCGACAT GGTCGACGAT GAAGAGCTCA TCGAGCTCGT 150 

CGAAGAAGAG GTCCGCGACC TCCTCGACGA GAACGGCTTC GACCGTGACT 200 

GCCCGGTCAT CCACACCTCC GCTTACGGTG CTCTGCACGA CGACGCTCCG 250 

GACCACGAGA AGTGGGTCCA GTCCGTTAAG GACCTCATGG ACGCTGTCGA 300 

CGACTACATC CCGACCCCGG TTCACGACCT GGACAAGCCG TTCCTGATGC 350 

L0 CGATCGAGGA CGTCTTCACC ATCTCCGGCC GTGGTACCGT TGTCACCGGT 400 

CGT GTCGAG C GTGGCCAGCT GGCCGTCAAC ACCCCGGTCG AGATCGTTGG 450 

TATCCGTCCG ACCCAGCAGA"CCACCGTCAC~ CTCCATCGAG^ACCTTCCACA 500~ 

AGACCATGGA CGCCTGCGAG GCTGGCGACA ACACCGGTCT GCTTCTGCGT 550 

GGTCTCGGCC GTGACGATGT CGAGCGTGGC CAGGTTGTGG CCAAGCCGGG 600 

15 CTCCGTCACC CCGCACACCA AGTTCGAGGG- CGAAGTCTAC GTGCTGACCA 650 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCATCGAGC TGCCGGAAGG 750 

CGTCGAGATG GTTCAGCCGG GCGACCACGC TACCTTCACC GTTGAGCTGA 800 

TTCAGCCCAT CGCTATGGAG GAAGGCCTGA CCTTCGCTG 839 

20 



2) INFORMATION FOR SEQ ID NO: 205 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 
35 (B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

CGGCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

10 GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTGCGCGAAC TGCTGAGCAA GTACGAGTTC CCGGGCGACG 2 00 

ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGCGCTGGA AGGCGACCAG 2 50 

AGCGACATCG GCGTGCCGGC CATCCTGAAG CTGGTCGACG CGCTGGACAG 300 

15 CTGGATTCCG GAGCCGGAGC GTGCGATCGA CAAGCCGTTC CTGATGCCGG 350 

TGGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT GACCGGTCGT 4 00 

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTCGGCAT 450 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTGGACCA GGGTCAGGCA GGCGACAACG CTGGCCTGCT GCTGCGCGGC 550 

50 ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGCAC 600 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTGTCGAAGG 650 

ACGAGGGCGG CCGCCACACC CCGTTCTTCA ACGGCTACCG TCCGCAGTTC 700 

TACTTCCGCA CCACCGACAT CACCGGCGCC GCTGCACTGC CGGAAGGCGT 750 

CGAA 754 

55 

2) INFORMATION FOR SEQ ID NO: 206 
50 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidomini/nus 
LO (B) STRAIN: ATCC 51726 



(xi) SEQUENCE DESCRIPTION: SEQ ID'"NO~20"6" 



TGGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

L5 GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAAAA CCTTATCGTT 100 

TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ATCTTCCAGT TGTTCAAGGT* TCAGCTCTTA AAGCGCTTGA AGGTGATTCA 250 

GCACAAGAAG ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 300 

20 TCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAGG 350 

ATGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTTAA TGACGAAGTT GAAATCGTTG GTATCAAAGA 450 

CGAAATCTCT AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTT.GCTGGA GATAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

25 CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTTCAATC AAATTGCCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGAAGTTGAG TTGATCCACC 800 

30 CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 835 



2) INFORMATION FOR SEQ ID NO: 207 



35 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
J0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
a5 (A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

50 CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ACGAAAAATA 250 

55 CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT TGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

50 GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

124 
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TGATGAAATC GAACGTGGTC AAGTTCTTGC 
CACACACTAA ATTTAAAGGT GAAGTTTACA 
GGACGTCATA CTCCATTCTT CAACAACTAC 
TACAACTGAC GTAACAGGTT CAATCGAACT 
5 TTATGCCTGG TGATAACGTT ACTATCGAAG 
GCCGTAGAAC AAGGTACTA 



TAAACCAGGT ^MfcrfftS* StfO 
TCCTTTCTAA AGAAGAAGGT 650 
CGTCCACAAT TCTACTTCCG 700 
TCCAGCAGGA ACAGAAATGG 750 
TTGAATTGAT TCACCCAATC 800 

819 



L0 2) INFORMATION FOR SEQ ID NO: 208 



L5 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



20 



25 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAAAAATA 250 

CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT CGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 750 

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTA 819 



45 



55 



2) INFORMATION FOR SEQ ID NO: 209 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 13813 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 
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PCT/CAOO/01150 



AGCTATCCTT GTAGTTGCTT CAACTGATGG ACCAATGCCA CAAACTCGTG 50 

AGCACATCCT TCTTTCACGT CAAGTTGGTG TTAAACACCT TATCGTATTC 100 

ATGAACAAAG TTGACCTTGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

5 AATGGAAATT CGTGACCTTC TTTCAGAATA CGACTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CACTTGAAGG CGATGAAAAA ,250 

TACGAAGACA TCATCATGGA ATTGATGAGC ACTGTTGATG AGTACATTCC 300 

AGAACCAGAA CGTGATACTG ACAAACCTTT ACTTCTTCCA GTCGAAGATG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

LO GGTACTGTTC GTGTCAACGA CGAAGTTGAA ATCGTTGGTA TTAAAGAAGA 450 

TATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT _ TGCAGGGGAC _ AACGTTGGTG "TTCTTCTTCG" "TGGTGTTCAA 550 

CGTGATGAAA TCGAACGTGG TCAAGTTCTT GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTTAAAG GTGAAGTTTA CATCCTTTCT AAAGAAGAAG 650 

L5 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA ATTCTACTTC 700 

CGTACAACTG ACGTAACAGG TTCAATCGAA CTTCCAGCAG GAACAGAAAT 750 

GGTTATGCCT GGTGATAACG TTACTATCGA AGTTGAATTG ATTCACCCAA 800 

TCGCCGTAGA ACAAGGTACT ■ AC 822 

20 



2) INFORMATION FOR SEQ ID NO: 210 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDCSS-1073 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 



CGGAGCTATC 
GTGAGCACAT 

10 TTCATGAACA 
TGAAATGGAA 
ACCTTCCAGT 
AAATACGAAG 
TCCAGAACCA 

45 ATGTATTCTC 
CGTGGTACTG 
AGATATCCAA 
TTGACGAAGG 
CAACGTGATG 

50 CAACCCACAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 

55 



CTTGTAGTTG 
CCTTCTTTCA 
AAGTTGACCT 
ATTCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGATA 
AATCACTGGA 
TTCGTGTCAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTTA 
TCATACTCCA 
CTGACGTAAC 
CCTGGTGATA 
AGAACAAGGT 



CTTCAACTGA 
CGTCAAGTTG 
TGTTGATGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAGTT 
TTACTGGTGT 
GACAACGTTG 
TGGTCAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTTACTAT 
ACTAC 



TGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCACTTGA 
AGCACTGTTG 
TTTACTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
CTTGCTAAAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGAAGTTGAA 



CCACAAACTC 50 

CCTTATCGTA 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGCGACGAA 250 

ATGAGTACAT 3 00 

CCAGTTGAAG 350 

ACGTATCGAC 400 

GTATTAAAGA 450 

CGTAAACAAC 500 

TCGTGGTGTT 550 

CAGGTTCAAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

CAGGAACAGA 750 

TTGATTCACC 800 
825 



2) INFORMATION FOR SEQ ID NO: 211 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 
10 (B) STRAIN: ATCC 33397 



~(xlTSE'QUENCE — DESCRIPTION': SEQ - ID - NO: - 2'1~1~ 



GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CTCAAACTCG 50 

15 TGAACACATC CTTCTTTCAC GCCAAGTAGG TGTTAAATAC CTTATCGTCT 100 

TCATGAATAA AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT 150 

GAAATGGAAA TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA 200 

AATCCCAGTT ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATGAAA 250 

AATATGAAGA CATCATCATG GAATTGATGG ATACTGTTGA TGAATACATT 300 

20 CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTTGAAGA 350 

TGTATTCTCA ATTACTGGAC GTGGTACTGT TGCTTCAGGA CGTATCGACC 400 

GTGGTACTGT TAAAGTCAAC GACGAAGTTG AAATCGTTGG TATCCGTGAT 450 

GAAATCCAAA AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAATT 500 

GGACGAAGGT CTTGCTGGAG ATAACGTAGG GGTTCTTCTT CGTGGTATCC 550 

25 AACGTGACGA AATCGAACGT GGACAAGTTC TTGCTAAACC AGGTTCAATT 600 

CATCCACACA CTAAATTCAA AGGTGAAGTT TACATCCTTA CTAAAGAAGA 650 

AGGTGGACGT CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTACT 700 

TCCGTACTAC AGACGTTACA GGTTCAATCG AACTTCCTGC AGGTACTGAA 750 

ATGGTAATGC CTGGTGATAA CGTAACAATC GACGTTGAAT TGATCCACCC 800 

30 AATTGCCGTA GAACAAGGAA CTACAT 826 



2) INFORMATION FOR SEQ ID NO: 212 



35 



( i ) SEQUENCE CHARACTERI ST I C S : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 5 (A) ORGANISM: Streptococcus bovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

50 TGGTGCTATC CTTGTAGTAG CTTCTACAGA TGGTCCAATG CCACAAACAC 50 

GTGAACACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACCT TGTTGATGAC GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

AAATCCCTGT AATCCAAGGT TCAGCTCTTA AAGCCCTTGA AGGTGACACT 250 

55 CACTACGAAG ACATCATCAT GGAATTGATG AACACTGTAG ATGAATACAT 300 

TCCAGAACCA AAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCATCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTCAA CGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

CGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

50 TTGATGAAGG TATCGCAGGG GATAACGTTG GTGTTCTTCT TCGTGGTATC 550 
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CAACGTGATG AAATCGAACG TGGTCAAGTT 
CCACCCACAC ACTAAATTCA AAGGTGAAGT 
AAGGTGGACG TCACACTCCA TTCTTCAACA 
TTCCGTACAA CTGACGTTAC AGGTTCAATC 
5 AATGGTAATG CCTGGTGATA ACGTTACTAT 
CAATCGCCGT TGAACAAGGT ACTACAT 



CTTGCTAAAC CAGGTTCAAT 600 
TTACATCCTT ACTAAAGAAG 6 50 

ACTACCGTCC TCAATTCTAC 700 
GAACTTCCAG CAGGTACTGA 750 
CGACGTTGAA TTGATTCACC 800 

827 



10 2 ) INFORMATION FOR SEQ ID NO: 213 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus (deposited as 
Streptococcus constellatus) 

(B) STRAIN: ATCC 27823 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

GCTATCCTCG TAGTAGCTTC AACTGATGGA CCAATGCCTC AAACTCGTGA 50 

ACATATCCTT CTTTCACGTC AAGTAGGTGT TAAATACCTT ATCGTCTTCA 100 

TGAACAAAGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

30 ATGGAAATCC GTGACCTTCT TTCAGAATAC GATTTCCCAG GTGATGAAAT 200 

CCCAGTTATC CAAGGTTCAG CTCTTAAAGC TCTTGAAGGT GATGAAAAAT 250 

ATGAAGACAT CATCATGGAA TTGATGGATA CTGTTGATGA ATACATTCCA 300 

GAACCAGAAC GTGACACTGA CAAACCACTT CTTCTTCCAG TCGAAGATGT 350 

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 4 00 

35 GTACTGTTAA AGTCAATGAT GAAGTTGAAA TTGTTGGTAT TCGTGACGAA 4 50 

ATCCAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AACAATTGGA 500 

CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCGT GGTATCCAAC 550 

GTGACGAAAT CGAACGTGGA CAAGTTCTTG CTAAACCAGG TTCAATTCAT 600 

CCACACACTA AATTCAAAGG TGAAGTTTAC ATCCTTACTA AAGAAGAAGG 650 

40 TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA TTCTACTTCC 700 

GTACTACAGA CGTTACAGGT TCAATCGAAC TTCCTGCAGG TACTGAAATG 750 

GTAATGCCTG GTGATAACGT AACAATTGAT GTTGAGTTGA TCCACCCAAT 800 

TGCCGTAGAA CAAGGAACTA C 821 



45 



2) INFORMATION FOR SEQ ID NO: 214 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cricetus 

(B) STRAIN: ATCC 19642 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

GCTATCCTTG TAGTAGCTTC TACAGACGGA CCAATGCCAC AAACTCGTGA 50 

ACACATCTTG CTTTCACGCC AAGTTGGTGT TAAGAGCCTT ATCGTCTTCA 100 

5 TGAACAAGGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

ATGGAAATCC GTGATCTTCT TTCAGAATAC GATTTCCCAG GTGATGATAT 200 

CCCTGTTGTT CAAGGTTCAG CTCTTAAAGC CCTTGAAGGT GATACAGCTG 250 

CCGAAGACAA GATCATGGAA TTGATGGACA TCGTTGATGA CTACATTCCA 300 

GAACCAAAAC GTGATACTGA TAAGCCATTG CTTCTTCCAG TCGAAGACGT 350 

10 ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 400 

GTACTGTTAA GGTCAATGAC GAAGTTGAAA TCGTTGGTAT CAAGGACGAA 450 

M(TCAAS£AG~< CGGTTGTTAC" CGGAGTTGAA" atgttccgta-aacaattgga —500 

TGAAGGTCTT GCAGGGGATA ACGTTGGTGT GCTTCTTCGT GGTATCCAAC 550 

GTGATGAAAT CGAACGTGGT CAAGTATTGG CTGCACCTGG TTCAATCCAT 600 

15 CCACACACTA AATTCAAGGG TGAAGTTTAO ATCCTTTCTA AAGATGAAGG 650 

TGGACGTCAC ACTCCATTCT TCAACAACTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT TCAATCGAAT TGCCAGCAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACGT TACTATCGAC GTTGAATTGA TCCACCCAAT 800 

CGCTGTTGAA AAAGGTACTA C 821 

20 



2) INFORMATION FOR SEQ ID NO: 215 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cristatus 
35 (B) STRAIN: ATCC 51100 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

4 0 ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTCTTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 2 00 

CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA TACTAAGTAC 250 

GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT ACATCCCAGA 300 

45 ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAGACGTAT 350 

TCTCAATCAC TGGTCGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ACTGTTCGTG TCAACGATGA AATCGAAATC GTTGGTATCA AAGAAGAAAT 450 

CCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAGCTTGACG 500 

AAGGTCTTGC AGGGGACAAC GTAGGTGTAC TTCTTCGTGG TATCCAACGT 550 

50 GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAGGGTG AAGTTTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT CTACTTCCGT 700 

ACAACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAATCG 800 

55 CCGTTGAACA AGGTACTCCT T 821 



60 



2) INFORMATION FOR SEQ ID NO: 216 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 792 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus downei 

(B) STRAIN: ATCC 33748 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 

15 AGTAGCTTCT ACTGATGGAC CAATGCCACA AACTCGTGAA CACATCTTGC 50 

TTTCACGTCA GGTTGGTGTT AAGAACCTTA TCGTCTTCAT GAACAAGGTT 100 

GACTTGGTTG ACGATGAAGA ATTGCTTGAA TTGGTTGAAA TGGAAATCCG 150 

TGACCTGCTT TCAGAATACG ATTTCCCAGG TGATGATATC CCTGTTGTTC 200 

AAGGTTCAGC TCTTAAGGCT CTTGAAGGTG ATACAGCTGC CGAAGACAAG 250 

2 0 ATCATGGAAT TGATGGACAT CGTTGATGAC TACATTCCAG AACCAAAACG 300 

TGATACTGAT AAGCCTTTGC TTCTTCCAGT CGAAGATGTA TTCTCAATCA 350 

CTGGACGTGG TACTGTAGCT TCAGGACGTA TCGACCGTGG TACTGTTAAG 400 

GTCAACGACG AAGTTGAAAT CGTTGGTATC AAGGACGAAA TCCAAAAAGC 450 

AGTTGTTACC GGAGTTGAAA TGTTCCGTAA ACAATTGGAC GAAGGTCTTG 500 

25 CAGGGGATAA CGTTGGTGTG CTTCTTCGTG GTATCCAACG TGATGAAATC 550 

GAACGTGGTC AAGTGTTGGC TGCGCCTGGT TCGATTCACC CACACACTAA 600 

GTTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT GGACGTCATA 650 

CTCCATTCTT TAACAACTAC CGTCCACAGT TCTACTTCCG TACAACTGAC 700 

GTAACTGGTT CAATCGAATT GCCAGCGGGT ACTGAAATGG TTATGCCTGG 750 

3 0 TGATAACGTT ACTATCGACG TTGAATTGAT CCACCCAATT GC 792 



2) INFORMATION FOR SEQ ID NO: 217 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
JO (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43 078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

50 GTAGTTGCTT CAACAGACGG ACCAATGCCA CAAACTCGTG AGCACATCCT 50 

CCTTTCACGT CAGGTTGGTG TTAAACACCT TATCGTGTTC ATGAACAAAA 100 

TTGACCTTGT TGACGATGAA GAATTGCTTG AATTGGTTGA AATGGAAATC 150 

CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC TTCCAGTTAT 2 00 

CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG CGACACTAAA TTTGAAGACA 250 

55 TCATCATGGA ATTGATGGAT ACTGTTGATT CATACATTCC AGAACCAGAA 3 00 

CGTGACACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG TATTCTCAAT 350 

CACAGGTCGT GGTACAGTTG CTTCAGGACG TATCGACCGT GGTACTGTTC 4 00 

GTGTCAACGA CGAAATCGAA ATCGTTGGTA TCAAAGAAGA AACTAAAAAA 450 

GCTGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG ACGAAGGTCT 500 

50 TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 550 
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TCGAACGTGG TCAAGTTATT GCTAAACCAG 

AAATTCAAAG GTGAAGTATA TATCCTTTCT 

CACTCCATTC TTCAACAACT ATCGTCCACA 

ACGTAACAGG TTCAATCGAA CTTCCAGCTG 

5 GGTGATAACG TGACAATCAA CGTTGAGTTG 



PCT/CA00/01150 

GTTCAATCAA CCCACACACT 600 

AAAGACGAAG GTGGACGTCA 650 

ATTCTACTTC CGTACAACTG 700 

GTACAGAAAT GGTTATGCCT 750 

ATCCACCCAA TCGCC 795 



2) INFORMATION FOR SEQ ID NO: 218 

(i) SEQUENCE CHARA CTER ISTIC S: 

(A) LENGTH: 828 bases ' — 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 0 (A) ORGANISM: Streptococcus egui subsp. egui 

(B) STRAIN: ATCC 9528 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

25 CGGAGCTATC CTTGTAGTTG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTTG GTGTTAAGCA CCTTATCGTG 100 

TTCATGAACA AGGTTGACCT TGTTGACGAT GAAGAATTGC TTGAGCTTGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCGCTTA AGGCTCTTGA AGGCGACAGC 250 

3 0 AAATACGAAG ATATCATCAT GGAATTGATG GATACTGTTG ATTCATACAT 300 

TCCAGAACCA GAACGTGACA CAGACAAGCC ATTGCTTCTT CCAGTCGAGG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 4 00 

CGCGGTACTG TTCGTGTTAA CGACGAAATC GAAATCGTTG GTATCAGAGA 450 

CGAGATCAAA AAAGCAGTTG TTACTGGTGT CGAAATGTTC CGTAAACAGC 500 

35 TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT ATTGCTAAGC CAGGTTCTAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT ATATATCCTT ACTAAAGAAG 650 

AAGGTGGACG TCACACACCA TTCTTCAACA ACTATCGTCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC AGGTTCAATC GAGCTTCCAG CAGGTACAGA 750 

4 0 AATGGTTATG CCTGGTGATA ACGTGACTAT TGACGTTGAG TTGATCCACC 800 

CAATCGCCGT AGAACAAGGT ACTACATT 828 



10 



15 



45 2) INFORMATION FOR SEQ ID NO: 219 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 33477 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 
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CGGTGCAATC CTTGTAGTAG CTTCTACAGA 
GTGAGCACAT CCTTCTTTCA CGTCAGGTAG 
TTCATGAACA AAGTTGACTT GGTTGACGAT 
TGAAATGGAA ATCCGTGACC TGCTTTCAGA 
5 ACCTTCCAGT TATCCAAGGT TCAGCTCTTA 
GCTCAAGAAG ATGTTATCAT GGAATTGATG 
CCCAGAACCA GAACGTGATA CTGACAAACC 
ATGTATTCTC AATCACAGGT CGTGGTACTG 
CGTGGTACTG TAAGAGTCAA CGATGAAGTT 

10 CGAAATCACT AAAGCAGTTG TTACCGGTGT 

TGGACGAAGGT f CTTGCTGGT~GATAACGTTG" 
CAACGTGATG AAATCGAACG TGGTCAAGTA 
CAACCCACAC ACTAAATTTA AAGGTGAAGT 
AAGGTGGACG TCATACACCA TTCTTCAACA 

15 TTCCGTACAA CTGACGTAAC TGGTTCAATC 
AATGGTTATG CCTGGTGATA ACGTGACTAT 
CAATCGCCGT TGAACAAGGT ACTAC 



TGGACCAATG CCACAAACTC 50 

GTGTTAAACA CCTTATCGTC 100 

GAAGAATTGC TTGAATTGGT 150 

ATATGATTTC CCAGGTGATG 200 

AAGCGCTTGA AGGTGATACT 250 

AAAACCGTTG ATGAGTACAT 300 

ATTGCTTCTT CCAGTCGAAG 350 

TAGCTTCAGG ACGTATCGAT 400 

GAAATCGTTG GTATCAAAGA 450 

TGAAATGTTC CGTAAACAAT 500 

GTGTGCTTCT "CCGTGGTGTG 550 

TTGGCTAAAC CAGGTTCAAT 600 

TTACATCCTT ACTAAAGAAG 650 

ACTACCGTCC ACAGTTCTAC 700 

GAATTGCCAG CAGGTACTGA 750 

CGACGTTGAA TTGATCCACC 800 

825 



20 



2) INFORMATION FOR SEQ ID NO: 220 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 
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(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: ATCC 10558 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 



CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 

iO TGAAATGGAA 
ATCTTCCAGT 
AAATATGAAG 
CCCAGAACCA 
ACGTATTCTC 

i5 CGTGGTATCG 
AGAAATCCAA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 

50 AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTCTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ATATCATCAT 
GAACGCGACA 
AATCACTGGA 
TTAAAGTCAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTTAC 
CCTGGTGATA 
TGAACAAGGT 



CTTCAACTGA 
CGCCAAGTTG 
GGTTGACGAT 
TCTTGTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
TGACGAAATC 
TTACTGGTGT 
GACAACGTTG 
TGGACAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTAACTAT 
ACTACT 



TGGTCCTATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCTCTTGA 
AACACTGTTG 
ATTGCTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTGCTTCT 
ATTGCTAAAC 
TTATATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGACGTTGAG 



CCACAAACTC 
CTTGATCGTG 
TTGAGTTGGT 
CCAGGTGACG 
AGGTGACTCT 
ATGAGTACAT 
CCAGTCGAAG 
ACGTATCGAC 
GTATCAAAGA 
CGTAAACAGC 
TCGTGGTATC 
CAGGTTCAAT 
ACTAAAGAAG 
ACAGTTCTAC 
CAGGTACTGA 
TTGATCCACC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
826 



50 



2) INFORMATION FOR SEQ ID NO: 221 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 

10 

(xiJ.SEQUENCE^ESCRIPTION^^EQ ID NO: 221 



TGTAGTAGCT TCAACTGACG GACCAATGCC TCAAACTCGT GAACATATCC 50 

TTCTTTCACG TCAAGTAGGT GTTAAATACC TTATTGTCTT CATGAACAAA 100 

15 GTTGACTTGG TTGACGATGA AGAATTGCTT< GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT ATTCCAGTAA 200 

TCCAAGGTTC AGCACTTAAA GCTCTTGAAG GTGATGAAAA ATATGAAGAC 250 

ATCATCATGG AATTGATGAA TACTGTTGAT GAATATATTC CAGAACCAGA 3 00 

ACGTGATACT GACAAACCAT TGCTTCTTCC AGTCGAAGAT GTATTCTCAA 350 

20 TCACTGGACG TGGTACTGTT GCTTCAGGAC GTATCGACCG TGGTACTGTT 400 

AAAGTCAACG ATGAAGTTGA AATCGTTGGT ATCCGCGAGG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTAGGG GTTCTTCTTC GTGGTATCCA ACGTGACGAA 550 

ATTGAACGTG GACAAGTTCT TGCTAAACCA GGTTCAATTC ATCCACACAC 600 

25 TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 650 

ATACTCCATT CTTCAACAAC TACCGTCCTC AATTCTACTT CCGTACTACA 700 

GACGTTACAG GTTCAATCGA ACTTCCTGCA GGTACTGAAA TGGTAATGCC 750 

TGGTGATAAC GTAACAATTG ATGTTGAGTT GATCCACCCA ATTGCCGTA 799 



30 



2) INFORMATION FOR SEQ ID NO: 222 



( i ) SEQUENCE CHARACTERI STI CS : 
35 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus macacae 

(B) STRAIN: ATCC 35911 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 



TGGTGCTATT CTTGTAGTAG CTTCAACTGA CGGTCCAATG CCTCAAACGC 50 

GTGAACATAT CCTTCTTTCA CGCCAAGTAG GTGTTAAAAA CCTTATTGTT 100 

50 TTCATGAATA AAGTTGACTT AGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTACAGA ATATGATTTC CCAGGCGATG 200 

AACTTCCAGT TATCCAAGGT TCAGCACTTA AAGCTCTTGA AGGTGATACT 250 

AAGTACGAAG ATATTATCAT GGAATTGTTG GATACTGTAG ATGATTACAT 300 

CCCAGAACCA CAACGTGATA CTGACAAGCC ATTGCTTCTT CCAGTCGAAG 3 50 

55 ATGTTTTCTC TATTACTGGA CGTGGTACTG TTGCTTCAGG ACGTATTGAC 4 00 

CGTGGTACTG TTAAGGTTAA TGATGAAGTT GAAATCGTTG GTATTCGTGA 450 

CGATATTCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTCG GTGTCCTTCT TCGTGGTATC 550 

CAACGTGATG AAATTGAACG CGGTCAAGTT CTTGCTAAAC CAGGATCAAT 600 

50 TCATCCACAT ACTAAATTCA AAGGTGAAGT TTATATTCTT ACTAAAGAAG 650 

133 



SUBSTITUTE SHEET (RULE 26) 



15 



WO 01/23604 PCT/CAOO/01150 

AAGGTGGACG TCATACTCCA TTCTTTAACA ACTACCGTCC ACAGTTCTAC 700 

TTCCGTACAA CTGATGTAAC TGGTTCAATT GATTTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ATGTTACGAT TGATGTTGAA CTGATCCACC 800 

CAATCGCTGT TGAACAAGGT ACAAC 825 



2 ) INFORMATION FOR SEQ ID NO: 223 



10 (i) SEQUENCE CHARACTERISTICS: 
_ (A) L ENGTH : 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii (deposited as 
20 Streptococcus mitis) 

(B) STRAIN: ATCC 33399 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223 

2 5 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

30 CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

35 GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

40 TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC AAGGTACTAC AT 822 



45 2) INFORMATION FOR SEQ ID NO: 224 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: ATCC 2 5175 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 
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CCCTGGTGCT 
CTCGTGAACA 
GTCTTCATGA 
GGTTGAAATG 
ATGATATTCC 
ACTGCTCAAG 
CATTCCAGAT 
AAGATGTTTT 
GATCGTGGTA 
TGATGACATT 
AATTGGATGA 
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ATCCAACGTG 
AATTCACCCA 
AGGAAGGTGG 
TACTTCCGTA 
TGAAATGGTT 
ATCCAATCGC 



ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGCCACAAA 
CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 
ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 
GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 
AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 
AAGATATCAT CATGGAATTA ATGCATACTG TTGATGACTA 
CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 
CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 
CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 
CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 
AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 
ATGAAATCGA ACGTGGTCAA GTTCTTGCTA - AACCAGGTTC 
CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 
ACGTCATACA CCATTCTTCA ATAACTATCG TCCACAATTC 
CAACTGACGT AACTGGTTCA ATTGAGTTGC CAGCAGGTAC 
ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 
TGTTGAACAA GGTACTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
827 



20 



25 



30 



35 



40 



15 



50 



55 



2) INFORMATION FOR SEQ ID NO: 225 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus parasanguinis 

(B) STRAIN: ATCC 15912 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATGCCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TGACTCTAAA 250 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 300 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

GGTGTTGTTC GTGTCAATGA TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 450 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 800 

TCGCCGTTGA ACAAGGTACT ACAT 824 



2) INFORMATION FOR SEQ ID NO: 226 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ratti 

(B) STRAIN: ATCC 19645 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 



TGGTGCTATC CTTGTAGTAG CTTCAACTGA TGGACCAATG CCGCAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

15 TTCATGAACA AGGTTGACTT GGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA AGGTGACACT 250 

GAACAAGAAG ATGTTATCAT GGAATTGATG AAAACAGTTG ATGAGTACAT 300 

CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT CCAGTCGAAG 350 

2 0 ACGTGTTCTC AATCACTGGA CGTGGTACTG TTGCATCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTTG GTGTACTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG CGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

25 TCATCCGCAT ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650 

AAGGCGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCTGT TGAACAAGGT ACTA 824 

30 



2) INFORMATION FOR SEQ ID NO: 227 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sanguinis 
15 (B) STRAIN: ATCC 10556 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

TGTAGTAGCT TCAACTGACG GACCAATGCC ACAAACTCGT GAGCACATCT 50 

50 TGCTTTCACG TCAGGTTGGT GTTAAACACT TGATCGTCTT CATGAACAAA 100 

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGACCTC TTGTCAGAAT ACGACTTCCC AGGTGACGAT CTTCCAGTTA 2 00 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA ATATGAAGAC 250 

ATCATCATGG AATTGATGGA CACTGTTGAT GAGTACATCC CAGAACCAGA 3 00 

35 ACGCGATACT GACAAGCCAT TGCTTCTTCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACAGTT GCTTCAGGAC GTATCGACCG TGGTATCGTT 400 

AAAGTCAACG ACGAAATCGA AATCGTTGGT ATCAAAGAAG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAGCTT GACGAAGGTC 500 

TTGCAGGGGA CAACGTAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 550 

30 ATCGAACGTG GACAAGTTAT CGCTAAACCA GGTTCAATCA ACCCACACAC 600 
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TAAATTCAAG GGTGAAGTTT ATATCCTTAC 
ACACTCCATT CTTCAACAAC TACCGTCCAC 
GACGTTACAG GTTCAATCGA ACTTCCAGCA 
TGGTGATAAC GTAACAATCG ACGTTGAGTT 



TAAAGAAGAA GGCGGACGTC 650 

AGTTCTACTT CCGTACAACT 700 

GGTACTGAAA TGGTAATGCC 750 

GATCCACCCA ATCGC 795 



2) INFORMATION FOR SEQ ID NO: 228 

10 (i) SEQUENCE _CHARACTERISI^ICSj 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sobrinus 
20 (B) STRAIN: ATCC 33478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 

TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 50 

25 TGCTTTCACG CCAAGTTGGT GTTAAGAACC TCATCGTCTT CATGAACAAG 100 

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGACGAC ATTCCTGTTG 200 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGCCGAAGAC 250 

AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAACCAAA 300 

30 ACGCGATACT GATAAGCCAT TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 400 

AAGGTTAACG ACGAAGTTGA AATCGTTGGT ATCCGTGACG ATATCCAAAA 4 50 

AGCAGTTGTT ACTGGAGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 550 

35 ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 600 

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 650 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 750 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGC 795 

40 



2) INFORMATION FOR SEQ ID NO: 229 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 
55 (B) STRAIN: ATCC 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50 
50 TTCTTTCACG TCAGGTTGGT GTTAAACACC TTATCGTCTT CATGAACAAA 100 
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GTTGACTTGG TTGACGATGA AGAATTGCTT GAGTTGGTTG AAATGGAAAT 150 

CCGTGACCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT CTTCCAGTTA 2 00 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA GTACGAAGAC 250 

ATCGTTATGG AATTGATGAA CACTGTTGAT GAGTACATTC CAGAACCAGA 300 

5 ACGCGACACT GACAAACCAT TGTTGCTTCC AGTCGAGGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACTGTA GCTTCAGGAC GTATCGACCG TGGTACTGTT 4 00 

CGTGTCAACG ACGAAATCGA AATCGTTGGT CTTCAAGAAG AAAAATCTAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAACTT GACGAAGGTC 500 

TTGCCGGCGA TAACGTTGGT GTGCTTCTTC GTGGTGTACA ACGTGATGAA 550 

_1_ 0 A TCGAACGTG_ GTCAAGTTAT CTCTAAACCA GGTTCTATCA ACCCACACAC 600 

TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA - GGTGGACGTC 650" 

ACACTCCATT CTTCGACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCAA ATTGCCAGAA GGTACTGAAA TGGTAATGCC 750 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGCCG 797 

15 



2) INFORMATION FOR SEQ ID NO: 230 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uteris 
30 (B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 

TTGTTGTTGC ATCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC 50 

35 CTTCTTTCAC GCCAAGTTGG TGTTAAACAC CTTATCGTTT TCATGAACAA 100 

AATCGACCTT GTTGACGATG AAGAATTGCT TGAATTAGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTACCAGTT 200 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GAATTGATGA AAACTGTTGA TGAGTATATT CCAGAACCAG 300 

4 0 AACGTGATAC AGACAAACCA TTACTTCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACAGGTC GTGGTACTGT AGCTTCAGGA CGTATCGATC GTGGTACTGT 4 00 

TCGTGTCAAC GACGAAATTG AAATCGTTGG TATCAAAGAA GAAACTAAAA 4 50 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCAGGAG ATAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA 550 

45 AATCGAACGT GGACAAGTTA TTGCTAAACC AGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGATGA AGGTGGACGT 650 

CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCAGC TGGTACTGAA ATGGTAATGC 7 50 

CTGGTGATAA CGTGACAATC AGCGTTGAGT TGATCCACCC AAT 793 

50 



2) INFORMATION FOR SEQ ID NO: 231 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

50 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus vestibularis 
5 (B) STRAIN: ATCC 49124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 

TTGTAGTAGC ATCTACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

10 CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

AGTTGACT TG GTTGACGATG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC - CAGGTGATGA "TATTCCAGTT 200" 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GACTTGATGA ACACTGTTGA CGAATACATT CCAGAACCAG 300 

15 AACGTGACAC TGACAAACCA TTGTTGCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACTGGTC GTGGTACTGT TGCTTCAGGA CGTATCGACC GTGGTGTTGT 400 

TCGTGTTAAT GACGAAGTTG AAATCGTTGG TCTTAAAGAA GAAATCCAAA 450 

AAGCAGTTGT TACTGGTGTA GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

ATTGCCGGAG ATAACGTCGG TGTCCTTCTT CGTGGTATCC AACGTGATGA 550 

20 AATTGAACGT GGTCAAGTAT TGGCTGCACC TGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAGTTCTACT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCTGC AGGTACTGAA ATGGTTATGC 750 

CTGGTGATAA CGTGACTATC GACGTTGAGT TGATCCACCC AATCGCCG 798 

25 



2) INFORMATION FOR SEQ ID NO: 232 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 
40 (B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

45 TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 200 
CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG / 250 

AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 300 

50 CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 400 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

55 GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 800 

50 CGCGATGGAC GATGGTCTGC GTTTCGCAA 829 
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2) INFORMATION FOR SEQ ID NO: 233 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 



( ii ) MOLECULE "TYPE : Genomi c DNA~ — 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 4 9490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

20 GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGATGA 2 00 

CACGCCGATC GTACGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

25 AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTTACATT 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCAT CAAAGTGGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CAACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

30 CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 



40 2) INFORMATION FOR SEQ ID NO: 234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



CGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCCCTATG CCTCAAACTC 50 

GCGAACACAT CTTGTTGGCT CGCCAAGTTG GTGTTCCTGC AATCGTAGTA 100 

TTCTTGAACA AAGCTGACAT GGTTGACGAT GAAGAATTGA TCGAATTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTTC CTACGAATTC CCTGGCGACG 200 

60 AAGTACCTAT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 250 
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CAATATGTAG CTAAAATTGA CGAATTGATG GACGCTGTAG ACTCCTACAT 300 

CCCAACACCA GTTCGTGACA CTGATAAACC ATTCTTGATG CCTGTGGAAG 350 

ATGTTTTCAC AATCACTGGT CGTGGTACAG TAGCAACTGG CCGTGTTGAA 4 00 

CGTGGTCAAG TAAACGTTGG TGATACTGTT GAAGTAGTAG GCTTGAAAGA 450 

5 AAAAGCTGAA CAATACGTAG TAACAGGTCT TGAAATGTTC CGTAAAGTGT 500 

TGGATTCTGC AGTAGCAGGT GACAACGTAG GTGCATTGCT TCGTGGTGTT 550 

GATCGTAAAG ACATYGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCCAT 600 

CAACCCACAY ACAAAATTCA AAGCAGAAGT ATACGTATTG ACTAAAGAAG 650 

AAGGTGGTCG TCATACTCCA TTCTTCTCCA ACTACCGTCC ACAATTCTAC 700 

10 TTCCGTACAA CAGACGTAAC AGGTGTTGTA AACCTTCCTG AAGGTGTAGA 750 

AATGTGTAT G CCTGGCGATA ACGTAACAAT GGAAATCGAA TTGATTACTC 800 

CAATCGCTAT - CGAAGAAGGT "CTTCG ~ " —825 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 235 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

CGCTATCCTG GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

ATGAACAAAT GTGACATGGT TGACGATGAA GAGCTGCTAG AACTGGTAGA 150 

35 AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 2 00 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 250 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TACCTGGATT CTTACATCCC 300 

AGAACCAGAG CGTGCTATCG ATAAGCCGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 400 

40 GGTATCGTTA AAGTTGGTGA AGAAGTCGAA ATTGTTGGCC TGAAAGATAC 450 

CGTTAAATCT ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 500 

AAGGCCGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 600 

ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 650 

45 GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 700 

ACAACTGATG TAACCGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC 825 



2) INFORMATION FOR SEQ ID NO: 236 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DN A 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 



GGCGCGATCC TGGTTGTTGC TGCCACTGAT GGCCCGATGC CACAGACTCG 50 

CGAGCACATT CTGTTAGGGC GTCAGGTGGG TGTTCCTTAC ATCCTGGTCT 100 

10 TCCTGAACAA ATGTGACATG GTTGACGACG AAGAGCTGCT GGAACTGGTA 150 

_GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CTGGCGACGA 200 

CACTCCAGTT TSTCC'GTGGTT "CTGCGCTGAA^AGCGCTGGAA- GGCGAAGCTG — 250- 

AGTGGGAAGC AAAAATCATC GAATTGGCTG AGGCGCTGGA TAGCTATATT 300 

CCACAGCCAG AGCGTGCGAT TGATAAACCA TTCCTGCTGC CAATCGAAGA 350 

15 CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTCACCGGT CGTGTAGAGC 4 00 

GCGGTATCGT TAAAGTCGGC GAAGAAGTCG AAATCGTTGG TATCATTGAT 450 

ACCATCAAGA CTACCTGTAC TGGTGTTGAA ATGTTCCGCA AATTGCTGGA 500 

CGAAGGCCGT GCGGGTGAGA ACGTTGGTGT TCTGCTACGT GGTACTAAAC 550 

GTGATGACGT ACAACGTGGT CAGGTATTGG CAAAACCAGG TTCTATCAAG 600 

20 CCACACACCA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

TGGTCGCCAT ACTCCGTTCT TCAAAGGTTA TCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTGATGCCAG GTGATAACAT TCAAATGATT GTTAACCTGA TTGCTCCTAT 800 

CGCAATGGAT GACGGTCTGC GCTTTGCG 828 

25 



2) INFORMATION FOR SEQ ID NO: 237 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
40 (B) STRAIN: ATCC 29909 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 50 

45 TCCTGCTAGG TCGTCAGGTG GGTGTTCCTT ACATCCTGGT CTTCCTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 150 

AGTCCGCGAA CTTCTGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG* 2 00 

TTATCCGTGG TTCAGCGCTG AAGGCGTTGG AAGGCGAGCC TGAATGGGAA 250 

GCAAAAATTA TCGAATTAGC TGAGGCGCTG GATAGTTATA TTCCACAGCC 300 

50 AGAGCGCGCG ATTGATAGAC CATTCTTGCT GCCAATCGAA GACGTATTCT 350 

CTATCTCAGG TCGTGGTACA GTCGTCACTG GTCGTGTAGA GCGTGGGATC 4 00 

GTTAAAGTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 450 

AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG GACGAAGGCC 500 

GCGCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 550 

55 GTACAGCGTG GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGCCACATAC 600 

TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 650 

ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 700 

GATGTAACCG GTACTATTGA ATTGCCAGAC GGCGTTGAGA TGGTGATGCC 750 

AGGTGATAAC ATTCAAATGA TTGTTAACCT GATTGCACCT ATTGCGATGG 800 

50 ATGATGGTCT GCG 813 
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2) INFORMATION FOR SEQ ID NO: 238 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



( 1 ijMOLECULE" TYPET Genomi-c~DNA- 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Yersinia pestis 

. (B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

20 GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC CGCAGACTCG 50 

TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCCCATAC ATTATTGTCT 100 

TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT AGAGTTGGTT 150 

GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC CAGGCGACGA 200 

CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA GGTGACGCTG 250 

25 AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA TAGCTATATT 300 

CCGCAACCAG AACGCGCTAT TGATAGACCA TTCCTATTGC CAATTGAAGA 350 

CGTATTCTCT ATTTCTGGTC GTGGTACTGT AGTTACTGGT CGTGTAGAAC 400 

GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG TATTATCGAT 450 

ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA AGCTGCTGGA 500 

30 TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG TTCTATCAAG . 600 

CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

CGGCCGTCAT ACACCGTTCT TCAAGGGCTA CCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG CGTTGAAATG 750 

35 GTCATGCCTG GTGACAACGT AAACATGGTT GTTAACCTAA TTGCTCCTAT 800 

CGCAATGGAT GATGGTCTGC GCTTCGCAA 829 



40 2) INFORMATION FOR SEQ ID NO: 239 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 



TGGAGCGATC TTGGTTGTTG ■ CTGCAACCGA TGGCCCTATG CCGCAGACTC 50 

GTGAGCATAT CCTGCTGGGC CGCCAGGTTG GTGTCCCATA CATTATTGTC 100 

TTCCTGAACA AATGTGACAT GGTTGACGAT GAAGAGTTGC TAGAGTTGGT 150 

TGAAATGGAG GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

50 ACACTCCAGT CATCCGTGGT TCAGCGTTGA AAGCCCTGGA AGGTGACGCT 250 
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GAGTGGGAAG CTAAAATTAT CGAGTTGGCA GAAGCTCTGG ATAGCTATAT 300 

TCCGCAACCA GAACGCGCTA TTGATAGACC ATTCCTATTG CCAATTGAAG 350 

ACGTATTCTC TATTTCTGGT CGTGGTACTG TAGTTACTGG TCGTGTAGAA 4 00 

CGCGGTATTG TTAAGGTCGG CGAAGAAGTT GAAATCGTTG GTATTATCGA 450 

5 TACGATTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGC AAGCTGCTGG 500 

ATGAAGGCCG TGCTGGTGAA AATGTTGGTG TTCTGCTGCG TGGTACTAAG 550 

CGTGACGATG TTCAGCGTGG TCAAGTACTG GCGAAACCAG GTTCTATCAA 600 

GCCACACACG AAGTTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GCGGCCGTCA TACACCGTTC TTCAAGGGCT ACCGTCCTCA GTTCTACTTC 700 

10 CGTACAACTG ACGTGACCGG TACCATTGAG CTGCCAGAAG GCGTTGAAAT 750 

G^CAI^CT^CTGACAACG_ TAAACA TGGT TGT TAACCTA ATTGCTCCTA 800 

TCGCAATGGA TGATGGT " 817 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 240 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 829 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
•(D) TOPOLOGY: Linear 



( ii ) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCAATG CCACAGACTC 50 

GCGAGCACAT CCTGTTGGGT CGTCAAGTGG GTGTTCCTTA CATCTTAGTC 100 

TTCCTGAACA AGTGTGACAT GGTTGACGAC GAAGAGTTGC TGGAACTGGT 150 

35 TGAAATGGAA GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCTGGCGATG 200 

ACACTCCGGT TATTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAGGCC 250 

GAGTGGGAAG CCAAAATTAT TGAACTTGCT GAAGCACTGG ATAGCTACAT 300 

TCCACAGCCA GAGCGCGCGA TTGATAAACC ATTCTTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCAGGC CGTGGAACAG TTGTTACCGG GCGTGTTGAG 4 00 

40 CGCGGTATCG TCAGAGTGGG CGAAGAAGTT GAAATCGTGG GTATCATCGA 450 

CACCATTAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTTGGTG TTCTGCTGCG CGGTACTAAA 550 

CGCGATGACG TGCAACGTGG TCAAGTGTTG GCTAAACCAG GTTCTATTAA 600 

GCCGCATACC AAATTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

45 GTGGTCGTCA TACTCCGTTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGACG GTGTTGAGAT 750 

GGTGATGCCA GGTGATAACA TTCAAATGAT TGTTAACCTG ATTGCGCCTA 800 

TTGCAATGGA TGACGGTCTA CGATTTGCA 829 



2) INFORMATION FOR SEQ ID NO: 241 



(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGG AA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACTCCGAT CATCCGTGGT~TCTGCTCTGA~AAGCGCTGGA— AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 600 

20 GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGC 804 

25 



2) INFORMATION FOR SEQ ID NO: 242 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp. 
4 0 denitrificans 

(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

15 TCAGTTCCCC CGCGATCACA TGCCCAAGAT CTACGAAGCG CTTACTCTGG 50 

CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GGAAGTGCAG 100 

CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 150 

CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 200 

TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 250 

50 CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 300 

CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 350 

AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 4 00 

AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 450 

GGAACTGATC AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 500 

55 CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG 550 

GAAGAGTCGA ACGTTCTGGA CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 600 

CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 650 

CCGAGAAGTT CCGCGACGAA GGCCGCGACA TCCTGTTCTT CGTGGACAAC 700 

ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 750 

50 TATGCCGTCG GCAGTGGGCT ACCAGCCCAC GCTGGCCGAA GAAATGGGCA 800 
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AGCTGCAAGA GCGCATCACC TCGACCAAGA CCGGCTCGAT CACCTCGAT* 849 



5 2) INFORMATION FOR SEQ ID NO: 243 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

. (A) ORGANISM: Acinetobacter baumannii 
(B) STRAIN: ATCC 19606 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 



TGAAACTACT TTAGAAGTTC AGCAACAACT TGGTGATGGT GTTGTTCGTA 50 

CCATCGCAAT GGGTTCTACA GAAGGTCTTA AACGTGGTCT TACTGTAACT 100 

AGCACAAACG CACCGATCTC TGTTCCAGTT GGTACAGCCA CTCTTGGCCG 150 

TATCATGGAC GTTTTAGGTC GTCCTATCGA TGAAGCAGGT CCTGTTGCGA 200 

25 CTGAAGAACG TTTGCCGATT CACCGTCAAG CGCCTTCTTA TGCTGAACAA 250 

GCAGCTTCTA CTGACCTTTT AGAAACTGGT ATTAAAGTCA TCGACTTACT 300 

TTGCCCGTTT GCGAAAGGTG GTAAAGTTGG TTTATTCGGT GGTGCTGGTG 350 

TTGGTAAAAC CGTTAACATG ATGGAATTGA TCAACAACAT CGCGAAAGCA 400 

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGTGAGCGTA CTCGTGAAGG 450 

30 TAATGACTTC TATCACGAAA TGAAAGATTC TAACGTTCTT GACAAAGTAG 500 

CAATGGTCTA CGGTCAGATG AACGAGCCAC CAGGTAACCG TTTACGCGTA 550 

GCGTTAACTG GTTTGACTAT GGCTGAATAC TTCCGTGATG AAAAAGACGA 600 

AAACGGTAAA GGTCGTGACG TATTATTATT CGTCGACAAC ATCTACCGTT 650 

ATACACTTGC AGGTACTGAA GTATCAGCAT TGTTAGGTCG TATGCCATCT 700 

35 GCGGTAGGTT ACCAACCTAC ACTTGCAGAA GAAATGGGTG TTCTTCAAGA 750 

GCGTATTACA TCTACTAAAT CTGGTTCGAT CACTTCG 787 



40 2) INFORMATION FOR SEQ ID NO: 244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter lwoffii 

(B) STRAIN: CDCF 3697 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGCACTG AAACTACATT 50 

AGAAGTTCAG CAACAACTTG GTGATGGCGT AGTTCGTACT ATTGCAATGG 100 

GTTCTACTGA AGGCCTTAAG CGTGGTTTGA ACGTAACTAA CACTAACGCG 150 

CCGATTTCTG TACCAGTAGG TACAGCGACT CTAGGTCGTA TCATGGACGT 200 

TCTTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 250 
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TGCCGATTCA CCGTCAAGCA CCTTCTTATG 
GACCTTTTAG AAACTGGTAT TAAAGTCATC 
TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG 
TAAACATGAT GGAGTTGATC AACAACATCG 
TCTGTATTCG CTGGTGTTGG TGAGCGTACT 
TCACGAGATG AAAGACTCAA ACGTTCTAGA 
GTCAGATGAA CGAGCCACCG GGTAACCGTT 
TTGACCATGG CTGAGTACTT CCGTGACGAG 
CCGTGACGTA CTATTGTTCG TAGATAACAT 
GTACTGAAGT ATCAGCACTT CTAGGTCGTA 
CA ACCGAC AC TTGCAGAAGA GATGGGTGTT 



CTGAACAAGC AGCTTCTACT 
GACTTACTTT GCCCGTTCGC 
TGCCGGTGTT GGTAAAACTG 
CTAAAGCGCA CTCAGGTTTA 
CGTGAAGGTA ATGACTTCTA 
CAAAGTAGCA ATGGTCTACG 
TACGCGTAGC GTTGACTGGT 
AAAGACGAAA ACGGCAAAGG 
CTACCGTTAT ACACTAGCAG 
TGCCGTCTGC AGTAGGTTAC 
CTTCAAGAAC GTATTACATC 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
-825- 



GACTAAGTCT GGTTCGATTA CGTCA 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG-197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



2) INFORMATION FOR SEQ ID NO: 246 

( i ) SEQUENCE CHARACTERI ST I CS : 

(A) LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis 

(B) STRAIN: ATCC 15554 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 



TTCCCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 50 
CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTTGAA GTACAGCAAC 100 
10 AATTGGGTGA CGGTGTAGTT CGCACCATCG CCATGGGTTC CAGCGACGGC 150 
CTG.CGCCGCG_GTATGGAAGT GGCCGGTTCG GGCGCTCCCA TCTCCGTTCC 200 



CGTGGGTGTC GGCACCCTGG GTCGCATTAT - GGACGTTCTG _ GGTCGCCCTA~ 250- 

TTGACGAAGT CGGTCCTATT CAGTCCGACG AGCGTCGCGC CATTCACCAG 300 

CCTGCGCCTA CTTTCGACGA ACTGTCGCCT TCCGTAGAGC TGCTGGAAAC 350 

15 CGGTATTAAA GTGATTGACC TGGTTTGCCC GTTCGCCAAG GGTGGTAAGG 400 

TTGGTCTGTT CGGTGGTGCC GGTGTGGGCA AGACCGTGAA CATGCTGGAG 450 

CTGATCAACA ACATCGCCAA GGCACACAGC GGTCTGTCCG TGTTTGCCGG 500 

TGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGGCCG 550 

ATGCTGGCGT TATCCAGATG GACAACCTGA GCGAGTCCAA AGTGGCCATG 600 

20 GTGTTCGGTC AGATGAACGA ACCTCCAGGC AACCGTCTGC GTGTGGCACT 650 

GTCCGGCCTG ACCATGGCCG AGAAGTTCCG TGACGAAGGC CGTGACATCC 700 

TGTTCTTTGT GGACAACATC TACCGCTACA CGCTGGCCGG TACAGAAGTG 750 

TCCGCTCTGC TGGGTCGTAT GCCTTCCGCA GTGGGTTACC AGCCTACGCT 800 

GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG 850 

25 G 851 



2) INFORMATION FOR SEQ ID NO: 247 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
JO (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 

15 GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

50 ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

55 TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

50 ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 
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ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTCT 846 



5 2) INFORMATION FOR SEQ ID NO: 248 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 



CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

25 ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AGTTAATCAA TAACATCGCA 450 

30 CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

35 CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT 810 



iO 



50 



2) INFORMATION FOR SEQ ID NO: 249 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 944 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 

CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCGT ATCCACGACG 50 

CCATGGAGAT TTCCCGCCCA AACGGAAAGA TCTTGATCGT CGAAGTTCAG 100 

CAACACATCG GGGAAAACAC CGTCCGTACC GTAGCGATGG ATACGACCGA 150 

50 CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 200 
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TGCCGACCGG CGACCAAGTC AAAGGACGTT TAATGAATGT CACCGGCGAC 250 

CCTATCGATG GCATGGCCCA GCTTACTAAA GACGGGGCTC TTCCCATCCA 300 

TCGTGAGCCT CCTAAATTCG AGGATCTGAC AACGACCCAA GAGGTCTTGT 350 

ACACGGGTAT TAAAGTAATC GATTTATTGG AACCTTACGC CAAAGGAGGC 400 

5 AAGATCGGAC TTTTCGGAGG AGCCGGAGTC GGCAAAACGG TATTGATCAT 450 

GGAATTGATC AACAACATCG CAAAGAAAAA CAACGGATTC TCCGTCTTCG 500 

CCGGTGTGGG TGAACGTACA CGTGAAGGAA ATGACCTATT GCGTGAAATG 550 

ATCCAATCCG GTGTCATCCG ATACGGCGAG GAGTTTAAGA AGAGTATGGA 600 

GGCAGGCAAC TGGGACTTGT CGAAGATCGA TTACGATGAA TTGGCTAAGT 650 

10 CGCAGGCTAC CTTGGTATTC GGGCAGATGA ACGAACCGCC GGGCGCCCGC 700 

ICGICC_GTGG_ CCTT^CC(^_^TG^GP^C^CCGAATCCT TCCGTGACAA 750 

AGCCTCTGAG GGAGAAAGAA AAGATATATT ATTCTTCATC "GATAATATCT 8 00 

TCCGTTTCAC CCAAGCCGGT TCCGAGGTTT CCGCCTTGCT GGGGCGTATG 850 
CCTTCCGCCG TAGGTTACCA ACCGACATTG GCCACGGAAA TGGGAGCTAT ' 900 

15 GCAAGAGCGT ATCACTTCAA CCAAGAAGGG< CTCCATCACC TCCG 944 



20 



55 



2) INFORMATION FOR SEQ ID NO: 250 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacteroides ovatus 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

35 TTGAGGGTAC GGATGCAGAA TTGGTGCTGC CAAGCATCCA CGACGCACTG 50 

GAGATAAAGA GGCCAAACGG CAAAATACTG GTTGTAGAAG TTCAGCAACA 100 

TATCGGCGAA AATACGGTGC GTACCGTAGC GATGGACAGT ACTGACGGAC 150 

TTCAGAGAGG CATGAAAGTG TATCCCACCG GAGGCCCGAT CACGATGCCG 200 

ATTGGCGAAC AGATTAAAGG ACGACTGATG AACGTAGTCG GTGATTCGAT 250 

40 CGACGGTATG AAAGGACTCG ACCGCAAAGG TGCATATTCC ATTCATCGCG 300 

ACCCCCCTAA GTTTGAGGAT TTGACTACTG TGCAAGAGGT GCTCTTCACA 350 

GGTATCAAAG TGATCGACCT GCTCGAACCG TATGCCAAAG GTGGTAAAAT 400 

CGGTTTGTTC GGCGGTGCCG GTGTAGGAAA GACTGTATTG ATTCAGGAAC 450 

TTATCAATAA TATCGCCAAG AAACATAATG GATTCTCTGT ATTTGCCGGA 500 

45 GTAGGTGAAC GTACCCGTGA AGGTAACGAC TTGCTGCGCG AAATGATTGA 550 

ATCCGGTGTA ATCCGTTACG GCGAAGCATT CAAAGAAGGA ATGGAGAAAG 600 

GTCACTGGGA TCTTTCGAAA GTGGATTATA ACGAACTGGA GAAATCGCAA 650 

GTGTCTCTGA TTTTCGGTCA GATGAACGAG CCTCCGGGCG CACGTGCCTC 700 

TGTGGCATTG TCCGGACTGA CGGTGGCGGA ATCTTTCCGC GACGCAGGAA 750 

50 AAGAAGGTGA GAAACGCGAT ATTCTGTTCT TTATTGATAA TATCTTCCGT 800 

TTCACGCAAG CAGGTTCAGA AGTGTCCGCC CTTTTGGGAC GTATGCCCTC 850 

CGCTGTTGGT TACCAGCCCA CGTTGGCTAC GGAAATGGGT GCGATGCAGG 900 

AACGTATCAC GTCTACCCGC AAAGGTTCTA TCACCTCCG 93 9 



2) INFORMATION FOR SEQ ID NO: 251 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 83 3 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

10 

(xi) SEQ UENCE DE SCRIPTION: SEQ ID NO: 251 



CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTTC 50 

AGAATGGTAA TGAGAGCCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

15 GGTATTGTGC GTACCATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCAAT CGAAGTACCA GTAGGTAAAG 200 

CAACCCTGGG CCGTATCATG AACGTCCTGG GTCAGCCGAT CGACATGAAA 250 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGTT CAGCACCTTC 300 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

20 TTATCGACCT GATGTGTCCG TTCGCGAAGG GCGGTAAAGT TGGTCTGTTC 400 

GGCGGTGCGG GTGTAGGTAA AACCGTAAAC ATGATGGAGC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TATCCCTGGT TTACGGCCAG ATGAACGAGC CACCAGGAAA 600 

25 CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAGTTCCGTG 650 

ACGAAGGTCG TGACGTACTG CTGTTCGTTG ACAACATCTA CCGTTACACC 700 

CTGGCCGGTA CGGAAGTATC CGCACTGCTG GGTCGTATGC CATCAGCAGT 750 

AGGCTACCAG CCGACCCTGG CGGAAGAGAT GGGTGTTCTG CAGGAACGTA 800 

TCACCTCTAC CAAAACCGGT TCTATCACCT CCG 833 

30 



2) INFORMATION FOR SEQ ID NO: 252 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 
45 (B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

GTGTACGACG CACTGAAGGT GGAAAACACC GAGATCACCC TCGAAGTCCA 50 

50 GCAGCAGCTG GGCGACGGCG TGGTGCGTAC CATCGCCCTC GGTTCCACCG 100 

ACGGCCTGAA GCGCAACCTG GTTGCCGTCA ACACCGGCCG TGGCATCTCG 150 

GTGCCGGTCG GCGCCGGCAC CCTGGGCCGC ATCATGGACG TGCTGGGCCG 200 

TCCGATCGAC GAAGCCGGCC CGGTCGCTGC CAGCGACAAC TGGGAAATCC 250 

ACCGCGCTGC GCCGTCGTAT GAAGACCAGT CCCCGGCCAC CGAGCTGCTG 300 

55 GAAACCGGCA TCAAGGTCAT CGACCTGATG TGCCCGTTCG CCAAGGGCGG 350 

CAAGGTCGGC CTGTTCGGCG GCGCCGGCGT CGGCAAGACC GTCAACATGA 400 

TGGAACTGAT CAACAACATC GCCAAGGCGC ACAGCGGCCT GTCCGTGTTC 450 

GCCGGCGTGG GTGAGCGTAC CCGTGAGGGC AACGACTTCT ACCACGAGAT 500 

GAAGGACTCC AACGTCCTGG ACAAGGTGGC GATGGTGTAC GGCCAGATGA 550 

60 ACGAGCCGCC GGGCAACCGT CTGCGCGTCG CCCTGACCGG CCTGACCATG 600 
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650 
700 
750 
800 
819 



GCCGAGTACT TCCGCGATGA GAAGGACGAA AACGGCAAGG GCAAGGACGT 
CCTGCTGTTC GTCGACAACA TCTACCGCTA CACCCTGGCC GGTACCGAAG 
TGTCGGCACT GCTGGGCCGC ATGCCGTCCG CGGTGGGTTA CCAGCCGACC 
CTGGCCGAGG AAATGGGCGT CCTGCAGGAG CGCATCACCT CGACCAAGAA 
TGGTTCGATC ACCTCGATC 



20 



25 



30 



35 



40 



15 



50 



55 



2) INFORMATION FOR SEQ ID NO: 253 

(i.)_SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 bases " 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

AGTTTGAAGG CCCTTTGCCA AATATTCTCA ATGCATTAGA AACAGATAAT 50 

TTGGGCAATC GGCTAGTTTT AGAAGTTGCT CAGCATTTGG GTGAAAATAC 100 

CGTGCGTACC ATTGCCATGG ATACTACCGA TGGTCTTGTC CGTGGTCAAA 150 

AAGTTTTTGA TACAGGAACA CAGATCAGTG TTCCCGTGGG AGAAGCAACA 200 

CTTGGTCGTA TTATGAATGT GATTGGAGAG CCGGTTGATA ATGTTGGCCC 250 

AATTGCTACA AGCAAAACCC GTTCCATTCA CCAAGAGGCT CCTGAATATG 300 

TGGAGCAATC AACCGCATCA GAAATCCTTG TGACTGGTAT TAAAGTCGTT 350 

GATCTGTTAG CTCCTTATTC TAAAGGGGGG AAGGTTGGTT TGTTTGGAGG 400 

TGCCGGTGTT GGTAAAACCG TTCTCATTAT GGAGCTTATC AACAATATTG 450 

CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTTGG TGAACGTACA 500 

CGTGAGGGAA ATGATCTTTA TTATGAAATG ATCGAAAGCC GTGTGAATGT 550 

GAATCCAAAA GACAACAATG GTTCAACAGA AGGATCAAAA TGTGCACTCG 600 

TTTATGGGCA AATGAATGAA CCACCAGGGG CGCGTGCACG TGTGGCTCTT 650 

TCAGGATTGA CCATTGCAGA AAGTTTCCGT GATGAGGGAC AAGATGTTTT 700 

GTTCTTCGTA GATAATATTT TCCGTTTTAC GCAAGCAGGC GCTGAAGTGT 750 

CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TAGGGTATCA GCCAACTTTG 800 

GCAACTGATA TGGGGGCTTT GCAAGAGCGT ATTACCAGTA CAAGAACAGG 850 

CTCTATTACC TCTG 864 



2) INFORMATION FOR SEQ ID NO: 254 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 866 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 
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TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 50 

GKCCAACACC GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100 

TTGMGGTTGA ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 150 

5 AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 200 

CCCGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGCCAC GTATTCGACG 250 

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 300 

GAACGTTGGT CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 350 

CAAGACCCAG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 400 

10 CGTATGTGCA GGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450 

AAGACCGT^J^ATCCAGGA J]ATCATTCAG CGTGTGGCTC AGAACCACGG 500 

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT" GAGGGTAACG — 550" 

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 600 

GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 650 

15 GACCGCACTG ACCATGGCAG AGTACTTCCG TGACGTACAG AATCAGGACG 700 

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 750 

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 800 

CCTGGCCGAT GAGATGGGCG CGCTGCAGGA GCGAATCACT TCGACCCGTG 850 

GACACTCCAT CACCTC 866 

20 



2) INFORMATION FOR SEQ ID NO: 255 

25 ( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 
35 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

AAGGCCAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 50 

4 0 CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 100 

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 150 

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 200 

CGCATCATGA ACGTCATCGG CGAGCCGGTT GACGAAGCAG GCCCCATCAA 2 50 

GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 3 00 

45 AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 350 

CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG GCGGTGCAGG 400 

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 450 

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 500 

GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 550 

50 CGGCGGCGAA GGCTCCAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 600 

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 650 

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 700 

CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTCTTCTC GGCCGTATTC 750 

CTTCCGCTGT GGGTTATCAG CCGACGCTGG CAACCGACAT GGGCGCCATG 800 

55 CAGGAACGCA TCACCACGAC GACCAAGGGT TCGATCACCT CG 842 



2) INFORMATION FOR SEQ ID NO: 256 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Cedecea davisae 
(B) STRAIN: ATCC 33431 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 

15 CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 50 

AAAATAACAG CGAGCAGCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATCGTGC GTACCATCGC TATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CGACCCTTGG CCGTATCATG AACGTGCTGG GTCAGCCGAT CGACATGAAA 250 

20 GGCGATATCG GCGAAGAAGA CCGTTGGGCT ATTCACCGCG CTGCACCTTC 300 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TAATCGACCT TATCTGTCCG TTCGCTAAGG GCGGTAAAGT AGGTCTGTTC 4 00 

GGTGGTGCGG GCGTGGGTAA AACCGTAAAC ATGATGGAGC TTATCCGTAA 450 

CATCGCGATC GAGCACTCCG GCTACTCCGT GTTTGCGGGC GTGGGTGAGC 500 

25 GTACTCGTGA GGGTAACGAC TTCTATCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TTGCCCTGGT TTACGGCCAG ATGAACGAGC CACCGGGTAA 600 

CCGTCTGCGC GTAGCGCTGA CCGGTCTGAC CATCGCGGAG AAATTCCGTG 650 

ACGAAGGTCG TGACGTTCTG CTGTTCGTTG ATAACATCTA CCGTTACACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGCTG GGTCGTATGC CTTCTGCGGT 750 

30 AGGTTACCAG CCAACTCTGG CGGAAGAGAT GGGTGTTCTT CAGGAGCGTA 800 

TTACCTCCAC CAAGACCGGT TCCATCACCT CCG 833 



35 2) INFORMATION FOR SEQ ID NO: 2 57 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 



TTCCCTCAGG ACGGCGTACC GCGCGTATAT GACGCGCTTG AGGTACAGAA 50 

TAACAGCGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCAATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GAAGTGAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

55 TCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCAATTGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGATCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAT 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

50 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500 
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TCGTGAGGGT AACGACTTCT 
ACAAAGTTGC ACTGGTTTAC 
CTGCGCGTAG CGCTGACCGG 
AGGCCGTGAC GTTCTGCTGT 
CCGGTACAGA AGTTTCTGCA 
TATCAGCCTA CTCTGGCAGA 
CTCCACCAAG ACCGGTTCCA 



ACCACGAGAT 
GGCCAGATGA 
TCTGACCATC 
TCGTCGATAA 
CTGCTGGGTC 
AGAGATGGGT 
TCACTTCCG 



GACCGACTCC 
ACGAGCCGCC 
GCGGAGAAAT 
CATCTATCGT 
GTATGCCATC 
GTTCTTCAGG 



AACGTTCTGG 
AGGTAACCGT 
TCCGTGACGA 
TATACCCTGG 
TGCGGTAGGT 
AGCGTATTAC 



550 
600 
650 
700 
750 
800 
829 



20 



25 



30 



35 



10 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 258 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 

TCCCTCAGGA CGGCGTACCG CGCGTTTATG ACGCGCTTGA GGTACAGAAC 50 

AACAATGAGA AGCTGGTGCT GGAAGTTCAG CAGCAGCTCG GCGGCGGTAT 100 

CGTGCGTACC ATCGCAATGG GTTCTTCCGA CGGTCTGCGT CGTGGTCTGG 150 

TAGTAACAGA CCTCGAGCAC CCGATCGAAG TCCCGGTAGG TAAAGCGACC 200 

CTTGGCCGTA TCATGAACGT GCTGGGTCAG CCGATCGACA TGAAAGGCGA 250 

CATCGGCGAA GAAGACCGTT GGGCGATTCA CCGCGCAGCA CCTTCCTACG 300 

AAGAGCTGTC CAGCTCTCAG GAATTGCTGG AAACCGGCAT CAAAGTTATC 350 

GACCTGATTT GTCCGTTCGC TAAGGGCGGT AAAGTAGGTC TGTTCGGTGG 4 00 

TGCGGGCGTA GGTAAAACCG TAAACATGAT GGAGCTGATC CGTAACATTG 450 

CGATCGAGCA CTCCGGTTAT TCCGTGTTTG CGGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACCGACTCCA ACGTTCTGGA 550 

TAAAGTAGCA CTGGTTTACG GCCAGATGAA CGAGCCACCA GGTAACCGTC 600 

TGCGCGTAGC GCTGACCGGT CTGACCATCG CGGAAAAATT CCGTGACGAA 650 

GGCCGCGACG TTTTGCTGTT CGTTGATAAC ATCTACCGTT ATACCCTGGC 700 

CGGTACCGAA GTATCTGCAC TGCTGGGTCG TATGCCTTCT GCGGTAGGTT 750 

ATCAGCCAAC TCTGGCAGAA GAGATGGGTG TTCTTCAGGA GCGTATTACC 800 

TCCACCAAGA CCGGTTCTAT CACCTCCGTA 830 



2) INFORMATION FOR SEQ ID NO: 259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chryseobacterium meningosepticum 

(B) STRAIN: CDC B7681 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 50 

CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

5 AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150 

GGACAATCTG TAATAGGACA TGGAAGACAA ATTACTATGC CTATTGGTGA 200 

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 250 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATCCACAG AGAAGCACCT 300 

AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 350 

10 AGTAATCGAC CTTATCGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 400 

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 



AACATTGCTA - AAGGACACGG _ TGGTCTTTCT _ GTATTCGCAG-GAGTAGGTGA 500 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 600 

15 GATCTTTCTA AAGTTGACAG TGAGTTGATG. AAAGAGTCTA AAGCTGCTTT 650 

CGTTTTCGGA CAGATGAACG AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 700 

TTTCTGGTCT TACTTTAGCT GAATACTACC GTGATGGTGG CGAAAGCGGA 750 

CAAGGTAGAG ACGTTCTTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 800 

GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 850 

20 GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900 

ACTTCAACTA AAAACGGATC TATTACATCT G 931 



25 2) INFORMATION FOR SEQ ID NO:260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter amalonaticus 

(B) STRAIN: ATCC 25405 



JO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 



GTATCGTACG TACCATCGCA ATGGGTTCTT CCGACGGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 150 

GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 200 

15 TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 250 

TATCGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 300 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 350 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 400 

TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC 450 

50 TGGATAAAGT ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCGGGAAAC 500 

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 550 

CGAAGGTCGT GACGTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 600 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 700 

55 CACTTCTACC AAAACCGGTT CTATCA 726 



50 



2) INFORMATION FOR SEQ ID NO: 261 

156 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Citrobacter braakii 

(B) STRAIN: ATCC 43162 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

15 GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

20 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 4 50 

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

25 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 700 

TATCCGCTCT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCGACC 750 

30 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC AC 812 



35 2) INFORMATION FOR SEQ ID NO: 262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci troJbacter koseri 

(B) STRAIN: ATCC 27156 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 



GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50 

TGGAAGTTCA GCAGCAGCTC GGTGGCGGTA TCGTACGTAC CATCGCCATG 100 

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG ACCTTGAGCA 150 

CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 200 

55 TACTGGGCGA ACCAGTAGAC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250 

TGGGCTATCC ACCGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 300 

GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 350 

CGAAGGGCGG TAAAGTGGGT CTGTTCGGTG GTGCGGGTGT AGGTAAAACC 4 00 

GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 450 

50 CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500 

157 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ACCACGAAAT GACCGACTCC 
GGCCAGATGA ACGAGCCGCC 
CCTGACCATG GCGGAGAAAT 
TCGTCGACAA CATCTACCGT 
CTGCTGGGTC GTATGCCTTC 
AGAGATGGGT GTTTTGCAGG 
TCACCTCCGT A 



AACGTTATCG 
GGGAAACCGT 
TCCGTGACGA 
TACACCCTGG 
AGCGGTAGGT 
AACGTATCAC 



ACAAAGTATC CCTGGTTTAC 
CTGCGCGTTG CGCTGACCGG 
AGGTCGTGAC GTTCTGCTGT 
CCGGTACGGA AGTATCCGCA 
TACCAGCCGA CCCTGGCGGA 
CTCCACCAAA ACCGGTTCTA 



550 
600 
650 
700 
750 
800 
811 
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30 



35 



40 



45 



50 



55 



2) INF ORMA TIO N FOR SE Q ID NO: 263 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid - 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter farmer i 

(B) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 100 

CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 200 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 300 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCAT CAGCGGTAGG CTACCAGCCA 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTTCTACCAA 800 

AACCGGTTCT ATTACC 816 



2) INFORMATION FOR SEQ ID NO: 264 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 



158 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 



PCT/CA00/01150 



AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTTAT GAATGGTAAA 50 

GAGAGCCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG 100 

5 TACCATCGCC ATGGGTTCTT CTGACGGTCT GCGTCGTGGT CTGGAAGTTA 150 

AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC AACGCTGGGT 200 

CGTATCATGA ACGTTCTGGG TCACCCGATC GACATGAAAG GCGATATCGG 250 

TGAAGAAGAG CGTTGGGCTA TCCACCGTGC AGCACCTTCC TACGAAGAGC 300 

TGTCAAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG 350 

10 ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG 400 

TGTAGGT AAA ACC GTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG 450 

AACACTCCGG TTACTCCGTG~TTTGCGGGCG~TAGGTGAACG"TACTCGTGAG 500" 

GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC TGGACAAAGT 550 

ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCTGGAAAC CGTCTGCGTG 600 

15 TTGCGCTGAC CGGTCTGACC ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT 650 

GACGTTCTGC TGTTCGTTGA TAACATCTAT CGTTACACCC TGGCCGGTAC 700 

AGAAGTATCT GCACTGCTGG GTCGTATGCC ATCAGCGGTA GGCTACCAGC 750 

CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT CACCTCCACC 800 

AAAACCGGTT CTATCACCT 819 

20 



2) INFORMATION FOR SEQ ID NO: 265 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 
35 (B) STRAIN: ATCC 27028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

40 ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 300 

45 TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAA 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

50 CCTTGGTTTA CGGCCAGATG ■ AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 750 

ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 800 

55 AACCGGTTCT ATCACCTCCG TA 822 



60 



2) INFORMATION FOR SEQ ID NO:266 

159 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Citrobacter sedlakii 
(B) STRAIN_:_ATCC _5 1115 _ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 

15 GATGCCGTAC CGCGCGTGTA CGACGCCCTT- GAGGTACAGA ATGGTAATGA 50 

GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATTGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GACATCGGTG 250 

20 AAGAAGAGCG TTGGGCTATC CACCGTGCCG CGCCGTCCTA TGAAGAGTTG 300 

TCTAACTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

25 TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

CCCTGGTGTA CGGCCAGATG AACGAGCCGC CTGGAAACCG TCTGCGCGTC 600 

GCACTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 750 

30 ACTCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCAACCAA 800 

AACCGGTTCT ATCACCTCCG 820 



35 2) INFORMATION FOR SEQ ID NO: 267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 200 

55 CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 300 

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

50 TCCGGTTACT CAGTGTTTGC GGGCGTTGGT GAACGTACTC GTGAGGGTAA 500 

160 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CGACTTCTAC CACGAAATGA CCGACTCCAA 
TGGTATATGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGTC TGACCATGGC TGAGAAGTTC 
TCTGCTGTTC GTTGATAACA TCTATCGTTA 
TATCTGCACT GCTGGGTCGT ATGCCATCAG 
CTGGCGGAAG AGATGGGTGT TCTGCAGGAA 
CGGTTC 



PCT/CAOO/01150 ^ 

CGTTCTGGAC AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCG 600 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACTGAAG 700 

CGGTAGGCTA CCAGCCAACC 750 

CGTATCACCT CTACCAAAAC 800 

806 



10 



20 



15 



.2)_INF_QRMAT.IPN_F0R_ SEQ_ ID NO: _26_8_ 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 29935 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

30 CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200 

CATGAACGTT CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGAGAAG 250 

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTATGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

35 GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

JO ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTGCAGGAA CGTATCACTT CTACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 269 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 

50 

161 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269 



PCT/CA00/01150 



TTGAGAACGG 
AAAGACAGTG 
5 ACGTGTCCGC 
TGGAAGCCAT 
ATTCTGGGAA 
ACCTGTAGGA 
TTGAGGAGCA 
10 ATTGACCTGT 
TGGTGCGGGA 



TCGCCAAGGA" 
ACCCGTGAAG 
TGATAAGACC 
15 GAATGCGTGT 
CACGACCATC 
CCAGGCGGGA 
TAGGCTATCA 
ATTACATCCA 

20 



AGATTTGCCG 
AATCTCTGAT 
TGTATCGCTA 
TGATACAGGA 
GAATGTTCAA 
ACGGATAACA 
GCAGACATCC 
TATGTCCATA 
GTAGGTAAAA 
"ACATGGTGGA" 
GAAACGACAT 
GTACTGGTTT 
CGGTCTGACC 
AGGATGTATT 
AGTGAAGTAA 
GCCGACACTT 
CGAAGGATGG 



CAGCTATTGA 

TGTCGAAGTT 

TGGGCGGTAC 

TCCGCAATCC 

TGTCCTCGGA 

CACTGCCGAT 

GCAGAAATGC 

TTCCAAGGGT 

CCGTACTGAT 

"ATGTCCGTCG- 

GTATCATGAA 

ACGGACAGAT 

GGGCTGACGA' 

GCTGTTTATT 

GTGCCCTGCT 

GCGACAGAAA 

TTCCATT 



CCGCTATTGA 

GCTCAGCATA 

AGATGGTCTG 

GTGTACCGGT 

CGTGAAATTG 

CCACAGACAG 

TGGAAACAGG 

GGTAAGATTG 

TCAGGAGCTG 

TTACCGGTGT- 

ATGAAGGACA 

GAATGAATCA 

TGGCGGAATA 

GATAATATTT 

GGGACGTATG 

TGGGACAGCT 



AATTCCTCTG 
TCGGTGATGA 
GTTCGTGGAA 
GGGAAAAGAA 
ATGGTCTGGG 
GCACCGGGCT 
AATTAAGGTC 
GTTTGTTTGG 
ATTCATAATA 
"AGGGGAGAGA 
GCGGTGTCCT 
CCGGGTGCCA 
TTTCCGTGAT 
TCCGTTTTAC 
CCAAGTGCAG 
GCAGGAGCGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
-500 
550 
600 
650 
700 
750 
800 
827 



2) INFORMATION FOR SEQ ID NO: 270 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 
35 (B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 

GCAAATGATG AACTTCCTAA TATATTTAAT GCAATACACA TAAAAATGGA 50 

40 CGATGGAAAA ATCTTAGTTT GTGAGGTAGA GCAACACGTA GGAGACGATA 100 

TAGTTAGAAC TATAGCTATG GAAGCTACTG AAGGACTAAG AAGAGGTGTA 150 

GAAGCTGTTG ATACAGGAGC ACCTATATCA GTACCAGTTG GTGAATGCGT 200 

ATTAGGAAGA ATATTTAACG TATTAGGTAA ACCACTAGAT AGTGGAGCTG 250 

AAGTTAATAA CGAAGAAAAA TATCCAATTC ATAGACCAGC TCCATCATTT 300 

15 GAAGAACAAT CAGTTGTTCC TCAAATGTTT GAGACAGGAA TAAAGGTTAT 350 

CGACCTTTTA GCACCTTACC AAAGAGGGGG AAAAATCGGT CTATTTGGAG 4 00 

GTGCAGGTGT TGGTAAAACA GTTCTTATCC AAGAGCTTAT AAACAACATA 450 

GCTAAAGAGC ACGGTGGACT TTCTGTATTC ACAGGAGTTG GAGAAAGATC 500 

AAGAGAAGGT AATGACCTTT ACTATGAAAT GATGGAATCA GGAGTTATAA 550 

50 AAAATACAGC ATTAGTATTT GGACAAATGA ACGAACCACC TGGAGCAAGA 600 

ATGAGAGTTG CTTTAACAGG ACTTACTATG GCTGAGTACT TCAGAGACCA 650 

AGGTCAAGAC GTGTTATTAT TCATAGATAA CATATTCAGA TTCTCACAAG 700 

CTGGATCAGA GGTTTCAGCT TTATTAGGAA GAATACCATC AGCTGTTGGT 750 

TACCAACCAA CTCTTGCTAC AGAGATGGGA GCTCTTCAAG AGAGAATCAC 800 

55 ATCAACTACC CATGGATCAA TTACATCAG 829 



2) INFORMATION FOR SEQ ID NO: 271 

50 

162 
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10 



30 



50 



60 



WO 01/23604 PCT/CAOO/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 
_TTGTCCAC.GT_^GGATRTCTT CA 2 2 



15 2 ) INFORMATION FOR SEQ ID NO: 272 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC. 27010 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 



CCCGCGTGGC GAGCTGCCGG CACTGTACAA CGCGTTGACT GTCGAGGTCA 50 

CCCTCGAGGC AGTCGCTAAG ACCATTACCC TTGAGGTTGC CCAGCACTTG 100 

GGCGACAACC TCGTTCGCGC CGTGTCCATG GCCCCTACCG ACGGCCTCGT 150 

CCGTGGTG.CT GTTGTGACCG ACTCGGGCAA GCCAATCTCC GTGCCAGTTG 200 

35 GCGACGTTGT TAAAGGCCAC GTTTTCAACG CACTGGGCGA TTGCTTGGAT 250 

GAGCCAGGTC TCGGCCGCGA TGGTGAGCAG TGGGGAATTC ACCGCGATCC 300 

ACCACCATTC GATCAGCTCG AAGGTAAGAC CGAAATCCTC GAGACCGGTA 350 

TTAAGGTCAT CGACTTGCTC ACCCCTTACG TTAAGGGCGG CAAGATTGGT 400 

CTGTTCGGTG GTGCAGGTGT GGGTAAGACC GTGCTCATCC AGGAGATGAT 450 

4 0 CACTCGTATT GCTCGCGAGT TCTCCGGTAC CTCCGTCTTC GCTGGCGTTG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTCGAAAT GGAAGAAATG 550 

GGCGTTCTTC AGGACACCGC TCTCGTGTTC GGCCAGATGG ACGAGCCACC 600 

AGGAGTCCGT ATGCGCGTTG CTCTGTCCGG TCTGACCATG GCGGAGTACT 650 

TCCGCGATGT TCAGCACCAG GACGTGCTTC TGTTCATCGA TAACATTTTC 700 

4 5 CGTTTCACCC AGGCCGGTTC CGAGGTTTCG ACCCTTCTTG GTCGTATGCC 750 

TTCCGCCGTG GGTTACCAGC CAACCTTGGC TGACGAGATG GGTGTTCTCC 800 

AGGAGCGTAT TACCTCTA 818 



2) INFORMATION FOR SEQ ID NO: 273 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

CGATGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 50 

GTCGCCAAGA CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100 

GATCCGGACC ATTGCGTTGG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150 

10 AGGTTATCGA CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 200 

AAAGGACACG _TCTTCAATGC GCTCGGTGAG TGTTTGGACG AACCAGGATT 250 

GGGCCGCGAC GGCGAACAGT GGGGAATCCA - ~CCGCGATCCG~CCACCATTCG 300" 

ATGCGCTGGA GGGCAAAACC GAGATTCTGG AGACTGGAAT CAAGGTTATC 350 

GACCTCCTTA CCCCTTACGT TAAGGGTGGC AAAATTGGTC TGTTCGGTGG 400 

15 CGCCGGCGTC GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450 

CTCGTAACTT CTCCGGTACT TCCGTGTTCG CCGGCGTCGG TGAGCGTACC 500 

CGTGAGGGTA CTGACCTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 550 

AGACACCGCC CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTCGTA 600 

TGCGCGTGGC CTTGTCTGGT CTAACCATGG CTGAATATTT CCGCGACGTT 650 

20 CAAAACCAGG ACGTTTTGTT GTTCATTGAC AACATCTTCC GTTTTACTCA 700 

GGCAGGTTCC GAGGTTTCCA CGCTGTTGGG CCGTATGCCT TCCGCCGTGG 750 

GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 800 

ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 833 



25 



10 



2) INFORMATION FOR SEQ ID NO: 2 74 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 417 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 



CCGTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 50 

ACCTCTGTGT TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100 

15 CTGGGTCGAG ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150 

TCGGCCAGAT GGACGAGCCG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 200 

GCGCTGACCA TGGCGGAGTA CTTCCGCGAT GTGCAGAACC AGGACGTGCT 250 

GCTGTTCATC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 300 

CCACCCTGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 350 

50 GCGGACGAGA TGGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 400 

CTCCATCACC TCGATGC 417 



55 2) INFORMATION FOR SEQ ID NO: 275 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 

(B) STRAIN: ATCC 43042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

10 

GGGCAGCAGC^CAGCACTAXT^AACJ3CGCTG^CACGTCGAGG TTGACCTCGA 50 

GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC CTGGGTGACA 100 

ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT GGTCCGCGGT 150 

GCAGAGGTCA AGGACACCGG TAAGCCGATC TCTGTGCCAG TCGGCGATGT 200 

15 TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG GATGAGCCAG 250 

GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA GCCACCGGCA 300 

TTCGACGAGC TCGAGGGTAA GACCGAGATC CTGGAGACCG GCGTTAAGGT 350 

CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT GGCCTCTTCG 400 

GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT GATTACCCGT 450 

20 ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG TCGGCGAGCG 500 

TACCCGTGAG GGTACGGACC TCTTCCTCGA GATGGAGGAG ATGGGCGTGC 550 

TCCAGGACAC CGCGCTGGTG TTCGGTCAGA TGGATGAGCC GCCGGGAGTC 600 

CGTATGCGCG TGGCTCTGTC CGGTCTGACC ATGGCGGAGT ACTTCCGCGA 650 

TGTTCAGGGC CAGGACGTGC TGCTGTTCAT CGACAACATC TTCCGTTTCA 700 

25 CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT GCCGTCCGCA 750 

GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC TGCAGGAGCG 800 

CATTACCTCC ACGAAGGGTA AGTCCATTAC CTCCC 835 



30 



10 



2) INFORMATION FOR SEQ ID NO: 276 



(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 818 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 

CCCGTCACGC TGTCCCGAAA GTGTACGATG CTTTGCAGGT GGATGAAAAT 50 

AATTTAACGC TCGAAGTCCA ACAGCAACTC GGGGACGGTG TCGTGCGCAC 100 

AATTGCCATG GGCAGCACTG AGGGCTTAAA ACGCGATATC GCCGTAAAAA 150 

50 ATACGGAAAA ACCGATTGAA GTTCCCGTAG GAAAAGAAAC TTTAGGTCGT 200 

ATCATGAACG TGCTGGGTGA GCCGATCGAT GAGTTAGGTC CCATTAATTC 250 

AAAAGAAAAA CTCCCTATTC ATCGTCCTGC GCCGAGCTTT ATTGAGCAAT 300 

CTGGCGCTAC CGAATTATTA GAAACCGGTA TTAAAGTGGT CGATTTGCTT 350 

TGCCCCTTTG CTAAGGGAGG CAAAGTGGGT CTTTTTGGAG GCGCGGGCGT 400 

55 TGGAAAAACG GTTAATATGA TGGAATTAAT CCGTAATATC GCCATTGAAC 450 

ACAGCGGTTA TTCTGTTTTT GCGGGTGTGG GAGAAAGAAC GCGAGAAGGC 500 

AATGATTTTT ATCATGAAAT GAAAGAATCC AATGTCTTGG ATAAAGTGGC 550 

GTTGGTGTAC GGACAAATGA ACGAGCCGCC AGGGAACCGC TTGCGGGTGG 600 

GTTTGACGGG GCTTACGCTG GCGGAAGCCT TCCGTGACGA AGGACGCGAC 650 

50 GTTCTGTTAT TTATCGATAA TATCTTTCGT TACACTTTGG CAGGGGTTGA 700 
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AGTCTCTGCC CTCCTCGGTC GGATGCCATC GGCTGTGGGT TATCAGCCGA 750 
CGTTGGCCGA AGAGATGGGG GCCCTGCAAG AACGCATTAC TTCCACTAAA 800 
AAAGGGTCCA TTACGTCG 818 

5 

2) INFORMATION FOR SEQ ID NO: 277 

( i ) SEQUENCE CHARACTERI STI CS : 
10 (A) LENGTH: 829 bases 

JB) TYPE : Nucleic acid 

(Cj STRANDEDNESST Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA. 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

(B) STRAIN: ATCC 33379 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 

TTCCCGCAGG ACGCCGTGCC GAAGGTGTAT AACGCACTGG AAGTAAAAGG 50 

CGGTGCCACG AAACTGGTAC TGGAAGTGCA GCAGCAGCTG GGTGGCGGCG 100 

25 TAGTTCGCTG CATCGCGATG GGCTCTTCCG ACGGTCTGCG CCGTGGGCTA 150 

GAGGTTGAAG ACCAAGACCA TCCGATCGAG GTTCCTGTTG GCAAGGCGAC 200 

TCTGGGCCGT ATCATGAACG TACTGGGTGA TCCGGTCGAC ATGAAGGGCG 250 

AGATCGGTGA AGAAGAGCGT TGGGCTATCC ATCGTGCTGC ACCGAGCTAT 300 

GAAGATCTGT CTAACTCTCA GGAACTGCTG GAGACCGGCA TTAAGGTTAT 350 

30 CGACCTGATT TGCCCGTTCG CTAAAGGCGG TAAAGTGGGC CTGTTCGGTG 400 

GGGCCGGTGT GGGTAAGACC GTTAACATGA TGGAGCTTAT CCGTAACATC 450 

GCTATCGAGC ACTCCGGTTA CTCAGTCTTC GCCGGTGTGG GTGAGCGTAC 500 

CCGTGAGGGT AACGACTTCT ACCACGAGAT GACCGATTCC AACGTATTGG 550 

ATAAAGTTTC TCTGGTGTAT GGTCAGATGA ACGAGCCACC GGGAAACCGT 600 

35 CTGCGCGTGG CGCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGATGA 650 

AGGTCGTGAT GTACTGTTGT TCATCGATAA CATCTACCGT TATACCTTGG 700 

CCGGTACTGA AGTCTCCGCT CTGCTGGGCC GTATGCCGTC GGCGGTAGGT 750 

TATCAGCCGA CTCTGGCGGA GGAAATGGGG GTGCTGCAAG AGCGTATTAC 800 

CTCCACTAAG ACCGGGTCCA TCACCTCTG 829 

40 



2) INFORMATION FOR SEQ ID NO: 278 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella tarda 
55 (B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 50 
50 ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 100 
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TCGCGATGGG CTCCTCCGAC GGTCTGCGCC 
CAAGACCATC CGATCGAGGT TCCGGTCGGT 
CATGAACGTA CTGGGTGATC CGGTCGACAT 
AAGAGCGTTG GGCTATCCAC CGCGCGGCGC 
5 AACTCTCAGG AGCTGCTGGA GACCGGCATC 
CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT 
GTAAGACCGT TAACATGATG GAGCTTATCC 
TCCGGTTACT CCGTATTTGC CGGTGTAGGC 
CGACTTCTAC CACGAGATGA CCGACTCCAA 
10 TGGTATACGG CCAGATGAAT GAGCCGCCGG 

-TTGAGGGGTG— TGACCATGGC— GGAGAAATTC 

GTTGTTGTTC ATCGATAACA TTTATCGTTA" 
TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG 
CTGGCGGAAG AGATGGGTGT GTTGCAAGAG 
15 GGGCTCTAT 



PCT/CA00/01150 



GTGGGCTGGT 




ish 


AAGGCGACCC 


TGGGCCGTAT 


200 


GAAGGGCGAG 


ATCGGCGAAG 


250 


CGAGCTATGA 


AGATCTGTCC 


300 


AAGGTTATCG 


ACCTGATTTG 


350 


GTTCGGTGGT 


GCCGGTGTGG 


400 

x V V 


GTAACATCGC 


TATCGAGCAC 


450 


GAGCGTACCC 


GTGAGGGTAA 


500 


CGTATTGGAT 

V-r X * * A ^ V-J Null * «A 


AAAGTTTCTC 


550 


GAAACCGTCT 


GCGTGTGGCG 


600 

>J V V 


CGTGATGAAG 


GTCGCGATGT 


650 


TACCTTGGCT 


GGTAPPGZiAG 


700 — 


CGGTAGGTTA 


TCAGCCGACC 


750 


CGTATCACCT 


CAACGAAGAC 


800 






809 



2) INFORMATION FOR SEQ ID NO: 279 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 

35 TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 50 

TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 100 

GTACCATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 150 

CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 200 

CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTCCGATCG 250 

40 GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 300 

CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 350 

GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 40 0 

GTGTGGGCAA AACCGTGAAC ATGATGGAGC TGATTAACAA CATTGCCAAA 4 50 

GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 500 

45 AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 550 

TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 600 

GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 650 

CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTACC 700 

GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 750 

50 TCTGCTGTGG GTTATCAGCC AACTTTGGCT GAAGAAATGG GTCGTTTGCA 800 

GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 840. 



55 2) INFORMATION FOR SEQ ID NO: 280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 



10 



15 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 

-GCCGTACCAC_GAGT£TACGA TGCACTT GAG_ GTAAAGAATG 
TCTGGTGCTG GAAGTTCAGC AACAGCTCGG CGGTGGCGTT" 
TCGCAATGGG TTCTTCTGAT GGTCTGCGTC GTGGTCTGGA 
CTGGCTCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC 
CATGAACGTA CTGGGCGAAC CAGTAGACAT GAAAGGCGAC 
AAGAGCGTTG GGCGATCCAC CGTGCAGCAC CGTCCTACGA 
AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG 
TCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT 
GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC 
TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC 
CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC 
TGGTGTATGG TCAGATGAAC GAGCCGCCGG GAAACCGTCT 
CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGACGAAG 
ACTGTTGTTC GTTGACAACA TCTACCGTTA CACCCTGGCC 
TATCCGCACT GCTGGGCCGT ATGCCTTCTG CGGTAGGTTA 
CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT 
CGG 



GTGAAGAGCG 50 

"GTACGTACCA - 100 

AGTAACGGAC 150 

TGGGTCGTAT 200 

ATCGGTGAAG 250 

AGAGTTGTCA 300 

ACCTGATGTG 350 

GCGGGTGTAG 400 

GATCGAGCAC 450 

GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCG 600 

GTCGTGACGT 650 

GGTACGGAAG 700 

TCAGCCGACG 750 

CCACCAAAAC 800 
803 



30 



10 



2) INFORMATION FOR SEQ ID NO: 281 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 

(B) STRAIN: ATCC 33072 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTAC 50 

AGAATGGTAA CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 100 

GGTATCGTAC GTACTATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

50 TCTGGCTGTT AAAGATCTCG AACACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CAACACTGGG TCGTATCATG AACGTTTTGG GTCAACCAAT CGACATGAAA 250 
GGCGACATCG GTGAAGAAGA CCGTTGGGCA ATCCACCGTG CAGCACCTTC ' 300 

CTATGAAGAG CTGTCTAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TTATCGACCT GATGTGTCCG TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 400 

?5 GGCGGTGCGG GCGTGGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 550 

CTGGATAAAG TATCCCTGGT TTATGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGTCTGAC TATGGCTGAG AAGTTCCGTG 650 

SO ACGAAGGTCG TGACGTACTG CTGTTCGTAG ATAACATCTA CCGTTACACC 700 

168 
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PCT/CAOO/01150 



CTGGCCGGTA CTGAAGTATC TGCGCTGCTG GGCCGTATGC CTTCAGCGGT 750 
AGGTTACCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 800 
TCACTTCTAC CAAAACCGGT TCTATCACCT CCG 833 

5 

2) INFORMATION FOR SEQ ID NO: 282 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 810 bases 
(B) T-Y-PE Nuc 1 e i c^ac id 

(C) STRANDEDNESS: Double " ~ ~ 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:' Enterobacter asburiae 

(B) STRAIN: ATCC 35953 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

25 TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTCAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

30 TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

35 CTGACCGGTC TGACGATGGC TGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC 810 

40 



2) INFORMATION FOR SEQ ID NO: 283 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 
55 (B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 50 
60 CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100 

169 
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TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTAAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCC 300 

5 AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTATT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

10 TGGTGTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC' GTTGATAACA- TCTACCGTTA- eAeeeTGGGG-GGTAGGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CAACCAAAAC 800 

15 CGGTTCTATC A - 811 



2) INFORMATION FOR SEQ ID NO: 284 



20 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

35 GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

10 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

15 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

50 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC ACTTCCG 817 



55 2) INFORMATION FOR SEQ ID NO: 285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

170 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 

AATGAGAGCC TG GTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATCGT 50 

GCGTACCATC GCAATGGGTT CTTCTGACGG TCTGCGTCGC^GGTCTGGAAG 100- 

TTAAAGATCT CGAACATCCG ATCGAAGTCC CGGTAGGTAA AGCGACCCTC 150 

GGCCGTATTA TGAACGTGCT GGGTCAGCCG GTTGATATGA AAGGCGATAT 200 

15 CGGCGAAGAA GAGCGTTGGG CGATCCACCG CGCTGCGCCG TCCTATGAAG 250 

AGCTCTCCAG CTCTCAGGAA CTGCTGGAAA CCGGTATCAA GGTAATGGAC 300 

CTGATTTGCC CGTTCGCGAA GGGCGGTAAA GTCGGTCTGT TCGGCGGTGC 350 

GGGCGTTGGT AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA 4 00 

TCGAGCACTC CGGCTACTCC GTGTTTGCGG GCGTGGGTGA ACGTACTCGT 450 

20 GAGGGTAACG ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGACAA 500 

AGTATCCCTG GTGTACGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC 550 

GCGTGGCGCT GACCGGTCTG ACCATGGCTG AGAAATTCCG TGACGAAGGT 600 

CGTGACGTTC TGCTGTTCGT CGATAACATC TACCGCTATA CCCTCGCCGG 650 

TACTGAAGTA TCCGCACTGC TGGGCCGTAT GCCTTCTGCA GTAGGTTACC 700 

25 AGCCGACGCT GGCGGAAGAG ATGGGTGTTC TGCAGGAACG TATCACCTCC 750 

ACCAAAACCG GTTCTA 766 



30 2 ) INFORMATION FOR SEQ ID NO: 286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 



GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTGAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200 

50 CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

55 TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGCC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTCGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

50 TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACG 750 

171 
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CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT " CGACCAAAAC 800 
CGGTT 805 



2) INFORMATION FOR SEQ ID NO: 287 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 791 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 



15 



(D) TOPOLOGYT Linear ~ — 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter sakasakii 

(B) STRAIN: ATCC 2 9544 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 

TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 50 

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100 

CCGACGGTCT GCGTCGCGGT CTGCCTGTTG CAGACCTTGA GCACCCGATC 150 

25 GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 200 

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 250 

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCCAGCTC TCAGGAACTG 300 

CTGGAAACCG GCATCAAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 350 

CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 400 

30 TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 4 50 

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 550 

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 600 

ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650 

35 CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 700 

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 750 

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAACCGGTT C 791 



40 



50 



2) INFORMATION FOR SEQ ID NO: 288 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 839 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 

TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 150 

50 GAATGGAAGT TGTCGATACT GGCAAATCAA TCTCCGTTCC TGTAGGTAAA 2 00 

172 
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GATACTCTAG GTCGTGTGTT TAACGTATTA GGTGAAACGA TTGATAAAGA 250 

AGCACCTTTT CCAGAAGATG CAGAAAGAAG CGGCATTCAT AAAAAGGCGC 3 00 

CTGCTTTTGA AGACCTTAGT ACAAGTAACG AGATTTTGGA AACAGGGATC 3 50 

AAGGTTATCG ACTTATTAGC CCCTTATTTA AAAGGTGGGA AAGTCGGACT 400 

5 ATTCGGTGGT GCCGGTGTTG GTAAAACCGT TTTGATCCAA GAATTAATTC 4 50 

ATAATATCGC CCAAGAACAC GGTGGTATTT CAGTGTTTAC CGGTGTTGGG 500 

GAACGTACTC GTGAAGGGAA CGACCTTTAT TATGAAATGA AAGACTCTGG 550 

CGTTATTGAG AAAACAGCCA TGGTGTTCGG ACAAATGAAC GAGCCGCCTG 600 

GTGCACGTAT GCGTGTTGCC TTGACTGGTT TGACATTAGC TGAATATTTC 650 

10 CGTGATGAAG AAGGACAAGA TGTGTTGCTA TTTATTGACA ACATCTTCCG 700 

CTTTACTC AA GC CGGATCAG AAGTTTCTGC CTTATTAGGA CGTATGCCAT 750 

CAGCCGTTGG GTATCAACCA ACTTTGGCAA - CTGAAATGGG~ GCAATTACAA 800" 

GAACGAATCA CTTCAACCAA AAAAGGTTCG ATCACTTCA 839 



15 



30 



2) INFORMATION FOR SEQ ID NO: 289 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 847 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 



TTCTCTCTAG ACCAATCATT ACCAGATATC AACAATGCGT TGATTGTTTA 50 

CAAAAAAGAT GAGCAGAAAA CAAAAGTTGT GTTAGAAGCT GCCTTAGAAC 100 

3 5 TTGGCGACGG CGTTATCCGT ACGATTGCCA TGGAATCAAC AGATGGCTTA 150 

CAACGAGGAA TGGAAGTAAT CGATACAGGC GCCTCCATTT CTGTTCCAGT 2 00 

TGGGACAGAA ACCTTAGGAC GGGTGTTTAA TGTCTTAGGG GACACCATCG 2 50 

ATTTAGAAGC GCCGTTCCCT GAAGAAGCAC CCCGCAGTGG GATTCACAAA 3 00 

AAAGCACCTG ACTTTGATGA ATTGTCAACA AGTACGGAGA TCCTTGAAAC 350 

*0 TGGGATCAAA GTTATCGATT TGTTAGCCCC TTATTTAAAA GGGGGGAAAG 4 00 

TTGGACTTTT CGGTGGTGCC GGTGTTGGTA AAACCGTCTT GATCCAAGAA 4 50 

TTGATCCACA ACATCGCCCA AGAGCATGGT GGGATCTCTG TCTTCACAGG 500 

TGTTGGTGAA CGGACACGTG AAGGAAATGA CCTTTATAAT GAAATGAAAG 550 

AATCTGGCGT TATCGAAAAA ACAGCCATGG TGTTTGGACA AATGAACGAA 6 00 

15 CCACCAGGTG CTCGGATGCG GGTAGCCTTG ACTGGTTTGA CATTAGCCGA 650 

GTACTTCCGT GATGTGGAAG GACAAGACGT GCTCTTGTTT ATCGATAATA 700 

TCTTCCGCTT CACTCAAGCA GGTTCTGAAG TATCTGCCTT ACTAGGTCGG 750 

ATGCCGTCTG CCGTTGGGTA TCAGCCAACA TTAGCAACTG AGATGGGGCA 8 00 

ATTACAAGAA CGGATCACAT CGACGAAGAA AGGTTCCGTT ACGTCTA 84 7 



2) INFORMATION FOR SEQ ID NO: 290 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
5 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCTTT AGTTGTTTAT 50 

10 AAAAATGATG AGAAGAAATC GAAAGTTGTT CTTGAAACAG CGCTAGAATT 100 

_ _ AGGTGACG GT GTCA TC CGTA CA ATCGCGAT GGAATCAACA GATGGTTTAC 150 

AACGCGGAAT GGAAGTCATT GATACAGAAA _ AAGCAATTTC~TGTACCAGTG ~~200 

GGTAAAGAAA CGTTAGGTCG TGTATTCAAT GTATTAGGAG ATACGATCGA 250 

TTTATCTGCA CCTTTCCCAG AAGATGCAAA ACGTAGCGAA ATCCATAAAA 300 

15 AAGCACCAAA CTTTGATGAG TTAAGTACAA GTACTGAGAT CCTTGAAACT 350 

GGGATCAAAG TTATTGACTT GCTTGCTCCT TACTTAAAAG GTGGGAAAGT 400 

TGGATTATTC GGTGGTGCCG GTGTAGGTAA AACTGTATTG ATCCAAGAAT 450 

TGATCCATAA TATCGCTCAA GAACACGGTG GTATTTCTGT ATTTACTGGT 500 

GTTGGTGAAC GTACACGTGA AGGTAATGAC CTTTATTATG AAATGAAAGA 550 

20 TTCAGGAGTT ATTGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCCTTGA CTGGTTTGAC GATTGCTGAA 650 

TACTTCCGTG ATGTTGAAGG GCAAGACGTG CTATTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCCG GTTCAGAAGT TTCTGCCCTA TTAGGTCGTA 750 

TGCCTTCTGC CGTTGGGTAC CAACCAACGC TAGCAACAGA AATGGGTCAA 800 

25 TTACAAGAAC GGATCACTTC AACGAAAAAA GGTTCAATCA CTTCA 845 



2) INFORMATION FOR SEQ ID NO: 291 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
B5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
£0 (A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: ATCC 2 9212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 

15 TTAGATCAAT CCTTACCCGA TATTAACAAC GCTTTAGTCG TTTATAAAAA 50 

TGGCGAAGCA AAACAAAAAG TAGTACTTGA AGTCGCTTTA GAACTAGGTG 100 

ATGGAGTGAT TCGTTCTATC GCCATGGAAT CGACAGATGG TTTACAACGT 150 

GGAATGGAAG TTATCGATAC AGGAAAATCA ATTTCAGTTC CTGTTGGTAA 200 

AGATACATTA GGTCGTGTGT TTAACGTTTT AGGAGACACA ATTGACTTAG 250 

50 AAGCGCCATT CCCTGCAGAT GCTGAACGTA GTGGGATTCA TAAAAAAGCG 300 

CCAGCATTTG ATGAATTAAG TACCAGTAAT GAAATTTTAG AAACAGGGAT 350 

TAAAGTTATT GACTTATTAG CACCTTATCT AAAAGGTGGT AAAGTCGGAC 400 

TTTTCGGTGG TGCCGGTGTT GGTAAAACCG TCTTAATTCA AGAATTAATT 450 

CATAATATTG CCCAAGAACA TGGAGGGATT TCCGTCTTTA CTGGTGTTGG 500 

55 TGAACGGACA CGTGAAGGGA ACGATCTGTA CTATGAAATG AAAGATTCAG 550 

GCGTTATTGA AAAAACAGCC ATGGTTTTTG GTCAAATGAA CGAACCGCCA 600 

GGTGCACGGA TGCGTGTGGC CTTAACTGGG TTAACGATTG CTGAATATTT 650 

CCGTGATGTG GAAGGACAAG ACGTGCTATT ATTTATTGAT AACATTTTCC 700 

GTTTCACCCA AGCCGGTTCA GAAGTTTCTG CCCTTTTAGG TCGGATGCCG 750 

50 TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800 

174 
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AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 840 



5 2) INFORMATION FOR SEQ ID NO: 292 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

_( D)_ TOPOLOGY^ Li near _ 



20 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 



TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCATT AGTTGTTTAT 50 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGACGGA GTGATCCGGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 200 

25 GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTACTAGGAG ATACGATCGA 250 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 300 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 350 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 400 

CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGTACTG ATCCAAGAAC 450 

30 TGATCCATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 650 

TATTTCCGTG ATGTAGAAGG TCAAGATGTA CTGTTGTTTA TCGACAACAT 700 

35 TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 750 

TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 800 

TTGCAAGAAC GTATCACATC TACGAAAAAA G 831 



40 



50 



2) INFORMATION FOR SEQ ID NO: 293 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

CCTTACCAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 50 

AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT 100 

CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGAATGGAAG 150 

60 TCATCGATAC CGGTGCCTCA ATCTCTGTCC CTGTAGGAAC AGATACTCTA 200 

175 
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GGCCGAGTAT TTAATGTACT 
CCCAGAAGAT GCCAAACGTA 
ATGAATTGTC AACAAGTACA 
GATTTATTAG CTCCTTACTT 
5 TGCCGGTGTT GGTAAAACCG 
CTCAAGAGCA TGGGGGAATT 
CGTGAAGGTA ATGACTTGTA 
AAAGACAGCC ATGGTTTTCG 
TGCGGGTTGC TTTGACTGGT 
10 GAAGGACAAG ATGTGCTCTT 
AGCGGG TTCT GAAGTATCTG 
" GTTATCAACC AACTCTA'GCA" 
ACTTCTACGA AAAAAGGATC 



AGGCGATACT ATCGACTTGG 
GTGGCATCCA CAAAAAAGCC 
GAAATCCTTG AAACTGGGAT 
AAAAGGTGGT AAAGTCGGCT 
TATTGATTCA AGAATTGATT 
TCAGTATTTA CCGGTGTTGG 
TTATGAAATG AAAGAATCAG 
GTCAAATGAA TGAACCACCA 
TTGACCATTG CTGAGTATTT 
GTTTATCGAT AATATTTTCC 
CCTTGTTAGG CCGGATGCCA 
ACTGAAATGG-GTCAATTACA- 
TGTAAC 



PCT/CAOO/01150 



AAGCACCATF' 


'2*: 


CCAGATTTCG 


300 


CAAAGTTATC 


350 


TGTTCGGTGG 


400 


CACAATATCG 


450 


CGAACGGACG 


500 


GCGTTATCGA 


550 


GGTGCCCGGA 


600 


CCGTGACGTT 


650 


GTTTCACACA 


700 


TCAGCCGTTG 


750 


AGAACGAATC - 


- 800- 




826 



15 



2) INFORMATION FOR SEQ ID NO: 294 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 

B0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 

TTTCTTTGGA CCAATCCTTA CCAGACATCA ACAATGCGTT GGTGGTCTAT 50 

AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAGAACT 100 

35 TGGTGATGGT GTCGTACGTA CGATTGCTAT GTCGTCAACA GACGGTTTGC 150 

AACGTGGGAT GGAAGTCATC GATACAGGAG CATCAATTTC TGTTCCTGTT 200 

GGGAAAGAGA CATTAGGACG TGTATTTAAC GTTTTAGGGG AGACTATCGA 250 

CTTAGATGGT CCATTCCCAG AAGAAGTAGC ACGAGATGGT ATTCATAAAA 300 

AGGCACCTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA 350 

10 GGGATTAAAG TAATCGATTT ATTAGCGCCT TACTTAAAAG GTGGGAAAGT 400 

TGGTTTATTC GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT 450 

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTGCGGGT 500 

GTTGGTGAGC GTACTCGTGA AGGAAATGAC CTTTATTATG AAATGAAAGA 550 

GTCGGGCGTT ATTGAGAAAA CAGCGATGGT TTTTGGACAA ATGAACGAAC 600 

15 CACCAGGTGC ACGTATGCGA GTTGCTTTAA CTGGTTTAAC CATTGCAGAA 650 

TACTTCCGTG ATGTTGAAGG ACAAGATGTA TTACTATTTA TTGATAACAT 700 

TTTCCGTTTT ACTCAAGCTG GTTCAGAAGT TTCAGCTTTA TTAGGACGTA 750 

TGCCTTCAGC GGTAGGGTAT CAACCGACAT TAGCAACAGA AATGGGACAA 800 

TTACAAGAAC GTATTACGTC AACGAAAAAA GGCTCAATTA CATCAA 846 

50 



2) INFORMATION FOR SEQ ID NO: 295 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

;o 

176 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 
5 (B) STRAIN: ATCC 35469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

10 TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGCGTACCA 100 

TCGCA ATGGG GTC CTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT' AAAGCGACTC "TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 300 

15 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGACTCCAA' CGTTATCGAC AAAGTATCCC 550 

20 TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800 

25 TGG 803 



2) INFORMATION FOR SEQ ID NO: 296 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 

4 5 GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50 

GCGTCTGGTG CTGGAAGTGC AGCAGCAGCT CGGCGGCGGT ATCGTGCGTA 100 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 150 

GACCTCGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 200 

TATCATGAAC GTGCTGGGTC AGCCGATCGA CATGAAAGGC GATATCGGTG 2 50 

50 AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 300 

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 4 50 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

55 TAACGACTTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGTTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACTG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

50 ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAGCGTATCA CCTCCACCAA 800 

177 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
AACCGGTTCT ATCACCTCCG TA 



PCT/CA00/01150 



5 2) INFORMATION FOR SEQ ID NO: 297 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

_(D)_ _ TOPOLOGY; JLinear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 



CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AGCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

TGGGTTCTTC CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 150 

CACCCGATCG AAGTGCCGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 200 

25 CGTACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 250 

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 300 

CAGGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 400 

CCGTAAACAT GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT 450 

30 TACTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500 

CTACCACGAG ATGACCGACT CCAACGTTCT GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 600 

GGCCTGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 650 

GTTCGTTGAC AACATCTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 700 

35 CACTGCTGGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 800 

TATCACCT 808 



40 



50 



2) INFORMATION FOR SEQ ID NO: 298 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium lentum 

(B) STRAIN: ATCC 43055 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 

TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 50 

CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTCGAGGTC GAGACGCACC 100 

TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAGCTCGAC GGACGGTCTC 150 

50 GTCCGCGGCC TCGAGGTCGT CGACACGGGC AACCCGATCA TGATGCCCGT 200 

178 



SUBSTITUTE SHEET (RULE 26) 
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GGGTCCCGAG ACCCTGGGTC GCATCTGGAA CGTCATGGGC GAGCCCGTCG 250 

ACGAGAAGCC GATGCCCGAG GTGAAGGGCT ACATGCCCAT CCACCGTCCG 300 

GCTCCGGACT ACGACGAGCT GTCCACCACC ACCGAGATCT TCGAGACCGG 350 

CATCAAGGCC ATCGACCTCG TCGAGCCCTT CGTCAAGGGC GGCAAGACGG 400 

5 GTCTGTTCGG CGGCGCCGGC GTGGGCAAGA CGGTTATCAT CCAGGAGCTC 450 

ATCAACAACC TGGCCCAGGA GCACGGCGGC ACGTCGGTGT TCACGGGCGT 500 

GGGCGAGCGT ACCCGCGAGG GTACCGACCT CTACCTGGAG ATGAGCGACT 550 

CGGGCGTCAT CAACAAGACC TGCCTCGTGT ACGGTCAGAT GAACGAGCCT 600 

CCGGGAGCGC GTCTGCGCGT GGGTCTCGCG GGCCTCACCG AGGCGGAGTA 650 

10 CTTCCGCGAT CAGGGCCAGG ACGTGCTTCT GTTCGTGGAC AACATCTTCC 700 

GCT-TCACGCA GGCC GGCTCC GAGGT GTCCG CTCTGCTGGG CCGCATGCCC 750 

TCTGCCGTGG GTTACCAGCC GACGCTGGCA ACCGAGATGG" GCGACCTGCA - 800 

GGAGCGCATC ACGTCGACGT CCACCGGCTC CATCACGTCC GTG 843 



15 



30 



2) INFORMATION FOR SEQ ID NO: 299 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC 50 

GGTGCCTCCA AACTGGTTCT GGAAGTTCAG CAACAGTTAG GCGGCGGCGT 100 

35 TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 150 

AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 200 

CTGGGTCGTA TCATGAACGT ATTGGGTGAA CCAATCGACA TGAAAGGTGA 250 

AATCGGCGAA GAAGAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG 300 

AAGAGCTGGC TAACTCCCAA GAATTGCTGG AAACCGGTAT CAAAGTTATG 350 

40 GACCTGATGT GTCCGTTCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 400 

TGCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGTAACATCG 450 

CGATCGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCGA 550 

CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA GGTAACCGTC 600 

45 TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 650 

GGTCGTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCCTGGC 700 

AGGTACCGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGGTAGGTT 750 

ATCAGCCAAC GCTGGCGGAA GAGATGGGTG CTCTGCAAGA GCGTATCACC 800 

TCTACCAAAA GTGGTTCTAT CACCTCCGT 829 

50 



2) INFORMATION FOR SEQ ID NO: 300 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

179 



SUBSTITUTE SHEET (RULE 26) 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 
5 (B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 



AACACGCCTA AAGTATATGA TGCTTTAAAT GTAGTAGAAG CTGGTTTAGT 50 

L0 ATTAGAAGTT CAGCAACAAA TTGGTGATGG CGTAGTTCGT ACAATTGCTA 100 

. _TGG^TCJ^G_TC TGGAAGTTAA GAACACAAAT 150 

GCGCCTATTT CTGTTCCAGT TGGACATGGC~ACACTTGGAC~GTATCATGAA- — — 2 00 

TGTTTTAGGT GAACCAATTG ATGAAGCTGG TCCAATTGAA TATACTGAGA 2 50 

AAAGATCTAT CCATCAAGCT CCTCCTGCAT ATGATGAGTT AGCATTAAGT 300 

15 ACAGAAATAT TAGAAACAGG TATCAAAGTA GTTGACCTTA TTTGTCCATT 350 

TGCTAAGGGC GGTAAAGTTG GTTTATTTGG CGGTGCAGGT GTTGGTAAAA 4 00 

CTGTAACGAT GATGGAACTT ATCAACAATA TTGCAAAAGA ACATAGTGGC 450 

TACTCTGTAT TTTCCGGTGT TGGTGAAAGA ACTCGTGAAG GTAATGACTT 500 

CTACTATGAG ATGAAATATT CTAATGTATT GGATAAAGTA TCATTAGTAT 550 

20 ATGGTCAGAT GAATGAGCCG CCTGGAAACA GATTAAGAGT AGCTCTTAGT 600 

GGCTTAACAA TAGCAGAAGG ATTCCGTGAT GAAAAGCGTG ATGTTTTGAT 650 

GTTTATCGAT AACATCTATC GTTATACATT AGCAGGTACA GAGGTATCGG 7 00 

CGCTACTTGG TCGTATGCCA TCTGCTGTGG GTTATCAGCC AACGCTTGCA 7 50 

GCTGAGATGG GTGCTTTACA GGAGCGTATT ACATCTACTA AGACAGGATC 800 

25 TATTA 805 



2) INFORMATION FOR SEQ ID NO: 301 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Fusobacterium gonidiaformans 

(B) STRAIN: ATCC 25563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 

45 GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50 

AGAACTTGTA TTGGAAGTGC AACAACATTT GGGAAATAAT GTTGTGAGAA 100 

CAGTAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 150 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGGAAGGCGG TTTTAGGAAG 200 

AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 250 

50 CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 3 00 

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 350 

AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAGCCGGAG 40 0 

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 450 

CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

55 AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550 

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 650 

AGATGTATTG TTGTTTATCG ACAATATCTT CCGTTTCACA CAAGCAGGAT 700 

CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 750 

50 CCGAACTTAG CGACAGAAAT GGGAACTTTA CAAGAAAGAA TTACTTCTAC 800 

180 
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WO 01/23604 
AAAATCAGGA TCTATCACTT CGGTA 
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5 2) INFORMATION FOR SEQ ID NO: 302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 
(D) _ TOPOLOGY Linear 



(ii) MOLECULE TYPE: Genomic DNA 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorum subsp. 
necrophorum 

(B) STRAIN: ATCC 25286 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 

ACAATGCATT AAAGGTACAG GTGGGAGAAA GGGAACTTGT GTTGGAAGTG 50 

CAGCAACATT TAGGAAATAA TGTTGTCAGA ACAGTAGCAA TGGATTCAAC 100 

AGACGGATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 150 

25 CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 2 00 

GAGCCTGTGG ACGAAAAAGG TCCGATAGAG ACAGAAGAAT ATTTACCAAT 250 

ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG ACAGAAATTT 300 

TTGAAACAGG AATTAAAGTC ATTGATTTGT TAGCTCCTTA TATTAAAGGA 350 

GGAAAAACAG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 4 00 

30 TATGGAACTG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 4 50 

TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA 500 

ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT 550 

GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 

TTGCCGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT 650 

35 GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 700 

GGGGAGAATT CCTTCTGCAG TGGGATATCA ACCGAACTTG GCGACAGAAA 750 

TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 800 

TCCGTG 806 



40 



2) INFORMATION FOR SEQ ID NO: 303 

( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphum 

(B) STRAIN: ATCC 10953 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 



GATGAATTGC CTGCAATATA TAATGCTTTA AAAGTAAAAT TAGAAGATAA 50 

GGAACTTGTT CTAGAAGTTG AACAACATCT TGGTAACAAT GTTGTAAGAA 100 

50 CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGTTATA 150 

181 



SUBSTITUTE SHEET (RULE 26) 
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GATACAGGTA AACCAATTAC AGTACCAGTT 
AATATTAAAT GTTTTAGGAG AACCTGTTGA 
CTGAAACATT TTTACCTATT CATAGAGAAG 
GAAACTGAAA CTGAAATATT TGAAACAGGA 
5 AGCACCATAT ATTAAAGGTG GAAAAATAGG 
TAGGAAAAAC AGTTTTAATA ATGGAACTTA 
CATGGAGGAA TTTCAGTTTT TGCAGGAGTT 
TAGAGACTTA TATGGTGAAA TGACTGAATC 
CTCTTGTTTA TGGACAAATG AATGAGCCAC 
10 GCATTAACAG GGCTTACTGT TGCAGAAAAC 
AGATGTT CTT CTATTTATAG ATAATATATT 
CAGAAGTTTC AGCTTTACTT"GGAAGAATA"C 
CCAAACCTAG CAACTGAAAT GGGTGCTTTA 
AAAATCTGGT TCAATTACAT C 



PCT/CAOO/01150 

GGTAAAGCTG TTCTTGGTAG 200 

TAATCAAGGT CCTATAAATG 250 

CACCAGAATT TGATGACTTA 300 

ATAAAAGTTA TAGACTTATT 350 

ATTATTTGGT GGAGCTGGAG 400 

TCAACAACAT TGCAAAAGGA 450 

GGAGAAAGAA CAAGAGAAGG 500 

AGGAGTTATC ACAAAAACAG 550 

CTGGAGCAAG ACTTAGAGTT 600 

TTTAGAGATA AAGATGGGCA 650 

TAGATTTACA CAAGCAGGTT 700 

CATCAGCTGT-TGGATATCAA 750 

CAAGAAAGAA TAACATCTAC 800 

821 



2) INFORMATION FOR SEQ ID NO: 304 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 
30 (B) STRAIN: ATCC 49145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 

TTCCCAGTTG GCTATCTTCC AGATATTTAT AATGCTCTCA AGGTTGATAT 50 

35 CAACACCGTT GGAAACACGG AGGGAGATAC CGTCCACGAG ATTACATTGG 100 

AAGTTGAGCA GCACCTTGGT GATTCAACTG TGCGAGCAGT GGCACTTAAG 150 

CCTACGGACG GCTTGGTCCG TGGTGCTTTA GTGCGAGATA CTGGTGGCCC 200 

AATTTCTGTG CCTGTTGGAG ATGTTACAAA AGGTCACGTT TTTGACGTAA 250 

CTGGTAACAT TTTAAACGCT AAACC AGGCG AAAACATTGA GGTGACCGAG 300 

40 CGCTGGCCAA TCCACCGCAA CCCACCTGCT TTCGATCAGC TTGAGTCTAA 350 

GACTCAAATG TTTGAAACAG GCATTAAGGT TATCGATTTG CTTACGCCTT 4 00 

ACGTTCAGGG CGGAAAGATT GGTCTGTTCG GTGGTGCAGG CGTTGGTAAA 450 

ACTGTGTTGA TTCAGGAGAT GATTCAGCGC GTTGCACAGA ACCACGGCGG 500 

TGTGTCTGTG TTTGCTGGCG TTGGCGAACG TACTCGTGAG GGTAACGATT 550 

45 TGATTGGCGA AATGGCTGAG GCTGGCGTTT TGGAGAAAAC AGCGCTTGTC 600 

TTTGGTCAGA TGGATGAGCC TCCTGGGACT CGTCTTCGTG TGCCTCTTAC 650 

TGCTTTGACT ATGGCTGAGT ATTTCCGTGA TGTTCAGAAT CAGGATGTGT 700 

TGCTGTTTAT CGACAACATC TTCCGCTTTA CTCAGGCAGG TTCTGAGGTT 750 

TCCACGTTGC TTGGTCGTAT GCCTTCTGCA GTTGGTTATC AGCCAAACTT 800 

50 GGCGGATGAA ATGGGTGCGT TGCAGGAGCG CATTACTTCT ACGCGCGGTC 850 

ATTCTATTAC GTCG 864 



55 2) INFORMATION FOR SEQ ID NO: 305 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

182 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 

10 

TCGAATCAGG _GCATATGCCA _AAT^TATTAA_ ACGCTTTAGA AGTTTACATA 50 

GAAAAAGGCG ATGGGAAAAA AGAAAAATTA GTTCTTGAAG "TTTCTCTTGA 100 

AATTGGTGAT AACGTAGTAA GAACAATCGC TATGTCATCT ACTGATGGAT 150 

TAAATAGGGG AGCAGAAGTA GTAGATACAG GAGCACCAAT TACAGTTCCT 200 

15 GTAGGTAACT ACACATTAGG TCGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGT GAAGAAGCAG GAGCAGAAGT TCGTAAAGAT TCAATTCACA 300 

AAGAAGCTCC AACATTCGAT GAATTATCAA CTCACGTTGA GGTTCTTGAA 350 

ACAGGTATTA AAGTTATCGA CTTACTTGCA CCATATATTA AAGGTGGTAA 400 

AATCGGTCTT TTCGGTGGTG CGGGAGTTGG TAAAACGGTT CTTATCCAAG 450 

20 AACTTATCAA CAACGTTGCG CAACAACACG GTGGATTATC AGTATTCACA 500 

GGTGTAGGTG AGCGTACTCG TGAAGGAAAT GACTTATACT ATGAAATGAA 550 

AGATTCTGGT GTTATTAACA AAACAGCCAT GGTATTCGGA CAAATGAACG 600 

AACCACCAGG TGCTCGTATG CGTGTAGCAT TAACAGGATT AACAATGGCG 650 

GAATACTTCC GTGATGAAGA AGGACAAGAC GTGCTTCTAT TCATCGATAA 700 

25 CATTTTCCGT TTCACACAAG CAGGTTCTGA GGTTTCTGCG TTATTAGGAC 750 

GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC AGAGATGGGA 800 

CGTTTACAAG AACGTATAAC ATCAACTAAA AAAGGTTCTG TTACATCT 848 



30 



2) INFORMATION FOR SEQ 



ID NO: 306 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 



55 



50 



50 



TCGAATCAGG 
GAAAAAGGCG 
AATCGGGGAT 
TAAACAGAGG 
GTAGGTAACT 
TGACCACGGA 
AAGAAGCTCC 
ACAGGTATTA 
GATTGGACTA 
AACTTATTAA 
GGGGTAGGTG 
AGACTCTGGA 
AGCCACCAGG 
GAGTACTTCC 



GCATATGCCT AATCTACTAA 
ATGGAAAAAA AGAAAAATTA 
AATGTCGTAA GAACTATTGC 
GGCAGAAGTA GTTGATACTG 
ATACATTAGG ACGTGTGTTC 
GAAGAAGCTG GAGCAGAAGT 
AACTTTCGAA GAATTATCAA 
AAGTTATCGA CCTTCTTGCA 
TTCGGTGGTG CTGGAGTTGG 
CAACGTAGCA CAACAACACG 
AACGTACTCG TGAGGGTAAC 
GTTATTAATA AAACTGCCAT 
TGCACGTATG CGTGTTGCCT 
GTGATGAAGA AGGACAAGAC 



ACGCTTTAGA 
GTTCTTGAAG 
GATGTCATCT 
GAGCGCCAAT 
AACGTATTAG 
TCAAAAAGAA 
CACATGTTGA 
CCATATATTA 
GAAAACAGTT 
GAGGACTTTC 
GACTTGTACT 
GGTATTTGGT 
TAACAGGATT 
GTACTATTAT 



AGTTTATATA 
TTTCTCTTGA 
ACTGATGGAT 
TACAGTGCCA 
GTGAAGCAGT 
TCTATTCATA 
GGTATTAGAA 
AAGGTGGTAA 
CTTATCCAAG 
AGTATTTACT 
ATGAAATGAA 
CAAATGAATG 
AACAATGGCA 
TTATCGATAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
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TATCTTCCGT TTCACACAAG CAGGGTCTGA GGTATCTGCA TTATTAGGGC 750 
GTATGCCTTC AGCCGTTGGA TATCAACCAA CTCTTGCAAC AGAAATGGGA 800 
CGTCTTCAAG AACGTATTAC ATCAACTAAA AAAGGATCTG TTACATCT 848 



2) INFORMATION FOR SEQ ID NO: 307 

(i) SEQUENCE CHARACTERISTICS: 
L0 (A) LENGTH: 813 bases 
_ _ (B) TYPE: Nucleic acid 

(C) ~S TRANDEDNES S": ~ Doubl~e 

(D) TOPOLOGY: Linear 

L5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: - Haemophilus ducreyi 

(B) STRAIN: DSM 8925 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 



GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

25 TAGGTACCTC AGATGGTTTA AAGCGTAGCT TAAAGGTTGT AAATACAGGT 150 

AACCCTATTC AAGTTCCTGT AGGCACTAAA ACATTAGGCC GTATTATGAA 200 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 250 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 300 

ACTGAATTAC TTGAAACCGG TATCAAAGTT ATTGACTTAA TTTGTCCATT 350 

30 TGCAAAAGGT GGTAAAGTCG GCTTATTTGG TGGAGCTGGT GTAGGTAAAA 400 

CCGTTAATAT GATGGAATTG ATCCGTAATA TTGCTATTGA GCACTCAGGT 4 50 

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT 500 

TTATCATGAA ATGACGGATT CTAATGTATT AGATAAAGTA TCACTAGTAT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 600 

35 GGTTTAACTA TGGCTGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 650 

TTTCGTAGAT AATATTTATC GTTATACTTT AGCCGGTACA GAAGTTTCTG 700 

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 800 

AATCACGGCA GTA 813 

10 



2) INFORMATION FOR SEQ ID NO: 308 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 
55 (B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 

TGAATTTCCA CAAGATGCAG TGCCAAAAGT TTACGATGCA TTAAAAGTTG 50 
50 AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 

184 
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CGTTGTATCG CATTAGGTGC TTCTGACGGT TTAAAACGTG GTTTAAAAGT 150 

AGAAAACACG AATGATCCGA TTCAAGTACC GGTAGGCACA AAAACCCTTG 200 

GTCGTATCAT GAATGTATTG GGTGAACCAA TTGACGAACA AGGTCCAATC 250 

GGTGAAGAAG AGCGTTGGGC TATCCATCGT TCTGCACCAA GCTATGAAGA 300 

5 ACAATCAAAC AGTACGGAAT TATTAGAGAC TGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCAAAA GGTGGTAAAG TTGGTCTATT CGGTGGTGCG 400 

GGTGTAGGTA AAACCGTTAA CATGATGGAA TTAATCCGTA ACATCGCGAT 450 

CGAGCACTCA GGTTACTCCG TATTTGCGGG TGTAGGTGAA CGTACTCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGAAAG ATTCTAACGT ATTAGATAAA 550 

10 GTATCTTTGG TTTATGGTCA GATGAATGAG CCACCAGGTA ACCGTTTACG 600 

TGT TGCGTTA ACTGGTTTAA CCATGGCAGA AAAATTCCGC GATGAAGGTC 650 

GTGATGTATT ATTCTTCGTG "GATAATATCT "ATCGTTATAC "CCTTGCTGGT 70 0~ 

ACGGAAGTAT CTGCGTTATT AGGTCGTATG CCATCTGCGG TAGGTTACCA 750 

ACCAACTCTT GCTGAAGAAA TGGGTGTGTT ACAAGAACGT ATCACTTCAA 800 

15 CCAAAACAGG TTCTATTACA TCTGTA > 826 



2) INFORMATION FOR SEQ ID NO: 309 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 

35 GATGCAGTAC CAAAAGTATA TGATGCGTTA AAAGTTGAAT CAGGTTTAAC 50 

GCTTGAAGTT CAACAACAAT TAGGCGGTGG CTTAGTGCGC TGTATCGCAT 100 

TAGGTACGTC TGATGGTTTA AAACGTGGCT TAAAAGTAGA AAATACAGGC 150 

AACCCAATTG AAGTGCCAGT GGGCACTAAA ACCCTTGGTC GTATTATGAA 200 

CGTATTGGGT GAGCCGATTG ACGAAAAAGG TCCTATCGGT GAAGAAGCAC 250 

40 GCTGGGCAAT CCACCGTGCA GCACCAAGCT ACGAAGAGCA ATCAAATAGC 300 

ACGGAATTAC TCGAAACAGG TATCAAAGTT ATCGACTTAA TCTGCCCATT 350 

CGCAAAAGGG GGTAAAGTTG GTTTATTTGG TGGTGCAGGT GTAGGTAAAA 4 00 

CCGTAAATAT GATGGAGTTA ATCCGTAACA TCGCGATCGA ACACTCTGGT 450 

TACTCTGTAT TTGCAGGGGT AGGTGAGCGT ACTCGTGAAG GTAATGACTT 500 

4 5 CTACCACGAA ATGACAGACT CTAACGTATT AGATAAAGTA TCGTTAGTGT 550 

ATGGTCAAAT GAACGAACCA CCAGGTAACC GTTTACGCGT AGCTTTAACA 600 

GGCTTAACCA TGGCGGAAAA ATTCCGCGAT GAAGGTCGTG ACGTATTATT 650 

CTTCGTCGAT AACATCTACC GTTATACCCT AGCAGGTACG GAAGTGTCAG 700 

CACTTCTCGG TCGTATGCCA TCTGCGGTAG GTTATCAGCC AACCTTAGCA 750 

50 GAAGAAATGG GTGTATTACA AGAGCGTATC ACTTCAACCA AAACTGGTTC 800 

TATCACCTC 809 



55 2) INFORMATION FOR SEQ ID NO: 310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE : Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parainfluenzae 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 

10 

CGAATTTCCA_CAAGATGCAG TAC CAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG~TGGTGTGGTA TOO— 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 200 

15 GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 250 

GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 300 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 400 

GGTGTAGGTA AGACCGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 4 50 

20 TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 550 

GTATCCCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 600 

TGTTGCTTTA ACCGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 700 

25 ACTGAAGTAT CGGCACTTTT AGGCCGTATG CCATCAGCGG TAGGTTATCA 750 

GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATTACATCAA 800 

CCAAAACAGG TTCTATTACT TCTG 824 



30 



2) INFORMATION FOR SEQ 



ID NO: 311 



35 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 811 bases 

(B) TYPE': Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 



50 



55 



60 



GCCGTGCCTA 
ACTGGTACTG 
TCGCTATGGG 
CTGGAGCACC 
TATGAACGTA 
AAGATCGCTG 
AACTCGCAAG 
TCCGTTCGCT 
GTAAAACAGT 
TCAGGTTACT 
CGACTTCTAC 
TGGTTTATGG 
CTGACCGGTC 
ACTGCTGTTC 



AAGTGTATAA CGCACTTGAG 
GAAGTTCAGC AGCAGCTAGG 
TACTTCTGAC GGTCTGCGTC 
CGATTGAAGT CCCAGTAGGT 
CTGGGTGAGC CAATTGATAT 
GGCTATTCAC CGTGAAGCTC 
AACTGCTGGA AACTGGTATC 
AAGGGCGGTA AAGTTGGTCT 
AAACATGATG GAGCTGATCC 
CTGTATTTGC CGGCGTGGGT 
CACGAAATGA CCGACTCCAA 
CCAGATGAAC GAGCCACCAG 
TGACTATGGC TGAGAAGTTC 
ATCGATAACA TCTACCGTTA 



GTGAAAGGCG GTGCCACTAA 
CGGCGGCGTT GTACGCTGTA 
GCGGACTGGA CGTTGTTGAC 
AAAGCGACCT TAGGCCGCAT 
GAAGGGTGAT ATCGGCGAAG 
CAAGCTACGA AGAACTGTCT 
AAGGTAATGG ACCTGATTTG 
GTTCGGTGGT GCGGGTGTTG 
GTAACATCGC GATCGAGCAC 
GAACGTACTC GTGAGGGTAA 
CGTATTGGAC AAAGTATCAC 
GAAACCGTCT GCGCGTTGCG 
CGTGACGAAG GTCGTGACGT 
TACCTTGGCC GGTACCGAAG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
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TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 750 
CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC 800 
GGGTTCAATC A 811 

5 

2) INFORMATION FOR SEQ ID NO: 312 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 831 bases 
(B) TYPE: Nucleic acid 

(C) STRANDEDNE S S ~T~Dbubl e — 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA< 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 

(B) STRAIN: ATCC 23330 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 

GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC 50 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGGCGTAG TGCGTACCAT 100 

25 TGCGATGGGT AGTACCGATG GTTTGAAACG CGGCTTAGCC GTGAACAACA 150 

CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 200 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 250 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 300 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 350 

30 CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 4 00 

CAAAACTGTG AACATGATGG AGTTGATTAA CAACATTGCC AAAGCGCACA 450 

GTGGTTTGTC TGTATTTGCA GGCGTGGGTG AACGTACTCG CGAAGGTAAT 500 

GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGCCAT 550 

GGTGTATGGT CAAATGAATG AACCTCCTGG CAACCGTTTG CGCGTTGCAT 600 

35 TGACTGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AGACGAAAAT 650 

GGCAAAGGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 700 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 750 

TAGGTTATCA ACCAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACTTCAA CGCAAACAGG TTCGATTACT T 831 

10 



2) INFORMATION FOR SEQ ID NO: 313 

J 5 ( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 
55 (B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 

ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATGAA 50 
50 GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGTA TCGTACGTAC 100 

187 
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CATCGCCATG 
ACCTCGAGCA 
ATCATGAACG 
AGAAGAGCGT 
5 CCAGCTCTCA 
TGTCCGTTCG 
AGGTAAAACT 
ACTCCGGTTA 
AACGACTTCT 
LO CCTGGTGTAC 
CGCTGACCGG 



GTACTGCTGT 
AGTATCCGCG 
CCCTGGCGGA 
15 ACCGGTTCTA 



GGTTCTTCTG 
CCCGATCGAA 
TACTGGGTCA 
TGGGCTATCC 
GGAACTGCTG 
CCAAGGGCGG 
GTAAACATGA 
CTCTGTGTTT 
ACCACGAAAT 
GGCCAGATGA 
CCTGACCATG 
TCGTCGATAA 
CTGCTGGGTC 
AGAGATGGGC 
TC 



ATGGTCTGCG 
GTCCCGGTAG 
ACCGGTTGAC 
ACCGCGCGGC 
GAAACCGGCA 
TAAAGTTGGT 
TGGAGCTGAT 
GCGGGCGTAG 
GACCGACTCC 
ACGAGCCGCC 
GCTGAGAAAT 
CATCTATCGT 
GTATGCCTTC 
GTTCTGCAGG 



CCGCGGTCTG 
GTAAAGCAAC 
ATGAAAGGCG 
ACCGTCCTAT 
TCAAAGTTAT 
CTGTTCGGCG 
CCGTAACATC 
GTGAGCGTAC 
AACGTTATCG 
GGGAAACCGT 
TCCGTGACGA 
"TACACCCTGG 
AGCGGTAGGT 
AACGTATCAC 



PCT/CA00/01150 

GATGTAAAAG 150 

GCTGGGTCGT 200 

ACATCGGCGA 250 

GAAGAGCTGT 300 

CGACCTGATG 350 

GTGCGGGTGT 400 

GCGATCGAGC 450 

TCGTGAGGGT 500 

ATAAAGTATC 550 

CTGCGCGTTG 600 

AGGTCGTGAC 650 

"CCGGTACTGA 700 

TATCAGCCGA 750 

CTCCACCAAA 800 
812 



2) INFORMATION FOR SEQ ID NO: 314 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

35 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50 

AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 200 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 250 

10 AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 300 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 4 50 

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

15 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 550 

CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG CCGGTACTGA 700 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

50 CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



55 2) INFORMATION FOR SEQ ID NO: 315 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

188 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 
_GTAC.CG,C_GCG_ TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 50 



GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT~CGTACCATCG~" 100" 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC 150 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 200 

L5 GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGACATC GGCGAAGAAG 250 

AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 300 

TCTCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 350 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

20 GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGATAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCGCTG 600 

ACCGGCCTGA CCATGGCTGA GAAGTTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTC GATAACATCT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT 700 

25 CCGCACTGCT GGGTCGTATG CCTTCAGCGG TAGGTTACCA GCCGACTCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACGGG 800 

TTCTATCACT TCC 813 



30 



10 



2) INFORMATION FOR SEQ ID NO: 316 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA 50 

GAGCCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 100 

CTATCGCCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTAAA 150 

50 GACCTTGAGC ACCCGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTGCTGGGTC AGCCGATCGA TATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGAGCTG 300 

TCCAGTTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCTAAGGGCG GTAAAGTAGG TCTGTTCGGT GGTGCGGGCG 400 

55 TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCTCTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

50 CGTTCTGCTG TTCGTCGATA ACATCTATCG TTATACCCTG GCCGGTACTG 700 
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AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCCACCAA 800 
AACCGGTTCT ATCACTTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 317 

( i ) SEQUENCE CHARACTER I STICS : 
10 (A) LENGTH: 785 bases 
CB)__TYEE:_ Nucleic __acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 



AGAATGGTAA TGAAGTTCTG 
GGTATCGTAC GTACCATCGC 

25 TCTGGATGTA AAAGACCTCG 
CAACGCTGGG TCGTATCATG 
GGCGACATCG GCGAAGAAGA 
CTATGAAGAG CTGTCCAGCT 
TTATCGACCT GATGTGTCCG 

30 GGCGGTGCGG GTGTAGGTAA 
CATCGCGATC GAGCACTCCG 
GTACTCGTGA GGGTAATGAC 
ATCGATAAAG TATCCCTGGT 
CCGTCTGCGC GTTGCGCTGA 

35 ACGAAGGTCG TGACGTACTG 
CTGGCCGGTA CTGAAGTATC 
AGGTTATCAG CCGACCCTGG 
TCACCTCCAC CAAAACCGGT 



GTGCTGGAAG TTCAGCAGCA 
CATGGGTTCT TCTGATGGTC 
AGCACCCGAT CGAAGTCCCG 
AACGTACTGG GTCAACCGGT 
GCGTTGGGCT ATCCACCGCG 
CTCAGGAACT GCTGGAAACC 
TTCGCCAAGG GCGGTAAAGT 
AACTGTAAAC ATGATGGAGC 
GTTACTCTGT GTTTGCGGGC 
TTCTACCACG AAATGACCGA 
GTACGGCCAG ATGAACGAGC 
CCGGCCTGAC CATGGCTGAG 
CTGTTCGTCG ATAACATCTA 
CGCACTGCTG GGTCGTATGC 
CGGAAGAGAT GGGCGTTCTG 
TCTATCACCT CCGTA 



GCTGGGCGGC 50 

TGCGCCGCGG 100 

GTAGGTAAAG 150 

TGACATGAAA 200 

CGGCACCGTC 250 

GGCATCAAAG 300 

TGGTCTGTTC 350 

TGATCCGTAA 400 

GTAGGTGAGC 450 

CTCCAACGTT 500 

CGCCGGGAAA 550 

AAATTCCGTG 600 

TCGTTACACC 650 

CTTCAGCGGT 700 

CAGGAACGTA 750 
785 



10 



2) INFORMATION FOR SEQ ID NO: 318 

( i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 759 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 



CTGGTGCTGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWCCAT 50 

CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 

50 TCGAGCACCC AATCGAAGTT CCGGTMGGTA AAGCAACMCT GGGTCGTATC 150 

190 
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ATGAACGTAC TGGGTCAKCC AGTMGACATG AAAGGCGACA TCGGTGAA5a 200* 

AGAGCGTTGG GCTATCCACC GCGCTGCACC TTCCTACGAA GAGCTGTCTA 250 

GCTCTCAGGA ATTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT 300 

CCGTTCGCTA AGGGCGGTAA AGTCGGTCTG TTCGGTGGTG CSGGTGTTGG 350 

5 TAAAACCGTA AACATGATGG AGCTGATCCG TAACATCGCG ATCGAGCACT 4 00 

CCGGTTACTC CGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGTAAC 450 

GACTTCTACC ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT 500 

GGTATATGGC CAGATGAACG AGCCACCGGG AAACCGTCTG CGCGTTGCTC 550 

TGACCGGTCT GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTA 600 

LO CTGCTGTTCG TCGATAACAT CTATCGTTAC ACCCTGGCCG GTACTGAAGT 650 

ATCTGCWCTG- _CTGGGT_CGTA ^TGCCTT CAGC GGTAGGTTAC CAGCCGACCC 700 

TGGCGGAAGA GATGGGCGTT CTGCAGGAAC GTATCACCTC "CACCAAGACC ~ 750" 

GGTTCTATC 7 59 



L5 



)0 



2) INFORMATION FOR SEQ ID NO: 319 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

IS (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 



TTCCCTCAGG ATGCCGTACC GCGTGTGTAC GAAGCCCTTG AGGTTCAGAA 50 

TGGTAATGAA GTGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

15 TCGTACGTAC CATCGCTATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

ACTGGGTCGT ATCATGAACG TACTGGGCCA ACCGGTAGAC ATGAAAGGCG 250 

ACATCGGTGA AGAAGAACGT TGGGCTATCC ACCGTGCAGC ACCTTCCTAC 300 

GAAGAGCTGT CAAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

10 CGACCTGATG TGTCCGTTTG CGAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCCGGTTA TTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTTTC CCTGGTTTAC GGCCAGATGA ACGAGCCACC AGGAAACCGT 600 

15 CTGCGCGTTG CGCTGACTGG TCTGACTATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGCGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACAGA AGTATCTGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 

50 



2) INFORMATION FOR SEQ ID NO: 320 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 
5 (B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

10 GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 100 

_____ JTCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT TCCGGTAGGT~AAAGCAACAC "TGGGTCGTAT —200" 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 300 

L5 AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

20 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

25 CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 321 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 

45 TCGATAAGAA TTTACCTGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50 

GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACJCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTACCGA TGGTCTTCGT CGTGGTATGA 150 

AAGTCGAAGA TACTGGCGCT CCAATTTCAG TTCCAGTTGG AGAAGACACT 200 

TTAGGTCGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTCCAGC 250 

50 CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 300 

ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 350 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 400 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 450 

TCGCTCAAGA ACACGGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 500 

55 ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 550 

AAGTAAGACT GCCATGGTAT TTGGTCAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA CTTTAGAGAT 650 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 700 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 750 

SO TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT GCAGGAAAGA 800 
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ATTACTTCTA CTAAGAAGGG TTCAATTACT TCAA 
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5 2) INFORMATION FOR SEQ ID NO: 322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

_(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 



TTCCTCGTGA TAGCGTGCCT AAAGTCAATG ATGCGTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGCAA CTTGGAGACG GAGTTGTGCG 100 

TACTATTGCC ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG C CATC CT ATT CAAGTGCCAG TAGGTAAGAA AACTTTGGGA 2 00 

25 CGCATTATGG ATGTTCTTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 250 

TGCTGAAGAG ACTTGGGCTA TTCATCGTAA AGCACCAAGT TATGAAGAGC 300 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 350 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 400 

TGTAGGCAAA ACCGTTAACA TGATGGAATT AATACGAAAC ATTGCAATTG 450 

3 0 AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TACCCGTGAA 500 

GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT TATGGTCAGA TGAATGAGCC GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGCTGAAA AATTCCGGGA TGAAGGGCGA 650 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

3 5 TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAGCGCAT TACCTCCACA 800 

AAAACAGGTT CTATTACTTC CATA 824 



40 



50 



2) INFORMATION FOR SEQ ID NO: 323 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 818 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 50 

GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 100 

TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 150 

60 CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 200 
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CATGAACGTG CTTGGTCACC CTATCGACAT 

AAGAGCGTTG GGCTATTCAC CGCGCAGCGC 

GGCGCAACCG AGCTGCTGGA GACCGGCATC 

TCCGTTCGCC AAGGGCGGTA AAGTCGGCCT 

5 GTAAAACCGT AAACATGATG GAGCTCATTC 

TCCGGTTACT CCGTGTTTGC AGGCGTAGGT 

CGACTTCTAC CACGAAATGA CTGAATCCAA 

TGGTATACGG TCAGATGAAC GAGCCGCCTG 

TTAACGGGCT TGACCATGGC GGAGAAGTTC 
10 TCTGCTGTTT ATCGACAACA TTTACCGCTA 

TATCCGCAC T GCTGG GCCGT ATGCCTTCAG 

CTGGCTGAGG AAATGGGCGT GCTTCAAGAG 
GGGGTCTATC ACCTCCGT 



PCT/CAOO/01150 



GAAGGGCGAC 


tv m/-i r~\r~* ta TV /"^ ' 

ATCGGCGAAG 


Z bU 


CGAGCTACGA 


AGACCTGTCG 


jUU 


AAGGTTATCG 


t% onmn tv mmrn/^ 

ACCTGATTTG 


OCA 

350 


GTTCGGCGGC 


GCCGGCGTAG 


400 


GCAACATTGC 


/""i TV TV ^"T TV /"* 

GACCGAGCAC 


A C f\ 

450 


GAACGTACCC 


GTGAGGGTAA 


500 


CGTATTGGAC 


AAGGTGTCGC 


r r a 

550 


/-l TV TV TV 0 /TT" 1 / r ""T» 

GAAACCGTCT 


GCGCGTAGCG 


/r rv a 

600 


/^l/^tTl/^TV rn/TTV TV O 

CGTGATGAAG 


GCCGTGACGT 


650 


TACCCTGGCC 


GGTACGGAAG 


700 


CCGTAGGCTA 


CCAGCCGACT 


750 


CGTATTACCT" 


CTACCAAGAC 


-800" 






818 



15 



2) INFORMATION FOR SEQ ID NO: 324 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 

GTGGAAACTT ACCTGAAATC TACAATGCCC TAGTTATTGA ATATAAATCT 50 

GATGCAGAAG AAGCACCAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

35 TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCCCAAT TACAGTTCCT 200 

GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 250 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTCACC 300 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATTCTTGAA 350 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 400 

40 AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 450 

AGCTTATCCA TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 550 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 600 

AGCCACCAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

45 GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA 7 00 

CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 750 

GTATGCCATC TGCGGTAGGT TACCAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 



50 



2) INFORMATION FOR SEQ ID NO: 325 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Micrococcus lylae 

(B) STRAIN: ATCC 27566 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 50 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 100 

10 GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150 

CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 200 

" — GTGfiCGTAGT~CAKGGGA"CA"C~GTCTTCAA"CG~CTCTGGGTGA-TTGCCTCGAT 250- 

GAGCCAGGTC TTGGCCGTGA CGGTGAGCAG TGGGGCATCC ACCGCGATCC 300 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 350 

15 TTAAGGTCAT CGACCTGCTG ACCCCGTATG TTAAGGGCGG CAAGATCGGC 400 

CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 4 50 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 500 
GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG , 550 

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGCCAGATGG ACGAGCCTCC 600 

20 AGGAGTTCGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GCGGAGTACT 650 

TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 700 

CGTTTCACCC AGGCAGGTTC CGAGGTTTCC ACCCTCCTAG GCCGCATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AGACGAGATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 828 

25 



2) INFORMATION FOR SEQ ID NO: 326 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 
10 (B) STRAIN: ATCC 35017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGCCGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

15 AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 100 

GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATCCGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 200 

TATCATGAAC GTGCTGGGTG ACCCAATCGA CATGAAAGGT GATATCGGCG 250 

AAGAAGAACG CTGGTCAATT CACCGCGCAG CACCAAGCTA TGAAGATCTG 300 

50 GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGACCTGAT 350 

TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 4 00 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 4 50 

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTCGTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 550 

55 CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600 

GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATACCTTA GCAGGGACAG 700 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 800 

50 GACCGGCTCT ATCACTTCCG TA 822 
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2) INFORMATION FOR SEQ ID NO: 327 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 

20 CCGTGGCGAT GTCCCCCAAA TCTTTGATGC ACTTCATGTT GATGGTACTG 50 

AAACCACCCT TGAAGTCCAA CAACAGTTAG GTGATGGTGT GGTGCGTACC 100 

ATTGCCATGG GTTCTACCGA AGGCTTAAAG CGTGGCTTGC CTGTCTCTAA 150 

TTCAGGTGCA CCCATTTCGG TACCAGTCGG TCAAGCAACA CTGGGTCGCA 200 

TTATGGATGT CCTAGGTCGC CCAATCGATG AAGCAGGTCC GGTAAATGCT 250 

25 GAACAAAAAT GGTCCATTCA TCGTGAAGCA CCAAGTTATG ATGAACAGTC 300 

AAATAGTACA GAACTTTTAG AAACAGGCAT CAAAGTGATT GATTTGCTTT 350 

GTCCATTTGC CAAAGGTGGT AAAGTCGGTC TGTTCGGTGG TGCTGGTGTT 400 

GGTAAGACCG TTAACATGAT GGAGCTTATC AATAATATCG CCCTAAAACA 450 

CTCAGGTCTG TCGGTTTTTG CTGGTGTGGG TGAGCGTACT CGTGAGGGTA 500 

30 ATGACTTCTA CCATGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAT 550 

TTTACTCAGT CAAAAGTTGC CATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT CTGCGTGTTG CCTTAACTGG TTTGACCATG GCAGAGTATT 650 

TCCGTGATGA AAAAGACGAA GCAACGGGCA AAGGCCGTGA TGTTCTGCTG 700 

TTCGTTGATA ATATTTATCG TTACACATTG GCAGGTACTG AGGTATCAGC 750 

35 ACTTTTAGGT CGTATGCCAT CTGCGGTAGG TTATCAGCCG ACTTTGGCCG 800 

AAGAGATGGG CTTGCTACAA GAGCGTATCA CCTCCACCCA ATCAGGCTCA 850 

ATTA 854 



40 



50 



2) INFORMATION FOR SEQ ID NO: 328 



(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 150 

50 CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200 
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TCATGGACGT TTTAGGACAC CCAATCGATG AGGCAGGTCC GGTAGA'GCiST 350 

AGTAACACTT GGGCGATTCA CCGTGAAGCG CCAAGCTATG ATGAACAATC 300 

AAACTCTACT GAACTTTTAG AAACCGGTAT TAAAGTAATT GACTTACTAT 350 

GCCCATTTGC TAAAGGTGGT AAAGTCGGTC TGTTCGGTGG CGCGGGTGTT 4 00 

5 GGTAAAACCG TTAACATGAT GGAACTTATC AATAACATCG CAAAAGCACA 450 

CTCAGGTTTA TCGGTATTTG CTGGTGTAGG TGAGCGTACT CGTGAAGGTA 500 

ATGACTTCTA CCACGAGATG AAAGACTCAA ACGTACTTGA TAAAGTTGCG 550 

ATGGTGTATG GTCAGATGAA TGAGCCACCA GGAAACCGTT TACGTGTTGC 600 

CCTGACAGGT TTAACCATGG CAGAATACTT CCGTGACGAA AAAGATGAAA 650 

10 ACGGTAAAGG TCGTGACGTA TTATTGTTCG TTGACAATAT TTATCGTTAC 700 

ACGCTAGCGG_GTACCG AAGT AT CAGCATTAJTTAGGTCGTA TGCCATCTGC 750 

AGTAGGGTAT CAGCCAACGC TTGCAGAAGA GATGGGTGTA - CTACAAGAAC 800~ 

GTATTACTTC AACCCAATCA GGCTCTATTA C 831 



15 



30 



2) INFORMATION FOR SEQ ID NO: 329 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH : 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 ( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 



CGAATTTCCT CAGGATGCAG TACCGAAAGT GTACGATGCG CTTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

35 GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 150 

TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 200 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTGATCCGAT CGATATGAAA 250 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 300 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 350 

10 TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 400 

GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCTGT ATTCGCAGGG GTCGGTGAGC 500 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGGCCAG ATGAACGAGC CACCGGGAAA 600 

15 CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTCCGTG 650 

ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTATACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 750 

AGGTTACCAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 835 

50 



2) INFORMATION FOR SEQ ID NO: 330 

55 (i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 
5 (B) STRAIN: ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

GACGCGGTAC CGCAAGTGTA CAGCGCCCTC GAGGTTATGA ATGGTGATGC 50 

10 GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGC GTAGTACGTA 100 

CGAT-CGCAAT-GGGTACGTCT^GACGGCCT GA AGCGT GGTCT GAGCGTCAAC 150 

GACCTGCAGA AACCGATTCA GGTACCCGTC GGTAAAGCGA CCCTGGG'CCG 200 

TATCATGAAC GTTCTCGGCG AGCCAATCGA TATGAAAGGC GAGCTGAAAG 250 

AAGAAGATGG CAGCGCAGTA GAGATCGCCT CTATTCACCG CGCAGCCCCT 300 

15 TCTTATGAAG ATCAGTCTAA CTCGCAGGAA' CTGCTGGAAA CCGGCATCAA 350 

GGTTATCGAC CTGATGTGTC CGTTTGCTAA AGGCGGTAAA GTCGGTCTGT 4 00 

TCGGTGGTGC GGGTGTAGGT AAAACCGTCA ACATGATGGA ACTGATCCGT 4 50 

AACATCGCGG CTGAACACTC AGGTTACTCA GTGTTTGCCG GTGTGGGTGA 500 

GCGTACTCGT GAGGGTAACG ACTTCTACCA CGAAATGACT GACTCTAACG 550 

2 0 TTATCGATAA AGTTGCACTG GTCTATGGCC AGATGAACGA GCCGCCGGGT 600 

AACCGTCTGC GCGTAGCACT GACCGGTCTG ACCATGGCGG AAAAATTCCG 650 

TGATGAAGGT CGCGACGTTC TGCTGTTCAT CGATAACATC TACCGTTATA 700 

CCCTGGCCGG TACAGAAGTT TCTGCACTGC TGGGTCGTAT GCCATCTGCG 750 

GTAGGTTACC AGCCAACGCT GGCAGAAGAG ATGGGTGTGT TGCAGGAGCG 800 

25 TATTACCTCC ACCAAGACCG GTTC 824 



2) INFORMATION FOR SEQ ID NO: 331 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Pantoea dispersa 

(B) STRAIN: ATCC 14589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

45 TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50 

GTACAGCAGC AGCTGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 200 

GGTGAGCCAA TCGACATGAA AGGCGACCTG AAAGAAGAAG ACGGCAGCGC 250 

50 TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 300 

CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 350 

TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 450 

ACTCAGGTTA TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

55 AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550 

GCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 600 

CACTGACCGG TCTGACCATG GCGGAAAAAT TCCGTGATGA AGGCCGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 700 

GGTTTCTGCA CTGCTGGGTC GTATGCCATC GGCGGTAGGT TATCAGCCAA 750 

50 CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800 
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5 2) INFORMATION FOR SEQ ID NO: 332 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS: Double 

(D) TQP.OLOGY.:_Jiine_ar 



20 



(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 



GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 50 

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 100 

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 150 

AACCCAATTT CTGTTCCAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 200 

25 CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GCAGAAGAGA 250 

ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA ATCTAACAGT 300 

ACTGAACTTT TAGAAACGGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 350 

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 4 00 

CCGTCAATAT GATGGAATTA ATCCGTAACA TCGCAATTGA GCACTCAGGT 450 

30 TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 500 

CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550 

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 600 

GGCTTAACTA TGGCGGAAAA ATTCCGTGAT GAAGGTCGTG ATGTCTTATT 650 

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA GAAGTTTCTG 700 

35 CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTTCTGCA AGAGCGTATT ACCTCAACCA AAACAGGTTC 800 

TATTA 805 



10 



50 



2) INFORMATION FOR SEQ ID NO: 333 



(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 8 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 

(B) STRAIN: ATCC 49100 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 

TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

50 GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 200 

199 



SUBSTITUTE SHEET (RULE 26) 
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GGGCCGTATT ATGAACGTGC TCGGTCAGCC AATCGATATG AAAGGCGATA 250 

TTGGTGAAGA AGAGCGTTGG GCTATTCACC GTGAAGCACC AAGTTATGAA 3 00 

GATTTATCTG GCGCCAATGA ACTGCTGGAA ACGGGTATCA AGGTTATCGA 350 

CCTGATTTGT CCGTTTGCTA AAGGTGGTAA AGTTGGTCTG TTTGGTGGTG 400 

5 CGGGTGTAGG TAAAACCGTA AACATGATGG AGCTGATTCG TAACATTGCG 450 

ACTGAGCACT CAGGTTACTC CGTATTCGCC GGTGTAGGGG AACGTACCCG 500 

TGAAGGTAAT GACTTCTACC ACGAAATGAC CGAATCAAAC GTACTGGATA 550 

AAGTATCTCT GGTTTATGGC CAGATGAACG AGCCACCAGG AAACCGTCTG 600 

CGCGTGGCGT TAACGGGTTT GACCATGGCT GAAAAATTCC GTGATGAAGG 650 

LO TCGTGACGTT CTGTTATTTA TCGATAACAT TTATCGCTAT ACCTTAGCCG 700 

__ j3TACCGMGT_M , CA^ GGTAGGTTAT 750 

CAGCCAACGT TAGCAGAAGA GATGGGTGTG TTGCAGGAAC ~ GTATTACTTC 800 

AACCAAAACC GGTTCAATCA CTTCTGTA 828 



L5 



30 



2) INFORMATION FOR SEQ ID NO: 334 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50 

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 100 

IS CAATGGGTAC ATCAGACGGT TTAAGCCGTG GCTTAAAGGT TGAAGATTTA 150 

GGCCACCCAA TTGAAGTACC AGTAGGTAAA GCGACTTTAG GACGTATCAT 2 00 

GAACGTTCTG GGTACACCTA TTGATATGAA AGGTGAGATT GAAACCGAAG 2 50 

AGCGTTGGTC AATCCACCGT GAAGCACCAA CTTACGAAGA GTTATCAAAC 3 00 

TCTCAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 350 

10 ATTTGCTAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 4 00 

AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACTCA 450 

GGTTACTCTG TATTTGCTGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500 

CTTCTATCAT GAAATGACAG ATTCTAACGT TCTTGACAAA GTATCGTTAG 550 

TTTACGGTCA GATGAATGAG CCACCAGGAA ACCGTCTGCG TGTTGCACTG 600 

15 ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650 

GTTATTCGTC GATAACATCT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 700 

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA ACCAACATTG 750 

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA CCAAAACAGG 800 

TTCTATC 807 

50 



2) INFORMATION FOR SEQ ID NO: 335 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

200 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 
5 (B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 



CCCTAAAGTA TACGACGCTC TTGAGGTTAT GAATGGTAAA GAGAAACTGG 50 

L0 TGCTAGAAGT TCAGCAACAG TTAGGCGGTG GTATCGTTCG TTGTATCGCA 100 

ATGGGT ACAT CAGACGGT^ AAGCCGTGGC TTAAAAGTTG AAAACTTAGG 150 

CCACCCAATT GAAGTACCAG TAGGTAAAGC "AACACTGGGA" "CGTATCATGA" 200~ 

ACGTTCTGGG TACACCTATC GATATGAAAG GTGATATTGC AACTGAAGAA 250 

CGTTGGTCTA TTCACCGCGA AGCGCCAACC TATGAAGAGT TATCAAGCTC 300 

15 TCAAGAACTA CTAGAAACCG GTATCAAAGT AATGGACTTA ATCTGTCCGT 350 

TTGCTAAAGG TGGTAAAGTA GGTCTCTTCG GTGGTGCGGG TGTTGGTAAA 400 

ACAGTTAACA TGATGGAATT GATCCGTAAC ATCGCGATTG AGCACTCAGG 450 

TTATTCTGTA TTTGCAGGTG TTGGTGAGCG TACTCGTGAG GGTAACGACT 500 

TCTATCATGA AATGACAGAT TCTAACGTTC TTGACAAAGT ATCGTTAGTT 550 

20 TATGGTCAGA TGAATGAGCC ACCAGGAAAC CGTCTACGTG TAGCACTGAC 600 

GGGTTTAACC ATGGCGGAAA AATTCCGTGA TGAAGGCCGT GACGTACTGT 650 

TATTCGTCGA TAACATCTAT CGTTACACCT TAGCCGGTAC CGAAGTATCA 70 0 

GCACTGTTAG GCCGTATGCC ATCAGCAGTA GGTTACCAAC CAACATTGGC 750 

TGAAGAGATG GGTGTTCTGC AAGAACGTAT CACTTCAACC AAAACAGGTT 800 

25 CAATCACCTC T 811 



2) INFORMATION FOR SEQ ID NO: 336 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Providencia alcalifaciens 

(B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 

15 TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 2 00 

GACGTATCAT GAACGTTCTG GGTGAACCAA TCGACATGAA AGGTGATATC 250 

50 GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 300 

ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 350 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTCTGTT CGGTGGTGCG 4 00 

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 4 50 

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 500 

55 AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 550 

GTATCACTGG TTTATGGCCA GATGAACGAG CCACCAGGAA ACCGTCTGCG 600 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650 

GTGACGTACT GCTGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 700 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA 750 

50 ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 800 

201 
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PCT/CAOO/01150 



5 2) INFORMATION FOR SEQ ID NO: 337 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS: Double 

_(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 



TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 100 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 150 

GAAGTTGTAA ACTTAGAGCA CCCAATTGAA GTACCAGTAG GTAAAGCAAC 200 

25 TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 250 

ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 300 

GAAGAGTTAG CTAACTCAAC AGAGCTGCTG GAAACCGGTA TCAAAGTAAT 350 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 450 

30 GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 500 

TCGTGAAGGG AACGACTTCT ATCATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGCCAGATGA ATGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGTTGACTGG TCTGACTATG GCTGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

J 5 CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 830 



ao 



50 



2) INFORMATION FOR SEQ ID NO: 338 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 

AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

50 TTAGAACACC CAATTGAAGT ACCAGTAGGT AAAGCAACTC TGGGACGTAT 200 

202 
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CATGAACGTT CTGGGTGACC CTATTGATAT 
AAGAGCGCTG GTCTATTCAC CGTTCAGCGC 
AACTCAACAG AACTGCTAGA AACCGGTATC 
TCCATTCGCG AAAGGTGGTA AAGTTGGTCT 
5 GTAAAACAGT AAACATGATG GAACTGATCC 
TCAGGTTACT CAGTATTCGC TGGTGTTGGT 
CGACTTCTAT CATGAAATGA CTGATTCTAA 
TGGTTTATGG CCAGATGAAC GAGCCACCAG 
CTGACTGGTC TGACTATGGC TGAAAAATTC 
10 ACTGCTGTTC GTTGACAACA TTTATCGTTA 

TATCAGCACT GTTAGGTCGT ATGCCTTCAG 

TTGG C AG AAG — AGAT GGGTGT "TCTACAAGAA" 
CGGTTCTATC AC 



PCT/CA00/01150 



GAAAGGTGAT 


ATCGGCGAAG 


250 


CAAGCTATGA 


tv tv mm tv / ii i v 

AGAATTAGCT 


300 


AAAGTAATGG 


ACTTAATCTG 


350 


GTTCGGTGGT 


GCGGGTGTTG 


400 


GTAACATCGC 


GATTGAGCAC 


450 


GAGCGTACCC 


GTGAAGGTAA 


500 


CGTTCTGGAT 


AAAGTATCAC 


550 


GAAACCGTCT 


GCGTGTTGCG 


600 


CGTGACGAAG 


GTCGTGACGT 


650 


TACACTGGCA 


GGTACTGAAG 


700 


CGGTAGGTTA 


TCAGCCAACA 


750 


"CGTATCACTT 


"CTACCAAAAC 


800 






812 



15 

2) INFORMATION FOR SEQ ID NO: 339 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 

(B) STRAIN: ATCC 33672 

JO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 



TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

35 CGTTGTATCG CAATGGGTAC ATCAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

TAAAAATTTA GAACACCCAA TTGAAGTACC AGTAGGTAAA GCAACACTCG 200 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 250 

GGCGAAGAAG AGCGTTGGTC TATTCACCGC GCTGCACCAA GCTACGAAGA 300 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 350 

10 TGATCTGTCC ATTCGCGAAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 400 

GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGACAG ATTCAAACGT TCTTGACAAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

15 CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

GTGATGTTCT GTTGTTCGTG GATAACATCT ATCGTTATAC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 750 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 800 

CTAAGACAGG TTCTATCAC 819 

50 



2) INFORMATION FOR SEQ ID NO: 340 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

203 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 



AACCGCACTG 
10 CGAAACCACC 
CTATTGCAAT 



ACGTGCCTCA 
CTAGAAGTTC 
GGGATCTACT 



AACACTGGTG 
TATTATGGAC 
CTGAAGACCA 

15 GCTAACAGTA 
TTGTCCGTTC 
TTGGTAAAAC 
CACTCAGGTT 
TAACGACTTC 

20 ACTTCACCAA 
CCAGGTAACC 
CTTCCGTGAT 
TCGTTGATAA 
CTTCTAGGTC 

25 AGAGATGGGT 
TTACTTC 



CCCCAATTAC 
GTTTTAGGTC 
TTGGTCAATC 
CTGACCTATT 
GCTAAAGGGG 
CGTAAACATG 
TATCAGTATT 
TACCACGAGA 
CTCAAAAGTT 
GTTTACGTGT 
CAAAAAGATG 
CATCTACCGC 
GTATGCCATC 
GTACTACAAG 



AATTTATGAC 
AGCAGCAGCT 
GAAGGTCT TA 
AGTTCCAGTA 
GTCCAATTGA 
CACCGTCAAG 
AGAGACAGGT 
GTAAAGTTGG 
ATGGAATTGA 
CGCTGGTGTG 
TGCAAGAAGC 
GCGATGGTTT 
TGCGTTAACC 
AAAACGGTAA 
TACACGCTAG 
AGCAGTAGGT 
AGCGTATTAC 



GCGTTAGTTG 
GGGCGATGGT 
AGCGTGGTTT 
GGTGATGCGA 
CGAACAAGGT 
CGCCATCATA 
ATTAAAGTAA 
TCTGTTCGGT 
TTAATAACAT 
GGTGAGCGTA 
GGGTGTTGTT 
ATGGTCAGAT 
GGTCTGACTA 
AGGTAAAGAC 
CCGGTACTGA 
TATCAGCCAA 
TTCAACTCAG 



TAGATGGTAC 50 

GTGGTACGTA 100 

ACCAGTAACA 150 

"CTTTAGGTCG 200" 

CCAGTTAATT 250 

TGATGAGCAA 300 

TTGACTTACT 350 

GGTGCCGGTG 4 00 

CGCTCTTAAG 450 

CTCGTGAAGG 500 

GACGTTGAAA 550 

GAATGAGCCA 600 

TGGCTGAGTA 650 

GTTCTATTAT 700 

AGTATCAGCA 750 

CACTAGCGGA 800 

ACTGGTTCTA 850 
907 



30 2) INFORMATION FOR SEQ ID NO: 341 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 

(B) STRAIN: DSM 4594 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 



GTTCCCTCAG GATGCAGTAC CGAACGTGTA CAATGCTCTT GAGGTAGAAA 50 

ACGGTACCTC CAAACTGGTG CTGGAAGTTC AGCAACAGTT AGGCGGCGGC 100 

GTTGTTCGTT GTATCGCAAT GGGTACCTCA GACGGCCTGC GTCGCGGTCT 150 

GAAAGTGAAC AACCTGGAAC ACCCAATTGA AGTACCGGTA GGTAAAGCGA 200 

50 CTCTGGGTCG TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAAGGT 250 

GAAATCGGCG AAGAAGAACG TCGTGCTATT CACCGTGCTG CGCCTTCTTA 300 

TGAAGAGCTG GCAAACTCCC AGGAATTGCT GGAAACCGGT ATCAAAGTTA 350 

TGGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT. 400 

GGTGCGGGTG TAGGTAAAAC TGTGAACATG ATGGAGCTGA TCCGTAACAT 450 

55 TGCGATCGAG CACTCCGGTT ATTCTGTGTT TGCAGGCGTG GGTGAACGTA 500 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACTGATTC CAACGTTATC 550 

GACAAAGTTT CCCTGGTGTA TGGCCAGATG AATGAGCCAC CAGGTAACCG 600 

TCTGCGCGTT GCACTGACCG GCCTGACCAT GGCGGAAAAA TTCCGTGATG 650 

AAGGTCGTGA CGTACTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG 700 

60 GCCGGTACCG AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG 750 

204 
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TTATCAGCCA ACGCTGGCGG AAGAGATGGG CGCTCTGCAA GAACGTATCA' 800 
CCTCGACCAA AAGTGGTTCT ATCACCTCCG TA 832 



2) INFORMATION FOR SEQ ID NO: 342 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
10 (B) TYPE: Nucleic acid 

_ (C ) ST RANDEDNESS ; Double 

(D.) TOPOLOGY : Linear" 

( i i ) MOLECULE TYPE : Genomic DNA 



40 



15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

25 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

30 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

35 AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYCASCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 



2} INFORMATION FOR SEQ ID NO: 343 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 
55 (B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50 
60 GAAGCTGGTG CTGGAAGTTC AGCAGCAGCT TGGCGGCGGT ATCGTGCGTA 100 

205 
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CCATCGCGAT GGGGTCTTCT GACGGTCTGC GTCGCGGTCT ' GGATGTAAAA* 150 

GATCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCCA CGCTGGGTCG 200 

TATCATGAAC GTCCTGGGCG AACCGGTCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA CGAAGAGTTG 300 

5 TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTTA TTCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCAGTGTT TGCGGGCGTA GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GATAAAGTAT 550 

10 CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 



TGTACTGCTG~TTCGTCGATA~ACATCTATCG~TTACACCCTG-GeeGGTAeGG 700- 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CCGCAGTAGG TTACCAGCCG 750 
ACTCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA CCTCCACCAA 800 
15 AACCGGTTCT ATCACCTCCG - 820 



20 



2) INFORMATION FOR SEQ ID NO: 344 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Salmonella choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

35 TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 
TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT ' 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 250 

40 AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

45 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

50 TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



55 2) INFORMATION FOR SEQ ID NO: 345 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

206 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



3 0- 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 



TTCCC TCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC ^TGGAAGTTCA - GCAGCAGCTT^GGCGGCGGTA - 1:0 0" 

TCGTACGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

15 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

20 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

25 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAG ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 346 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43974 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

50 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

55 GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

60 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 
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CCGGTACGGA AGTTTCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT " 756 
TATCAGCCAA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 
CTCCACCAAA ACCGGTTCTA TCACCTCCG 829 



2) INFORMATION FOR SEQ ID NO: 347 

( i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 817 bases 
(B)_ _ TYPE : Nucle ic aci d 

(C) STRANDEDNESS: Double ~ — — 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 
20 (B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

25 GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 300 

30 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CATGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

35 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

4 0 CGGTTCTATC ACCTCCG 817 



45 



60 



2) INFORMATION FOR SEQ ID NO: 348 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Paratyphi B 
(B) STRAIN: ATCC 8759 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 
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GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA £0 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATT GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

5 CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTGG 400 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450 

10 TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACT.TCTAC_CACGAAA_TGA _CCGACTCCAA COTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA - ~ ~600~ 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

15 TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTC 806 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 349 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 150 

40 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

45 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

50 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACTAAA ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 350 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 823 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhi 
10 (B) STRAIN: ATCC 1074 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

15 TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

20 GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCATGAAAT GACCGACTCC AACGTTATCG 550 

25 ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

30 CTCCACCAAG ACCGGTTCTA TCA 823 



2) INFORMATION FOR SEQ ID NO: 351 



35 



( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Typhimurium 
(B) STRAIN: ATCC 14 028 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TTGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

55 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

60 GCGATCGAGC ACTCCGGTTA CTCAGTGTTT GCGGGCGTAG GGGAACGTAC 500 

210 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TCGTGAGGGT AACGACTTCT 
ATAAAGTATC CCTGGTGTAT 
CTGCGCGTTG CATTGACCGG 
AGGTCGTGAC GTACTGCTGT 
CCGGTACGGA AGTATCCGCA 
TACCAGCCGA CTCTGGCGGA 
CTCCACCAAA ACCGGTTCTA 



ACCACGAAAT 
GGCCAGATGA 
TCTGACCATG 
TCGTCGATAA 
CTGCTGGGCC 
AGAGATGGGC 
TCACCTCCGT 



GACCGACTCC ^AGGTTAT€9 
ACGAGCCGCC GGGAAACCGT 
GCGGAGAAAT TCCGTGACGA 
CATCTATCGT TACACCCTGG 
GTATGCCTTC CGCAGTAGGT 
GTTCTGCAGG AACGTATCAC 
A 



5U 



600 
650 
700 
750 
800 
831 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 352 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 353 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

5 GAAACTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTCGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GACGGTCTGC GTCGCGGTCT GAAAGTGAAC 150 

AACCTGGAAC ACCCGATTGA AGTGCCGGTG GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGCG AACCAATCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTCCTG CGCCAAGCTA CGAAGAGCTG 300 

10 TCCAACTCCC AGGACCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

TTGTCCGTTC GCCAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 



TGGGCAAAAC CGTAAACATG" "ATGGAGCTGA - TCCGTAACAT" CGCGATCGAG "450" 

CACTCCGGTT ATTCCGTGTT TGCGGGCGTG GGCGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTTCTG GACAAAGTAT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGCCGC' CGGGTAACCG TCTGCGCGTT 600 

GCATTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGCCGCGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CCGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

20 GACCGGTTCC ATCACCTCCG 820 



2 ) INFORMATION FOR SEQ ID NO: 354 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

Cvi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia fonticola 

(B) STRAIN: ATCC 29844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 

40 GATGCCGTAC CGAAAGTGTA CAACGCTCTT GAGGTTGCAA ACGGCACCGA 50 

GAAATTGGTG CTGGAAGTTC AGCAACAGCT GGGTGGCGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCA GACGGTCTGC GTCGTGGTCT GGCCGTAACC 150 

GACCTGCAGC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GACATCGGCG 2 50 

45 AAGAAGAACG TTGGGCTATT CACCGCCCTG CGCCAAGCTA CGAAGAGCTG 3 00 

TCCAGCTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTTGG TCTGTTCGGT GGTGCTGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

50 TAACGACTTC TACCACGAAA TGACCGATTC CAACGTACTG GACAAAGTTT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

55 ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACC 816 



60 2) INFORMATION FOR SEQ ID NO: 355 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimes ii 

(B) STRAIN: ATCC 14460 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 

15 

GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 50 

TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GATGGTCTGC GTCGCGGTCT GAAAGTCACA 150 

GACCTGGACC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CTCTGGGCCG 200 

20 TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GATATCGGCG 250 

AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 300 

GCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

25 CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

30 AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACCTCCG TA 822 



35 

2) INFORMATION FOR SEQ ID NO: 356 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 819 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 



ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 50 

AAGCTGGTGC TGGAAGTTCA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 100 

TATCGCGATG GGGACCTCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 150 

55 ATCTGGAACA CCCAATTGAA GTACCGGTAG GTAAAGCTAC CCTGGGCCGT 200 

ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 300 

CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGACCTGATT 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCTGGTGT 4 00 

60 TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 
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ACTCCGGTTA TTCCGTGTTT 
AACGACTTCT ACCACGAAAT 
CCTGGTTTAC GGCCAGATGA 
CTCTGACCGG TCTGACCATG 
5 GTTCTGCTGT TCGTTGATAA 
AGTGTCCGCA CTTCTGGGCC 
CGCTGGCGGA AGAGATGGGC 
ACCGGTTCTA TCACTTCCG 



GCAGGCGTGG GTGAGCGTAC 
GAACGACTCC AACGTACTGG 
ACGAGCCACC GGGTAACCGT 
GCGGAGAAAT TCCGTGACGA 
CATTTACCGT TATACCCTGG 
GTATGCCATC TGCGGTAGGT 
GTTCTGCAAG AACGTATCAC 



PCT/CA00/01150 

TCGTGAGGGT 500 

ACAAAGTATC 550 

CTGCGCGTTG 600 

AGGCCGCGAC 650 

CCGGTACCGA 700 

TATCAGCCAA 750 

CTCTACCAAG 800 
819 



10 



2) INFORMATION FOR SEQ ID NO: 357 

( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 (ii) MOLECULE TYPE: Genomic DNA 



( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 



TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

30 CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150 

GAACAACCTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 2 00 

GTCGTATCAT GAACGTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 250 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 300 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 350 

35 TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 400 

GGCGTAGGTA AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 4 50 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 500 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 550 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

40 CGTTGCGCTG ACCGGTCTGA CCATGGCGGA GAAATTCCGT GACGAAGGCC 650 

GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 750 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 800 

CCAAG 805 

45 



2) INFORMATION FOR SEQ ID NO: 358 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorifera 
60 (B) STRAIN: ATCC 33077 

214 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

5 AACTCTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTGGTTCGTT 100 

GTATCGCTAT GGGCACCTCC GACGGTTTGC GTCGCGGCCT GAAAGTGAAC 150 

GATCTGCAAC ACCCAATCGA AGTCCCGGTT GGCAAGGCAA CGCTGGGTCG 200 

TATCATGAAC GTATTGGGTC AACCAATCGA CATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCAAGCTA CGAAGAACTG 300 

10 TCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTTA TGGACCTGAT 350 

CTGCCCGTTT GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 



TTGGTAAAAC CGTAAACATG ~ATGGAGCTGA~TCCGTAA"CAT — CGCGATCGAG 450 

CACTCCGGTT ATTCAGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTACTG GACAAGGTTT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCCGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCTGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

20 GACCGGTTCT ATCACCTCCG TA 822 



25 



2) INFORMATION FOR SEQ ID NO: 359 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia plymuthica 

(B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

40 GTGTACAACG CTCTTGAGGT AGAAAACGGT GCCAATAAGC TGGTGCTGGA 50 

AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGCCGTATCA TGAACGTATT 200 

GGGTGAACCA ATCGACATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

4 5 CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 300 

CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 350 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 400 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 450 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 500 

50 CGAAATGAAC GACTCCAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

ACCATGGCGG AGAAATTCCG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 650 

TGATAACATC TACCGTTATA CCCTGGCCGG TACCGAAGTG TCCGCACTTC 700 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 750 

55 ATGGGCGTTC TGCAAGAACG TATCACCTCT ACCAAGACCG GTTCTATCAC 800 

CTCCG 805 



60 2) INFORMATION FOR SEQ ID NO: 360 
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( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 
~ " ( B ) STRAIN! ~ATCC~2 7 5 93~ 



15 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 



TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50 

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100 

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 150 

AAAGTTAACG ACCTCGAGCA CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200 

20 GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCAAGCTAC 300 

GAAGAGCTGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 350 

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

25 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550 

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTACCGT TACACCCTGG 700 

30 CCGGTACCGA AGTGTCCGCA CTGCTCGGCC GTATGCCATC TGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC 800 

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 361 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4 06 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(B) STRAIN: LCDC D7172 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361 

CCGTAAACAT GATGGAACTG ATCCGTAACA TCGCCATCGA GCACAGCGGT 50 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

CTACCACGAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 15 0 

55 ACGGTCAGAT GAACGAGCCA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 200 

GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 250 

GTTCGTCGAC AACATCTATC GTTACACCCT GGCCGGTACC GAAGTATCCG 300 

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 350 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 400 

60 GATCAC 406 
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2) INFORMATION FOR SEQ ID NO: 362 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



( l iT MOLECUIiE - TYPE T — Genomic~DNA - — 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Shigella boydii 

(B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

20 TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 200 

TCTGGGCCGT ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 250 

25 AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 300 

GAAGAGCTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

30 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

35 TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACTGGTTCTA TCACCTCCGT A 831 



40 2) INFORMATION FOR SEQ ID NO: 363 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 



GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

60 CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 
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AAGAGCGTTG 
AACTCTCAGG 
TCCGTTCGCT 
GTAAAACCGT 
TCCGGTTACT 
CGACTTCTAC 
TGGTGTATGG 
CTGACCGGTC 
TCTGCTGTTC 
TATCCGCACT 
CTGGCGGAAG 
CG 



GGCTATTCAC 
AACTGCTGGA 
AAGGGCGGTA 
AAACATGATG 
CTGTGTTTGC 
CACGAAATGA 
CCAGATGAAC 
TGACCATGGC 
GTTGACAACA 
GCTGGGCCGT 
AGATGGGCGT 



CGCGCAGCAC 
AACCGGTATC 
AAGTTGGTCT 
GAGCTCATTC 
GGGCGTAGGT 
CCGACTCCAA 
GAGCCGCCGG 
TGAGAAATTC 
TCTATCGTTA 
ATGCCTTCAG 
TCTGCAGGAA 



CTTCCTACGA 
AAAGTTATCG 
GTTCGGTGGT 
GTAACATCGC 
GAACGTACTC 
CGTTATCGAC 
GAAACCGTCT 
CGTGACGAAG 
CACCCTGGCC 
CGGTAGGTTA 
CGTATCACCT 



AGAGCTGTCA 
ACCTGATGTG 
GCGGGTGTAG 
GATCGAGCAC 
GTGAGGGTAA 
AAAGTATCCC 
GCGCGTTGCT 
GTCGTGACGT 
GGTACGGAAG 
TCAGCCGACC 
CCACCAAAAC 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 



—8-0-2— 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 364 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACTGGTTCTA TCACCTCCG 819 



2) INFORMATION FOR SEQ ID NO: 365 

( i ) SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 2 993 0 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 100 

10 CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 150 

GAAC ACCCG A T CGAAG TCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 2 00 

GAACGTACTG GGTGAACCGG TCGACATGAA~AGGCGAGATC — GGTGAAGAAG 250" 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 300 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 350 

15 GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGACAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCTCTG 600 

2 0 ACCGGTCTGA CCATGGCTGA GAAATTCCGT GACGAAGGTC GTGACGTTCT .650 

GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT 700 

CCGCACTGCT GGGCCGTATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 800 

TT 802 

25 



2) INFORMATION FOR SEQ ID NO: 366 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 

TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

45 ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150 

CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 200 

ATGTACTAGG TGAAACAATT GACCTTAAAG AAGAAATTAG TGATTCTGTT 250 

CGCCGCGATC CTATCCATCG TCAAGCACCA GCATTCGATG AACTTTCAAC 300 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TTACTAGCAC 350 

50 CTTATATTAA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT 400 

AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 450 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTACTCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGCCATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

55 ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 650 

TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 700 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 785 

60 
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2) INFORMATION FOR SEQ ID NO: 367 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE; Genomic DNA 



15 



(vi") ORIGINAL SOURCE :~ " " " ~ ~" — " 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 



GAACATAACG AAGTGCCTAA TATTAATAAC GCCTTAGTAC TCGATGTTGA 50 

AAGAGAAGAC GGAACAGTGT CTTTAACTTT AGAAGTAGCT TTACAATTAG 100 

20 GCGATGACGT TGTTCGTACC ATTGCAATGG ATTCAACTGA TGGTGTTAAA 150 

CGTGGTAACG AAGTCAAAGA TACTGGTAAT AGCATTAGCG TACCAGTCGG 200 

AGACGAAACT TTAGGACGTG TCTTCAACGT TCTAGGTGAA ACAATTGATT 2 50 

TAGAAGATAA ACTTGATGAT TCTGCGCGAC GTGACCCTAT ACATAGAGAA 300 

GCGCCAGCGT TTGATCAATT ATCAACTCAA GTTGAAATTT TAGAAACAGG 350 

25 AATTAAAGTT GTTGACTTAT TAGCACCTTA TATTAAAGGT GGTAAAGTTG 4 00 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTTTAAT CCAAGAATTA 450 

ATCAACAACA TCGCTCAAGA ACACGGTGGT ATTTCAGTCT TTGCCGGTGT 500 

AGGTGAACGT ACACGTGAAG GTAACGACTT GTACTATGAA ATGAGCGACA 550 

GTGGTGTAAT CAAGAAAACA GCCATGGTCT TCGGACAAAT GAACGAACCA 600 

30 CCTGGCGCAC GTATGCGTGT TGCTTTATCT GGTTTAACAA TGGCTGAATA 650 

TTTCCGTGAT GAACAAGGAC AAGACGTATT GTTATTCATC GACAATATTT 700 

TCCGTTTCAC ACAAGCCGGT TCAGAAGTTT CTGCCTTACT AGGTCGTTTA 750 

CCATCAGCCG TTGGTTATCA ACCTACATTA GCAACAGAAA TGGGACAATT 800 

ACAAGAACGT ATTACTTCAA CAACAAAAGG ATCAGTTACT TCA 843 

35 



2) INFORMATION FOR SEQ ID NO: 368 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 
50 (B) STRAIN: ATCC 27840 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 

GCTTTGAACA TAATGAAGTT CCTGATATTA ACAATGCCTT ACACATC/3AA 50 

55 GTTCCTAAAG AAGATAGCAC ACTTCATTTA ACTTTAGAAG TTGCACTTCA 100 

ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA ACTGACGGCG 150 

TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT TAGCGTACCT 200 

GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG GAGAAACAAT 250 

CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT CCTATTCATA 300 

60 GACAGGCACC TGGCTTCGAT GAATTATCTA CTAAAGTAGA AATCTTAGAA 350 

220 
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ACAGGTATCA AAGTAGTAGA CTTATTAGCA 
AATTGGATTA TTCGGTGGTG CCGGTGTTGG 
AACTTATCAA TAATATCGCT CAAGAGCATG 
GGTGTTGGTG AACGTACACG TGAAGGTAAC 
5 CGATAGTGGT GTAATTAAGA AAACAGCGAT 
AGCCACCTGG TGCTCGTATG CGTGTTGCAT 
GAATATTTCC GTGATGAAGA AGGCCAAGAC 
TATCTTCAGA TTCACACAAG CTGGTTCTGA 
GTATGCCTTC AGCCGTTGGT TATCAACCAA 
10 CAATTACAAG AACGTATTAG TTCAACTAAT 



PCT/CAOO/01150 

CCTTACATTA AAGGTGGTAA 400" 

TAAGACAGTT TTAATCCAAG 450 

GTGGTATTTC AGTATTCGCC 500 

GACCTTTACT ATGAAATGAG 550 

GGTATTCGGT CAGATGAACG 600 

TATCAGGTTT AACAATGGCA 650 

GTATTATTAT TCATTGATAA 700 

AGTTTCAGCA TTACTTGGAC 750 

CACTTGCTAC TGAAATGGGT 800 

AAAGGTTCTG TTACTTCAA 849 



2) INFORMATION FOR SEQ ID NO: 369 



15 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

30 GAAGTTCCAG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 50 

AGGTACTGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 100 

TAGTACGAAC AATCGCAATG GATTCAACTG ATGGTGTTAA ACGTGGTACA 150 

GAAGTTAGAG ATAGCGGAAA TAGTATCAGC GTACCAGTTG GTAATGAAAC 2 00 

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 250 

35 ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 3 00 

TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 350 

CATTGATTTA TTAGCACCAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 4 00 

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 450 

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGCTGGTG TAGGTGAGCG 500 

40 TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

TTAAAAAGAC AGCTATGGTA TTTGGACAAA TGAACGAACC ACCTGGTGCG 600 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 650 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 700 

CTCAAGCTGG TTCAGAAGTT TCTGCGTTAT TAGGTCGTAT GCCTTCAGCT 750 

4 5 GTTGGTTACC AACCAACGTT AGCAACTGAA ATGGGACAAT TACAAGAACG 800 

TATTACTTCT ACAACTAAAG GTTCAGTAAC 830 



50 2) INFORMATION FOR SEQ ID NO: 370 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermidi's 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 

5 

AATAATGCAT TACACATCGA AGTTCCTAAA GAAGATGGAG CGCTTCAATT 50 

AACATTAGAA GTTGCACTTC AACTAGGTGA CGATGTAGTT CGTACAATTG 100 

CAATGGACTC AACTGACGGC GTTCAAAGAG GAATGGAAGT TAAAGATACA 150 

GGTAGAGACA TAAGTGTACC TGTCGGTGAC GTAACTCTAG GAAGAGTGTT 200 

10 TAACGTACTA GGAGAAACTA TTGACTTAGA TGAAAAAATT GATGATTCAG 250 

_ TACG ACGTGA CCCT ATCCA T AGACAAGCTC CAGGATTCGA CGAATTATCA 300 
ACAAAAGTAG AAATCTTAGA AACTGGTATT _ AAAGTAGTAG "ACTTATTAGC" — ~~350^ 

ACCTTACATA AAAGGTGGTA AAATTGGATT ATTTGGTGGT GCCGGTGTAG 400 

GTAAAACCGT ACTAATCCAA GAACTTATTA ATAACATCGC TCAAGAACAC 450 

15 GGTGGTATCT CAGTATTCGC TGGTGTTGGT GAACGTACAC GTGAAGGTAA 50 0 

TGATCTTTAC TATGAAATGA GTGACAGTGG TGTTATCAAG AAAACTGCAA 550 

TGGTCTTTGG TCAAATGAAT GAGCCACCTG GTGCACGTAT GCGTGTAGCA 600 

TTATCCGGAT TAACAATGGC CGAATATTTC CGAGATGAAG AAGGCCAAGA 650 

TGTGTTATTA TTCATTGATA ACATTTTCAG ATTCACTCAA GCTGGTTCAG 700 

20 AAGTTTCTGC GTTATTAGGT CGTATGCCAT CAGCTGTTGG TTATCAACCT 750 

ACACTTGCTA CAGAAATGGG TCAATTACAA GAACGTA 787 



25 2) INFORMATION FOR SEQ ID NO: 371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 



GAAGTACCTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

TGGTACTTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 100 

TTGTTCGTAC AATTGCTATG GATTCAACAG ATGGTGTTCA ACGTGGTATG 150 

GAAGTTCAGA ACACTGGAAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 2 00 

45 TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 2 50 

AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 300 

TTTGACGAAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 350 

TGTAGACTTA TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 400 

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 4 50 

50 ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

TACACGTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC ACCTGGTGCA 600 

CGTATGCGTG TGGCACTTTC TGCATTGACA ATGGCTGAGT ATTTCCGTGA 650 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 700 

55 CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 750 

GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TACAAGAACG 800 

TATTACATCA ACGAATAAAG GTTCAGTAAC 830 



60 
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2) INFORMATION FOR SEQ ID NO: 372 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE; Genomic DNA 

10 

_( v i.)_QR IGINAL SOURCE : 



(A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 

TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT TAAATTAACA TTAGAAGTTG CATTGCAACT 100 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

20 AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 200 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 

25 TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

30 TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 



35 



2) INFORMATION FOR SEQ ID NO: 373 

(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 175 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 



TCGAACATAA TGAAGTCCCT 
CCCAAAAATG ATGGCACATT 

55 AGGTGATGAT GTTGTTCGTA 
AACGTGGTAT GCAAGTTGTG 
GGTGAAGAAA CACTTGGACG 
TTTAAACGAA AAAATAGATA 
GTCAACCTAA TTTTGATGAA 

60 GGTATTAAAG TTGTAGACTT 



GAAATTAATA ATGCCCTAAT 
CAAATTAACA TTAGAAGTTG 
CTATTGCAAT GGATTCAACT 
AATACTGGTA AAGATATTAG 
TGTGTTTAAC GTTTTAGGAG 
GTTCTGTTAG ACGTGATCCA 
TTATCTACTG AAGTAGAAAT 
ATTAGCACCT TATATTAAAG 
223 



TATTGAAGTA 50 

CATTGCAACT 100 

GATGGTGTTC 150 

TGTTCCTGTA 2 00 

AAACAATAGA 250 

ATTCATCGTC 300 

TCTTGAAACA 350 

GTGGTAAGAT 400 
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TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 



10 



2) INFORMATION FOR SEQ ID NO: 374 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
25 (B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 

ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 50 

30 AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100 

CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150 

GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 200 

GAAACATTAG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 250 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 300 

35 CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTCTTGA AACTGGTATT 3 50 

AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 400 

GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 4 50 

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 500 

GAACGTACAC GTGAAGGTAA TGACTTATAT TATGAAATGA GCGATAGTGG 550 

4 0 CGTAATTAAG AAAACAGCGA TGGTATTTGG CCAAATGAAT GAACCACCTG 600 

GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 650 

CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTATCGATA ATATATTCCG 700 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 750 

CTGCCGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

45 GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 835 



2) INFORMATION FOR SEQ ID NO: 375 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Staphylococcus saprophyticus 
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(B) STRAIN: ATCC 15305 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375 

5 GAGCACAATG AAGTTCCAGA AATTAACAAT GCCTTAGTCG TAGACGTTGA 50 

AAGAGATGAA GGTACAGTAT CTCTTACATT AGAAGTGGCA TTACAACTTG 100 

GCGATGATGT CGTACGTACA ATTGCAATGG ATTCTACTGA TGGTGTTAAA 150 

CGTGGTACAG AAGTTCGAGA TAGCGGAGAT AGCATCAGTG TTCCAGTTGG 200 

TGATGCTACG TTAGGACGTG TGTTTAATGT TCTTGGTGAT ACAATTGACT 250 

10 TAGACGAGAA GCTTGATACT TCTGTCAAAC GTGATCCAAT TCATAGAGAA 300 

G CACCTGCAT TC GATCAATT ATCAACAAAA GTTGAAATCT TAGAAACAGG 350 

TATTAAAGTA ATTGATTTAC TTGCACCATA^TATTAAAGGT "GGTAAAATCG 400~ 

GTTTATTCGG TGGCGCTGGT GTAGGTAAAA CAGTATTAAT TCAAGAATTA 450 

ATTAATAATA TAGCTCAAGA ACATGGTGGT ATTTCAGTAT TTGCCGGCGT 500 

15 AGGTGAACGT ACGCGTGAAG GTAATGACTT ATACTACGAA ATGAGTGATA 550 

GTGGTGTTAT TAAGAAAACA GCTATGGTCT TCGGACAAAT GAATGAGCCA 600 

CCTGGTGCGC GTATGCGTGT TGCTTTATCA GGCTTAACAA TGGCTGAACA 650 

CTTCCGTGAT GTACAAGGAC AAGATGTTTT ACTATTTATT GATAACATAT 700 

TCAGATTTAC GCAAGCTGGT TCAGAAGTAT CAGCACTATT AGGTCGTATG 750 

20 CCATCAGCCG TTGGTTATCA ACCTACCCTT GCTACTGAAA TGGGTCAATT 800 

ACAAGAACGT ATTACATCAA CAACTAAAGG ATCTGTAACG TC 842 



25 2) INFORMATION FOR SEQ ID NO: 376 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA . 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 



TGATGAACTG CCTAAGATTA ATAACGCATT AGTGCTAGAT GTACCTAAGA 50 

AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 100 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 150 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 200 

45 AAGATACGTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AATCGATAAT 250 

GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC ATAGACAACC 300 

ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 350 

TCAAAGTAGT AGACTTATTA GCACCATATA TCAAAGGTGG TAAAGTTGGT 400 

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 450 

50 TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 500 

GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 700 

55 CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GCCGTATGCC 750 

ATCAGCCGTT GGTTACCAAC CAACATTGGC AACAGAAATG GGTCAATTAC 800 

AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 



60 
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2) INFORMATION FOR SEQ ID NO: 377 



PCT/CAOO/01150 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIG INAL S OURCE: 

(A) ORGANTSM: S£aphyXttoc~C\is-warneri 

(B) STRAIN: ATCC 27836 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 

CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 50 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 100 

ATGATGTTGT ACGTACAATT GCAATGGATT CAACTGATGG TGTTCAAAGA 150 

20 GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 200 

TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 250 

AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 300 

CCAGGTTTCG ATGAATTATC TACTGAAGTA GAAATCTTAG AAACAGGTAT 350 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 400 

25 TATTCGGTGG TGCCGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 450 

AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 550 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

30 CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 700 

GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 750 

TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA 800 

AGAACGAATT ACATCTACAA ATAAAGGTTC TGTAACATCT A 841 

35 



2) INFORMATION FOR SEQ ID NO: 378 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 



TTTAACACGA ATGAACCGCT 
CAAAGACAGT GAGAAAAAAC 

55 TTGGTGAAGG CCTCGTTCGT 
ACACGTGGTC TAGAAGTTCT 
TGGTAAAGAA ACGCTTGGAC 
ATCTTGAAGA ACCATTTGGA 
AGTGCCCCAA CTTTTGATGA 

60 AGGGATTAAA GTTATCGACC 



TCCTGAGATA AATAATGCAC 
ATAAAATCGT TCTTGAAGTA 
ACCATTGCTA TGGAATCAAC 
TGATACAGGC CGTGCAATCA 
GTGTCTTCAA CGTTCTTGGT 
GAAGATGCAG AACGTCACCC 
ATTATCAACG TCAACAGAAA 
TACTTGCCCC TTACTTAAAA 
226 



TTGTTGTTTA 50 

GCTCTTGAAC 100 

TGATGGTTTG 150 

GTGTACCAGT 200 

GATGCTATCG 250 

CATTCATAAG 300 

TCCTTGAAAC 350 

GGAGGGAAGG 4 00 
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TTGGACTTTT CGGTGGTGCC GGAGTTGGTA AGACCGTTCT TATCCAAGAG" 457) 

TTGATTCATA ACATTGCTCA AGAGCATGGT GGTATTTCAG TATTTACCGG 500 

AGTTGGTGAA CGTACACGTG AAGGTAATGA CCTCTATTGG GAAATGAAAG 550 

AATCAGGCGT TATTGAAAAA ACAGCTATGG TATTTGGTCA GATGAATGAG 600 

CCACCTGGTG CACGTATGCG TGTAGCCCTT ACTGGTTTGA CAATCGCTGA 650 

ATATTTCCGT GATGTTGAAG GACAGGACGT GCTTCTCTTT ATTGATAACA 700 

TTTTTCGTTT CACACAAGCA GGTTCTGAAG TTTCAGCTCT TCTTGGACGT 750 

ATGCCATCAG CCGTTGGTTA TCAACCAACC TTGGCAACTG AAATGGGTCA 800 

ATTGCAAGAA CGTATCACGT CAACTAAAAA AGGTTCTGTT ACATCA 846 



2) INFORMATION FOR SEQ ID NO: 379 

15 (i) SEQUENCE CHARACTERISTICS: • 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
25 (B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

30 AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTA CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

35 AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

40 ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

45 TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



50 



2) INFORMATION FOR SEQ ID NO: 380 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus agalactiae 
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(B) STRAIN: ATCC 13813 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 

5 TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

10 TCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGGACC-AT-C-GTTTGATGAA _TTATCMCAT^ATCAGAAAT_CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400" 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

15 GTAGGAGAAC GTACTCGTGA AGGGAATGAO CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

20 TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



25 2) INFORMATION FOR SEQ ID NO: 381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GTGCAATTAG TGTGCCGGTT 200 

45 GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC ATTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

50 TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

55 CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCA 845 
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2) INFORMATION FOR SEQ ID NO: 382 



PCT/CAOO/01150 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 845 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(.vi )-ORIGINAL_SOURCE : 



(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

20 CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

25 TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

30 TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845 

35 

2) INFORMATION FOR SEQ ID NO: 383 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



4 5 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC SS1073 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383 



TTGCAAGTGG CGACAAACTT 
AAAAATGGCG ATAAGTCACA 

55 TGGTGACGGC CTCGTTCGTA 
CACGTGGTTT GGAAGTATTA 
GGTAAGGATA CTTTGGGTCG 
CCTTGAAGAG CCTTTTGCAG 
AAGCACCATC GTTTGATGAA 

60 GGTATTAAAG TTATTGACTT 



CCTGAGATTA ATAATGCATT 
AAAAGTAGTA CTTGAAGTTG 
CAATCGCTAT GGAATCAACT 
GATACTGGTC GCGCAATTAG 
TGTCTTCAAC GTTCTTGGAG 
AAGATGCAGA ACGTCAACCA 
TTATCAACAT CATCAGAAAT 
ATTAGCACCT TACTTAAAAG 
229 



GATTGTCTAT 50 

CTCTTGAACT 100 

GATGGGCTTA 150 

TGTGCCGGTT 200 

ATGCTATTGA 250 

ATCCATAAAA 300 

CTTAGAAACT 350 

GTGGTAAAGT 4 00 
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TGGACTTTTC 
TAATCCACAA 
GTAGGAGAAC 
ATCTGGCGTT 
CACCAGGAGC 
TACTTCCGTG 
CTTCCGTTTC 
TGCCTTCAGC 
TTACAAGAGC 



GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 
CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 
GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 
ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 
ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 
ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 
ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 
CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 
GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 



450 
500 
550 
600 
650 
700 
750 
800 
845 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 384 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 

TTGCTAGTGG GGACAAACTT CCAGAGATTA ATAATGCATT GATTGTTTAT 50 

AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 100 

TGGTGACGGT ATGGTGCGAA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGTACCAGTA 200 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT GTACTTGGAG AAACCATTGA 250 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT TCTTGAAACT 350 

GGTATCAAGG TAATTGACCT TCTTGCCCCT TACCTTAAAG GTGGTAAAGT 400 

TGGACTTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 500 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 550 

ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 600 

CGCCTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 700 

CTTCCGTTTC ACTCAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 750 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATCTGTTA CTTCT 845 



2) INFORMATION FOR SEQ ID NO: 385 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp. egui 
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(B) STRAIN: ATCC 9528 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 

5 TTGCGAGTGG GGACAAACTA CCAGAGATTA ATAATGCGTT GATAGTTTAT 50 

AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 100 

TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 150 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCCATTAG TGTACCAGTT 2 00 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 250 

10 CCTAGAAGCA CCATTTGCAG ATGATGTTAA TCGTGAACCG ATCCATAAAA 300 

AA G CACCAGC CTTTGATGAA TTGTCAACAT CATCAGAAAT TCTTGAAACA 350 

GGTATCAAGG TTATTGACCT GCTTGCCCCT" TACTTAAAGG~GTGGTAAGGT "400 

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTACCGGT 500 

15 GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550 

ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 600 

CACCAGGAGC CCGTATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 650 

TATTTCCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

20 TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 800 

TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 846 



25 2) INFORMATION FOR SEQ ID NO: 386 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 



TTGCAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 50 

AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 100 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC GACCAATTTC TGTTCCAGTT 200 

45 GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATACCATTGA 250 

TTTGGATACT CCATTCGGCG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 350 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 400 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

50 TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 500 

GTTGGGGAAC GAACTCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 550 

ATCTGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 600 

CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 650 

TATTTCCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 700 

55 TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 750 

TGCCATCAGC TGTTGGTTAC CAACCAACCT TGGCTACTGA AATGGGGCAA 800 

TTACAAGAAC GTATTACATC AACGAAAAAA GGTTCTGTTA CCTCAA 846 



231 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
2) INFORMATION FOR SEQ ID NO: 387 



PCT/CA00/01150 



10 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Streptococcus salivafius 

(B) STRAIN: ATCC 7073 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 

GCAGCTGGTG ATAAACTTCC TGAGATTAAC AATGCATTGG TCGTTTATAC 50 

TGATGAACAA AAGTCTAAAC GTATCGTGCT CGAAGTAGCT CTTGAACTTG 100 

GAGAAGGTGT GGTTCGTACC ATTGCCATGG AATCTACTGA TGGATTGACT 150 

20 CGTGGACTAG AAGTTCTGGA CACTGGTCGT CCAATCAGCG TTCCTGTTGG 200 

TAAAGATACC CTTGGACGTG TCTTTAACGT TCTTGGTGAT ACCATTGACT 250 

TGGAAGCACC TTTTGCAGAC GATGCAGAGC GTGAACCAAT TCACAAAAAA 300 

GCACCAACTT TCGATGAATT GTCAACATCT ACTGAAATCC TTGAAACAGG 350 

GATTAAAGTT ATCGACTTGC TAGCCCCTTA CCTTAAGGGT GGTAAAGTCG 400 

25 GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTCTTAT TCAAGAGTTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCCGTGT TTACAGGTGT 500 

TGGTGAACGT ACACGTGAAG GTAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAAAT GAACGAACCA 600 

CCTGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCGGAATA 650 

30 CTTCCGTGAT GTCGAGGGTC AAGACGTTCT TCTCTTCATC GATAACATCT 700 

TCCGTTTCAC TCAAGCAGGT TCTGAGGTTT CTGCCCTTCT TGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCTACACTT GCTACTGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAAAAAGG TTCTGTTACA TCT 843 



35 



50 



2) INFORMATION FOR SEQ ID NO: 388 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC. 43765 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 



TTGCAGCAGA AGATAAACTT CCTGAGATTA ACAACGCACT CGTTGTATAT 50 

AAAAATGATG ATTCCAAACA AAAAGTCGTG CTTGAAGTGG CTTTGGAACT 100 

55 TGGTGATGGC GTTGTACGGA CCATTGCCAT GGAATCAACG GATGGATTGA 150 

CACGTGGGAT GGAAGTTCTC GATACAGGTC GTCCCATCTC TGTTCCAGTC 200 

GGTAAAGAAA CGCTGGGTCG TGTCTTCAAT GTGTTGGGAG ATACCATTGA 250 

CCTTGAAGAG TCTTTTCCGG CAGATTTTGA ACGTGAGCCT ATCCATAAGA 300 

AAGCGCCGGC TTTTGACGAA TTATCTACTT CAAGCGAAAT TTTGGAAACA 350 

.60 GGGATTAAGG TTATCGACCT CCTAGCACCT TATCTAAAAG GTGGTAAGGT 400 
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TGGTCTCTTC GGTGGTGCTG GTGTTGGTAA 
TGATTCACAA TATTGCCCAA GAACACGGTG 
GTTGGCGAGC GTACCCGTGA AGGGAACGAT 
ATCAGGTGTT ATTGAAAAAA CGGCCATGGT 
5 CACCAGGAGC CCGTATGCGT GTTGCTCTTA 
TACTTCCGTG ATGTGGAAGG GCAGGATGTT 
CTTCCGTTTC ACACAGGCTG GTTCAGAAGT 
TGCCATCAGC CGTTGGTTAT CAGCCAACAC 
TTGCAGGAGC GTATTACCTC AACCAAGAAG 



PCT/CAOO/01150 

AACCGTTCTT ATCCAAGAAT 450 

GTATCTCTGT ATTTACCGGA 500 

CTTTACTGGG AAATGAAAGA 550 

ATTTGGTCAG ATGAATGAGC 600 

CTGGTTTGAC TATTGCGGAA 650 

CTTCTGTTCA TCGATAATAT 700 

GTCTGCCCTC TTGGGTCGTA 750 

TTGCGACGGA GATGGGACAA 800 

GGTTCTGTTA C 841 



2) INFORMATION FOR SEQ ID NO: 389 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uJberis 
25 (B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 

GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

30 AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 100 

GGGACGGAAT GGTTCGTACA ATCGCTATGG AATCAACTGA TGGGCTTACA 150 

CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TACCAGTCGG 200 

AAAAGAAACT TTGGGTCGTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 250 

TGGATGAACC ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 300 

35 GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 350 

AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 400 

GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 450 

ATTCATAATA TTGCACAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 550 

40 CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAACCA 600 

CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 650 

TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 700 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCCCTATT GGGTCGTATG 750 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 800 

45 GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA 844 



2) INFORMATION FOR SEQ ID NO: 390 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Tatumella ptyseos 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 

5 TTCCCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 50 

TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 150 

AAGGTGACCG ATCTGCAAAA ACCGATTCAG GTACCGGTCG GTAAAGCGAC 200 

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250 

10 ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTGAACTC TATTCACCGT 300 

GCAGCGCC AA GCT ACGAAGA A CTGGC TAAC _TCTACTGAGC_ TGCTGGAAAC 350 

GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 400' 

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 450 

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 500 

15 GGTGGGTGAG CGTACCCGTG AAGGTAACGA- CTTCTACCAC GAAATGACCG 550 

AGTCTAACGT TCTGGATAAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600 

CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650 

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GATAACATCT 700 

ATCGTTATAC CCTGGCCGGT ACTGAAGTTT CAGCACTGCT GGGTCGTATG 750 

20 CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 800 

TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 896 



25 2) INFORMATION FOR SEQ ID NO: 391 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 



TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

TGGTAGTGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 100 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

45 GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 2 50 

ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 450 

50 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG 550 

ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTACCGT TACACCCTGG 700 

55 CGGGTACTGA AGTATCTGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 800 

CTCAACCAAA ACCGGTTCTA TCACCTCCG 829 



60. ) 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia bercovieri 

(B) STRAIN: ATCC 43970 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 



CGAATTCCCC 
AAGGCACAGC 
GGTGTTGTTC 

20 GTTGAAAGTC 
CAACTCTGGG 
GGTCCTATCG 
TTACGAAGAG 
TAATGGATCT 

25 GGTGGTGCGG 
TATTGCGATT 
GTACTCGTGA 
CTGGACAAAG 
CCGTCTGCGC 

30 ATGAAGGTCG 
CTGGCCGGTA 
AGGCTATCAG 
TCACTTCCAC 



CAAGACGCTG 
TCAGAAGCTG 
GTTGTATCGC 
ATCAACCTGG 
CCGTATCATG 
GTGAAGAAGA 
CTTGCCAGCT 
GATTTGTCCG 
GTGTGGGTAA 
GAGCACTCAG 
GGGTAACGAC 
TATCCTTGGT 
GTTGCACTGA 
TGATGTACTG 
CAGAGGTATC 
CCAACGCTGG 
CAAGACGGGT 



TACCAAAAGT 
GTGCTGGAAG 
AATGGGCTCT 
AACACCCAAT 
AACGTATTGG 
GCGTTGGGCA 
CGCAAGATCT 
TTCGCTAAGG 
AACAGTCAAC 
GTTATTCTGT 
TTCTACCACG 
TTATGGCCAG 
CCGGCTTGAC 
TTATTCATCG 
TGCACTGCTA 
CAGAAGAGAT 
TCAATCACCT 



GTACAACGCC 
TTCAGCAACA 
TCCGATGGTC 
TGAAGTGCCG 
GTGACCCAAT 
ATCCACCGCG 
GTTAGAAACC 
GCGGTAAAGT 
ATGATGGAGC 
ATTTGCCGGT 
AGATGACTGA 
ATGAATGAGC 
CATGGCGGAG 
ATAACATCTA 
GGTCGTATGC 
GGGTGTGTTG 
CCGTA 



CTTGAGGTTG 50 

GCTGGGCGGT 100 

TGAGCCGCGG 150 

GTGGGTAAAT 200 

CGACATGAAA 250 

AAGCGCCTTC 300 

GGTATCAAGG 350 

CGGTCTGTTC 400 

TGATTCGTAA 450 

GTGGGTGAGC 500 

CTCCAACGTT 550 

CACCAGGTAA 600 

AAATTCCGTG 650 

TCGTTATACC 700 

CATCAGCGGT 750 

CAGGAACGTA 800 
835 



35 



2) INFORMATION FOR SEQ ID NO: 393 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTGGG CGGTGGTGTT GTTCGTTGTA 100 

55 TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 200 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 250 

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC 300 

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 350 

60 TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 
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GTAAAACGGT AAACATGATG GAGCTTATTC 
TCAGGTTATT CCGTATTTGC TGGCGTGGGT 
CGACTTCTAC CACGAGATGA CTGACTCCAA 
TGGTTTATGG CCAAATGAAT GAGCCACCAG 
CTGACCGGCT TGACCATGGC GGAGAAATTC 
ATTGCTGTTC ATCGATAACA TCTATCGCTA 
TTTCCGCACT GCTGGGTCGT ATGCCATCTG 
CTGGCAGAAG AGATGGGTGT GTTGCAGGAA 
GGGTTCAATC AC 



GTAACATTGC GATTGAGCAC 
GAGCGTACTC GTGAGGGTAA 
CGTTCTGGAC AAAGTATCCT 
GTAACCGTCT GCGCGTTGCA 
CGTGATGAAG GTCGTGACGT 
TACCTTAGCC GGTACGGAAG 
CCGTAGGTTA CCAGCCAACG 
CGTATTACTT CCACCAAGAC 



450 
500 
550 
600 
650 
700 
750 
800 
812 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 394 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 

AAAGTGTACA ACGCCCTTGA GGTTGAAGGT ACTGCTGAGA AGTTAGTACT 50 

GGAAGTTCAG CAACAGCTGG GCGGTGGTGT TGCTCGTTGT ATCGCCATGG 100 

GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGTCAA CCTGGAACAC 150 

CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 200 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 250 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 300 

GATCTGTTAG AAACCGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 350 

TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 400 

TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAGCA CTCAGGTTAT 450 

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 550 

GCCAGATGAA TGAGCCACCA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 600 

CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 650 

CATCGATAAC ATCTATCGTT ATACCTTGGC CGGTACGGAA GTATCCGCAC 700 

TGCTGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 750 

GAGATGGGTG TGTTGCAGGA ACGTATTACT TCCACCAAGA CGGGTTCAAT 800 

CA 802 



2) INFORMATION FOR SEQ ID NO: 395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 

236 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395 

5 GCTGTACCAA GAGTGTACAA CGCCCTTGAG GTTGAAGGCA CTGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTAGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GCGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCGGTTGGT AAATCAACTC TGGGCCGTAT 200 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 250 

10 AAGAGCGTTG GGCAATCCAC CGCGAAGCGC CTTCTTACGA AGAGCTTGCC 300 

AGCTCAC AAG AT TTGTTAGA_ AACCGGTATC AAAGTAATGG ACTTGATTTG 350 

CCCGTTCGCT AAGGGCGGTA AAGTGGGTCT" GTTCGGTGGT"GCGGGTGTAG 400~ 

GTAAAACAGT AAACATGATG GAGCTTATTC GTAACATCGC GATTGAGCAC 450 

TCAGGTTATT CTGTATTTGC TGGTGTGGGT GAGCGTACTC GTGAGGGTAA 500 

15 CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTGTATGG CCAGATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGTTGTTC ATCGATAACA TCTATCGCTA TACCTTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACG 750 

20 CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACGT CCACCAAGAC 800 

GGGTTC 806 



25 2) INFORMATION FOR SEQ ID NO: 396 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 806 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 50 

GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 150 

CTGGAACACC CGATCGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 200 

45 CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 250 

AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 300 

AGCTCACAAG ATCTGTTAGA AACCGGTATC AAGGTTATGG ACCTGATTTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

50 TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAGCGTACCC GTGAGGGTAA 500 

TGACTTCTAC CATGAAATGA CTGACTCCAA CGTTTTGGAC AAAGTATCCT 550 

TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700 

55 TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGCCAACA 750 

CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACTAAGAC 800 

GGGTTC 806 



60 
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( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM": Yersinia rohdei 

(B) STRAIN: ATCC 43380 



15 (xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 397 

TTCCCCCAAG ACGCTGTACC AAAAGTGTAC AACGCCCTTG AGGTTGAAGG 50 

TGCAGCTGAG AAGCTTGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TTGTTCGTTG TATCGCAATG GGCTCTTCCG ATGGTTTGAG CCGTGGGTTG 150 

20 AAAGTTATCA ACCTGGAACA CCCAATTGAA GTGCCAGTTG GTAAATCAAC 200 

TCTGGGCCGT ATCATGAACG TATTGGGTGA CCCAATCGAC ATGAAAGGCC 250 

CTATCGGTGA AGAAGAGCGT TGGGCAATCC ACCGTGAAGC GCCTTCTTAC 300 

GAAGAGCTTG CCAGCTCGCA AGATCTGTTA GAAACCGGTA TCAAGGTAAT 350 

GGATCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

25 GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCAGGTTA TTCTGTATTT GCCGGGGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACTGACTCC AACGTTCTGG 550 

ACAAAGTATC CTTGGTTTAT GGCCAGATGA ATGAGCCACC AGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CTTGACCATG GCGGAAAAAT TCCGTGATGA 650 

30 AGGCCGTGAC GTATTGCTGT TCATCGATAA CATTTATCGT TATACCCTAG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGC . 750 

TATCAGCCAA CACTGGCAGA AGAGATGGGT GTGTTGCAGG AACGTATTAC 800 

TTCCACTAAG ACGGGTTCAA TCACCTCCG 829 



35 



2) INFORMATION FOR SEQ ID NO: 398 

(i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 98 



ATGCCGTACC GCGCGTGTAC 
AAACTGGTGC TGGAAGCTCA 

55 TATCGCCATG GGTTCTTCCG 
ACCTCGAGCA CCCGATCGAA 
ATCATGAACG TCCTGGGTCA 
AGAAGAGCGT TGGGCTATCC 
CCAGCTCTCA GGAACTGCTG 

60 TGCCCGTTCG CTAAGGGTGG 



GATGCTCTTG AGGTACAAAA 
GCAGCAGCTC GGCGGCGGTA 
ACGGTCTGCG TCGTGGTCTG 
GTCCCGGTAG GTAAAGCAAC 
GCCGATCGAC ATGAAAGGCG 
ACCGCGCAGC ACCTTCCTAT 
GAAACCGGTA TCAAAGTAAT 
TAAAGTCGGT CTGTTCGGTG 
238 



TGGTAACGAG 50 

TCGTGCGTAC 100 

GAAGTTAAAG 150 

CCTGGGCCGT 200 

ACATCGGTGA 250 

GAAGAGCTGT 300 

GGATCTGATC 350 

GTGCGGGTGT 400 
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AGGTAAAACT GTAAACATGA 
ACTCCGGTTA CTCTGTGTTT 
AACGACTTCT ACCACGAAAT 
CCTGGTGTAC GGCCAGATGA 
5 CGCTGACCGG CCTGACCATG 
GTTCTGCTGT TCGTCGATAA 
AGTATCCGCA CTGCTGGGTC 
CTCTGGCGGA AGAGATGGGT 
ACCGGTTCTA TCACCTCCG 



TGGAGCTTAT CCGTAACATC 
GCAGGCGTGG GTGAACGTAC 
GACCGACTCC AACGTACTGG 
ACGAGCCGCC GGGAAACCGT 
GCTGAGAAAT TCCGTGACGA 
CATCTACCGT TATACCCTGG 
GTATGCCTTC TGCGGTAGGT 
GTTCTTCAGG AACGTATCAC 



PCT/CAOO/01150 

UCGATTGAGC ' 45*0' 

TCGTGAGGGT 500 

ATAAAGTATC 550 

CTGCGCGTTG 600 

AGGCCGTGAC 650 

CCGGTACGGA 700 

TATCAGCCAA 750 

CTCTACCAAA 800 
819 



2) INFORMATION FOR SEQ ID NO: 399 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytica 
25 (B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 

AAGCTTAAGG CTGAGCGAGA GCGAGGTATC ACCATTGATA TCGCTCTCTG 50 

30 GAAGTTCCAG ACCCCTAAGT ACTACGTCAC CGTTATTGAT GCTCCCGGTC 100 

ACCGAGATTT CATCAAGAAC ATGATTACCG GTACTTCCCA GGCTGACTGC 150 

GCCATCCTCA TCATTGCTGG TGGTGTTGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGT CAGACCCGAG AGCACGCCCT GCTCGCTTTC ACCCTCGGTG 250 

TTAAGCAGCT CATTGTTGCT ATCAACAAGA TGGACTCCGT CAAGTGGTCT 300 

35 CAGGATCGAT ACCTCGAGAT TTGCAAGGAG ACTGCCAACT TCGTCAAGAA 350 

GGTCGGTTAC AACCCCAAGG CTGTCCCCTT CGTCCCCATT TCCGGATGGA 400 

ACGGTGACAA CATGATCGAG CCCTCTACCA ACTGTGACTG GTACAAGGGA 450 

TGGACCAAGG AGACCAAGGC CGGCGAGATC AAGGGTAAGA CCCTCCTCGA 500 

GGCCATTGAT GCCATTGAGC CCCCCGTGCG ACCCCACGAC AAGCCCCTCC 550 

40 GACTTCCCCT CCAGGATGTC TACAAGATCG GTGGTATCGG CACAGTGCCC 600 

GTTGGCCGAG TCGAGACCGG TGTTATCAAG GCCGGTATGG KTGTTACCTT 650 

CGCTCCCGCC AACGTGACCA CTGAGGTCAA GTCTGTCGAG ATGCACCACG 700 

AGATCCTCCC CGATGGAGGT TTCCCCGGTG ACAACGTCGG TTTCAACGTC 750 

AAGAACGTTT CCGTCAAGGA TATCCGACGA GGTAACGTTG CTGGTGACTC 800 

45 CAAGAACGAC CCCCCCAAGG GCTGCGACTC TTTCAACGCT CAGGTCATTG 850 

TTCTTAACCA CCCCGGTCAG ATCGGTGCTG GTTACGCTCC CGTCCTTGAT 900 

TGCCACACTG CCCACATTGC TTGCAAGTTC GACACCCTGA TCGAGAAGAT 950 

CGACCGACGA ACCGGTAAGA AGATGGAGGA CTCCCCCAAG TTCATCAAGT 1000 

CTGGTGATGC TGCCATTGTC AAGATGGTTC CCTCCAAGCC CATGTGTGTC * 1050 

50 GAGGCCTTCA CTGAGTACCC CCCTCTTGGT CGATTCGCCG TCCGAGA 1097 



2) INFORMATION FOR SEQ ID NO: 400 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400 

10 CAAGCTTAAG GCTGAACGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT 50 
GGAA GTTCGA GACTCCCAAG TACCACGTTA CCGTCATTGA TGCCCCTGGC 100 



CATCGTGATT TCATCAAGAA CATGATTA"CT"GGTACTTCCC~AAGCTGACTG~ 15 0" 

CGGTATCTTG ATTATTGCTG CTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CCAAGGATGG TCAAACCCGT GAACACGCTT TGCTTGCTTT CACCCTTGGT 250 

15 GTCCGTCAAT TGATTGTCGC TATCAACAAG' ATGGATTCCA CCAAGTACTC 300 

TGAGGCCCGT TACAACGAAA TTGTCAAGGA AGTCTCCACC TTCATCAAGA 350 

AGATTGGTTT CAACCCCAAG TCCGTTCCTT TCGTCCCTAT CTCTGGCTGG 4 00 
AACGGTGACA ACATGTTGGA GGARTCCACC AACATGCCTT GGTTCAAGGG m 450 

ATGGAACAAG GAGACTAAGG CTGGTGCCAA GACYGGCAAG ACCCTTCTTG 500 

20 AAGCCATTGA CAACATTGAT CCCCCTGTTC GTCCTTCCGA CAAGCCCCTT 550 

CGTCTTCCCC TTCAAGATGT CTACAAGATC GGTGGTATTG GTACAGTTCC 600 

TGTCGGTCGT GTTGAGACTG GTGTCATCAA GCCTGGTATG GTTGTCACCT 650 

TCGCTCCCGC TAACGTCACC ACTGAAGTCA AGTCCGTYGA AATGCACCAC 700 

GAGCAACTTG CTGAAGGTGT TCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

25 GAACGTTTCC GTCAAGGATA TCCGCCGTGG TAACGTYTGC TCTGACTCCA 800 

AGAACGACCC CGCCAAGGAA TCCGCTTCCT TCACCGCTCA AGTTATTGTC 850 

TTGAACCACC CTGGTCARAT TGGTGCTGGT TACTCTCCTG TCTTGGATTG 900 

CCACACTGCT CACATTGCAT GCAAGTTCTY TKAGCTTCTT KAGAAGATCG 950 

ATYGTCGTTC CGGTAAGTAA ATANTTTGGT TTRGGATATG GGTATTGGGC 1000 

30 TTAATCTYTG GATTTTGCCT CAATTGCTCC TTCCTTGATC TTTCTCGATT 1050 

ACTTTTTGAT CATTTGCTAA TCCAAACCCT TTCCATTTYA TTGAAAACAG 1100 

GTAAGAAGTT GGAAGACTCC CCCAAGTTCG TCAAGWSYGG TGACTCTGCY 1150 

ATCGTCAAGA TGGTTCCTTC CAAGCCCATG TGCGTTGAAG CCTACACTGA 12 00 

ATATCCTCCT CTTGGTCGTT TCGCTGTCCG TGA 1233 

35 



2) INFORMATION FOR SEQ ID NO: 401 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 
50 (B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 

CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 10 0 

TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 200 

GGCCGACTGC GCTATTCTCA TCATTGCCGC CGGTACTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGCACGCTCT CCTCGCTTAC 3 00 

ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 350 

240 



55 
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CAAGTGGTCC GAGGAGCGTT ACCAGGAGAT CATCAAGGAG ACCTCCAACT' COO 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 450 

TCCGGTTTCA ACGGTGACAA CATGATTGAG GCCTCATCCA ACTGCCCCTG 500 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 550 

5 TCCTCGAGGC CATCGACGCC ATCGACCCTY CCAGCCGTCC CACCGACAAG 600 

CCCCTCCGTC TTCCCCTYCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 650 

GGTGCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGGCC GGTATGGTCG 700 

TCACCTTCGC CCCCGCTGGT GTCACCACTG AAGTCAAGTC CGTCGAGATG 750 

CACCACGAGC AGCTCACCGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 800 

10 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 850 

ACTCCA AGAA CGACCCCCCC AAGGGTGCCG AGTCCTTCAA CGCCCAGGTC 900 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT - GCTGGTTACG "CCCCAGTCCT 950- 

CGACTGCCAC ACCGCCCACA TTGCTTGCAA GTTCTCTGAG CTCCTCGAGA 1000 

AGATTGACCG CCGTACCGGA AAGTCTGTTG AGAACTCTCC CAAGTTCATC 1050 

15 AAGTCCGGTG ACGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1100 

CGTTGAGGCT TTCACTGACT ACCCTCCTCT CGGTCGTTTC GCTGTCCGTG 1150 

A 1151 



20 



30 



2) INFORMATION FOR SEQ ID NO: 402 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1283 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 

CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 100 

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTTCAAT 150 

4 0 AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 200 

CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 250 

GAGGCTGGTA TCTCCAAGGA TGGTCAGACC CGTGAGCACG CTCTGCTCGC 300 

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 3 50 

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 400 

45 AACTTCATCA AGAAGGTCGG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 450 

CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 500 

CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 550 

AAGACCCTTC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCCAC 600 

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650 

50 TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGCCTGGT 700 

ATGGTCGTTA CTTTCGCTCC TGCCAACGTG ACCACTGAAG TCAAGTCCGT 750 

TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 800 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 850 

GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 900 

55 CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950 

CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGCTGAGCTC 1000 

CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG ACAAGCCCAA 1050 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100 

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150 

60 GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 1200 

241 
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ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 1250 
TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 1283 



2) INFORMATION FOR SEQ ID NO: 403 

( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 1103 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 



15 



55 



(D) TCFOLOGY7" "Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL 9 5 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 

TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50 

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100 

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150 

25 GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 200 

CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 250 

ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 300 

CAGCTCATCG TCGCCCTCAA CAAGATGGAC ACCTGCAAGT GGTCCGAGGA 350 

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 400 

30 GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 450 

GACAACATGC TTGAGCCCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500 

GAAGGAGACC AAGGCCGGCA AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550 

TCGACGCCAT TGAGCCCCCT GTCCGTCCCT CCAACAAGCC CCTCCGTCTT 600 

CCCCTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650 

35 CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 700 

CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750 

CTCCAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 800 

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850 

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900 

4 0 CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950 

TGCCCACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 1000 

GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050 

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 1100 

CAC 1103 

45 



2) INFORMATION FOR SEQ ID NO: 404 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
60 (B) STRAIN: WSA-172 

242 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

5 GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 200 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 250 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 300 

10 GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 350 

GTCGGAGGAT -CGTTACAACG. .AAATTGTCAA. GGAAACC TCCAACTTCATCA 400 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450" 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

15 TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTC CAACAAGCCC 600 

CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

20 CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

25 TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

CGAGTCCTTC ACTGACTACC CCCCTCTGGG TCGTTTCGCC GTCCGTGAC 1149 



30 2) INFORMATION FOR SEQ ID NO: 405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 



CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCCA GACTGGCAAG TATGAGGTCA CCGTCATTGG TATGTACTCA 100 

CAGAGTTCTC TTTTCATCAA AGCAATATAC TAACGTCCAT CATAGACGCC 150 

CCCGGTCACC GTGACTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC 200 

50 TGACTGCGCT ATCCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 250 

GTATCTCCAA GGATGGCCAG ACTCGTGAGC ACGCTCTGCT TGCTTTCACC 300 

CTCGGTGTCC GCCAGCTCAT CGTTGCCCTC AACAAGATGG ACACCTGCAA 350 

GTGGTCCGAG GACCGTTACA ACGAAATCGT TAAGGAGACC TCCAACTTCA 400 

TCAAGAAGGT CGGATACAAC CCCAAGGGTG TTCCTTTCGT CCCCATCTCC 450 

55 GGTTTCAACG GTGACAACAT GCTCGAGCCC TCCCCCAACT GCCCCTGGTA 500 

CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG CAAGGTCACC GGTAAGACCC 550 

TCCTTGAGGC CATCGACGCC ATCGAGCCCC CCGTCCGTCC CTCCAACAAG 600 

CCCCTCCGTC TTCCCCTCCA GGATGTCTAC AAGATCTCCG GTATTGGAAC 650 

TGTTCCCGTC GGTCGTGTCG AGACCGGTAT CATTGCCCCT GGTATGGTCG 700 

60 TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAGATG 750 
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CACCACCAGC AGCTCAAGGA 
CGTCAAGAAC GTTTCCGTCA 
ACTCCAAGAA CGACCCCCCT 
ATCGTCCTCA ACCACCCCGG 
5 GGACTGCCAC ACTGCTCACA 
AGATTGACCG CCGTACCGGA 
AAGTCCGGTG ACGCTGCCAT 
TGTTGAGGCT TTCACTGACT 
A 



AGGTGTCCCC GGTGACAACG 
AGGAGGTTCG CCGTGGTAAC 
CTTGGCTGTG AGAGCTTCAC 
TCAGGTCGGC GCTGGTTACG 
TTGCTTGCAA GTTCGCTGAG 
AAGTCTGTTG AATCTTCCCC 
CGTCAAGATG ATTCCCTCCA 
ACCCCCCTCT TGGTCGTTTC 



TTGGTTTCAA 


O A A 

800 


GTTGCCGGTG 


n i- « 
850 


CGCCCAGGTC 


900 


CTCCCGTCCT 


r\ r~ r\ 

950 


CTCCTTGAGA 


1000 


CAAGTTCATC 


1050 


AGCCCATGTG 


1100 


GCCGTCCGCG 


1150 




1151 



2) INFORMATION FOR SEQ ID NO: 406 

15 (i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

" (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capita tus 
25 (B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406 



GCTTAAAGCT GAACGTGAAC 

30 AGTTCGAAAC TCCTAAGTAC 
CGTGATTTCA TCAAGAACAT 
CATTCTTATC ATTGCTGCCG 
AGGAAGGTCA AACCAGAGAA 
AGACAACTTA TCATTGCCAT 

35 AAAGAGATAC GAAGAAATCG 
TTGGTTACAA CCCCAAGTCT 
GGTGACAACA TGTTGGAACC 
GACCAAGGAA ACCAAGGCTG 
CCATTGACGC CATTGAACCA 

4 0 CTCCCACTCC AAGATGTTTA 
CGGCCGTGTC GAAACCGGTA 
CTCCACCAAT GGTCACAACT 
CAACTTGCTC AAGGTAACCC 
CGTTTCCGTT AAGGAAATCA 

45 ACGATCCACC AAAGGGCTGC 
AACCACCCTG GTCAAATCTC 
CACTGCCCAC ATTGCCTGCA 
GTCGTTCTGG TAAGAAGATT 
GATGCCGCTA TCGTCAAGAT 

50 CTTCACTGAA TACCCACCAC 



GTGGTATCAC CATTGATATC 
TACGTTACTG TTATTGATGC 
GATTACTGGT ACTTCCCAAG 
GTGTCGGTGA ATTCGAAGCT 
CACGCTCTTC TCGCTTTCAC 
CAACAAGATG GACTCTGTCA 
TCAAGGAGGC TTCCAACTTC 
GTTCCATTCG TTCCTATCTC 
TACCACCAAC GCCCCATGGT 
GTGCCACTAA GGGTATGACT 
CCAGTAAGAC CATCCGACAA 
CAAGATTGGT GGTATCGGAA 
TCATCAAGGC CGGTATGGTC 
GAAGTTAAGT CCGTTGAAAT 
AGGTGACAAC GTTGGTTTCA 
GACGTGGTAA CGTCTGTGGT 
GAATCTTTCA ACGCTCAAGT 
TGCTGGTTAC TCTCCAGTTC 
GATTCGACGA ACTCCTTGAA 
GAAGACTCTC CAAAGTTTGT 
GATCCCAACC AAGCCAATGT 
TTGGTCGTTT CGCCGTCCGT 



GCTCTCTGGA 


50 


TCCAGGTCAC 


100 


CCGATTGCGC 


150 


GGTATCTCCA 


200 


CCTTGGTGTC 


250 


AGTGGGACCA 


300 


GTCAAGAAGG 


350 


TGGTTGGAAC 


400 


ACAAGGGATG 


450 


CTTATTGAAG 


500 


GCCACTCCGT 


550 


CTGTGCCAGT 


600 


GTTACCTTTG 


650 


GCACCACGAA 


700 


ACGTCAAGAA 


750 


GACTCCAAGA 


800 


TATCGTCTTG 


850 


TCGATTGCCA 


900 


AAGATCGACC 


950 


CAAGTCTGGT 


1000 


GCGTTGAAAC 


1050 


GAT 


1093 



2 ) INFORMATION FOR SEQ ID NO: 407 

55 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

244 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 10231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 

10 CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CT TTGT GGAA ATTCGAA ACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT" CAA~GAATATG~ATCACTGGTATCTTCTCAAGC 1 50" 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 
GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

15 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 
ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 
TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTYGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

20 TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 
CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 
TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 
TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 
CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

25 CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 
ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 
ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 
GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 10 00 

30 AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A 1101 



35 



45 



2) INFORMATION FOR SEQ ID NO: 408 



(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1089 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 

GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 50 

TCGAAACTCC AAAATACCAC GTTACCGTCA TTGATGCTCC AGGTCACAGA 100 

GATTTCATCA AGAATATGAT CACTGGTACT TCTCAAGCTG ATTGTGCTAT 150 

55 TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 200 

ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 250 

CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 300 

CAGATTTGAA GAAATCATCA AGGAAACCTC CAACTTCGTC AAGAAGGTTG 350 

GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 4 00 

60 GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 4 50 
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AAAGGAAACC AAATCCGGTA AAGTTACTGG TAAGACCTTG TTAGAAGCTA 500 

TTGACGCTAT TGAACCACCA ACCAGACCAA CCGACAAACC ATTGAGATTG 550 

CCATTGCAAG ATGTTTACAA GATCGGTGGT ATTGGTACTG TGCCAGTCGG 600 

TAGAGTTGAA ACTGGTATCA TCAAAGCCGG TATGGTTGTT ACTTTCGCCC 650 

5 CAGCTGGTGT TACCACTGAA GTCAAATCCG TTGAAATGCA TCACGAACAA 700 

TTGGCTGAAG GTGTTCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT 750 

TTCCGTTAAA GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG 800 

ATCCACCAAA GGGTTGTGAC TCTTTCAATG CCCAAGTCAT TGTTTTGAAC 850 

CATCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTCTTGG ATTGTCACAC 900 

10 TGCCCACATT GCTTGTAAAT TCGACACTTT GGTTGAAAAG ATTGACAGAA 950 

^GAACTGi5TAA_ GAAATTGGAA_ GAAAATCCAA AATTCGTCAA ATCCGGTGAT 1000 

GCTGCTATCG TCAAGATGGT CCCAACCAAA - CCAATGTGTG~TTGAAGCTTT 1050- 

CACTGACTAC CCACCATTAG GTAGATTCGC TGTCAGAGA 1089 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 409 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 ( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 56884 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

35 CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

40 TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

45 TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

50 ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

55 A 1101 



2 ) INFORMATION FOR SEQ ID NO: 410 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 60193 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 

15 CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

20 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

25 TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

30 CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

35 AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT 1102 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 411 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 1102 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

60 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

247 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 
TCAAGAAGGT 
5 GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 
TGTGCCAGTC 
10 TTACTTTCGC 
CATCACGAAC 



PCT/CA00/01150 



CGTTAAGAAC 
ACTCCAAGAA 
ATTGTTTTGA 
15 GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 
AT 

20 



GGATGGTCAA 
AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGACGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCCGTTA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ATGCTGCTAT 
TTCACTGACT 



ACCAGAGAAC 

TGTTGCTGTC 

AAGAAATCAT 

CCAAAGACTG 

GATTGAACCA 

CCAAATCCGG 

ATTGAACCAC 

AGATGTTTAC 

AAACTGGTAT 

GTTACCACTG 

AGGTGTTCCA 

AAGAAATTAG' 

AAGGGTTGTG 

TCAAATCTCT 

TTGCTTGTAA 

AAGAAATTGG 

CGTCAAGATG 

ACCCACCATT 



ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAAGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATCAAAGCC 
AAGTCAAATC 
GGTGACAATG 
"AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



GGCTTACACT 
ACTCTGTCAA 
TCCAACTTCG 
TCCAATCTCT 
GTCCATGGTA 
GGTAAGACCT 
AACCGACAAA 
GTATTGGTAC 
GGTATGGTTG 
CGTTGAAATG 
TTGGTTTCAA 
"GTTTGTGGTG" 
TGCCCAAGTC 
CTCCAGTCTT 
TTGGTTGAAA 
AAAATTCGTC 
AACCAATGTG 
GCTGTCAGAG 



25(T 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
—800- 
850 
900 
950 
1000 
1050 
1100 
1102 



2) INFORMATION FOR SEQ ID NO: 412 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
35 (B) STRAIN: NCPF 3108 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

40 CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

45 ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

50 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

55 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

60 TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 
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5 2) INFORMATION FOR SEQ ID NO: 413 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 



GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACCT CGCAGGCTGA 150 

CTGTGCTATC TTGATCATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 2 00 

25 TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 250 

GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATCTCTGGC 4 00 

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 450 

3 0 GGGCTGGGAG AAGGAGACCA AGGCCGGTAA GTCTACCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 550 

TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTACGGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 700 

3 5 CACGAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 750 

CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 800 

CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 950 

4 0 TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 1050 

TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAGATTCGCC GTCAGAGA 1098 



45 



55 



2) INFORMATION FOR SEQ ID NO: 414 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1102 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3 94 9 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 

249 
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CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

5 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTCG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

10 GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 



TGTTAGAAGC~TATTGATGCT""ATTGAACCAC~CAACCAGAee— AAGGGAGAAA 550- 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

15 TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900 

20 GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT 1102 

25 



2) INFORMATION FOR SEQ ID NO: 415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
40 (B) STRAIN: CBS 7987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

4 5 CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

50 ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

55 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

60 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

250 
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ATTGTCTTGA ACCATCCAGG TCAAATCTCT 
GGATTGTCAC ACTGCCCACA TTGCTTGTAA 
AGATTGACAG AAGAACTGGT AAGAAATTGG 
AAATCCGGTG ACGCTGCTAT YGTCAAGATG 
5 TGTTGAAGCT TTCACTGACT ACCCACCATT 
AT 



PCT/CA00/01150 

GCTGGTTAYT CTCCAGTCTT 900 
ATTCGACACT TTGGTTGAAA 950 
AAGAAAATCC AAAATTCGTC 1000 
GTCCCAACCA AACCAATGTG 1050 
AGGTAGATTC GCTGTCAGAG 1100 

1102 



10 2) INFORMATION FOR SEQ ID NO: 416 



15 



20 



25 



30 



35 



40 



"TiTSEQUENCE _ CHARACTERISTICS-r 

(A) LENGTH : 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double. 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 



45 



AATTGAAGGC 
AAATTCGAAA 
CAGAGATTTC 
CTATTTTRAT 
AAGGATGGTC 
TAGACAATTG 
AGGCTAGATT 
GTTGGTTACA 
CGGTGACAAC 
GGGAAAAGGA 
GCCATTGATG 
ATTACCATTA 
TCGGTAGAGT 
GCCCCAGCCG 
ACAATTAGCT 
ACGTTTCCGT 
AACGACCCAC 
GAACCACCCA 
ACACCGCCCA 
AGAAGATCCG 
TGATGCTGCT 
CTTTCACTGA 



TGAAAGAGAA 
CTCCAAAATA 
ATCAAGAACA 
TATTGCTGGT 
AAACCAGAGA 
ATTGTTGCCG 
CGAAGAAATC 
ACCCTAAGAC 
ATGATTGAAG 
AACCAAGGCT 
CCATTGAACC 
CAAGATGTCT 
TGAAACCGGT 
GTGTCACTAC 
GAAGGTGTTC 
TAAGGAAATC 
CAAAGGGTGC 
GGTCARATCT 
CATTGCTTGT 
GTAAGAAATT 
ATCGTCAAGA 
ATACCCACCA 



AGAGGTATCA 
CCACGTTACC 
TGATTACTGG 
GGTGTCGGTG 
ACACGCTTTA 
TCAACAAGAT 
ATCAAGGAAA 
TGTTCCTTTC 
CCTCCACCAA 
GGTAAATCTA 
ACCAACCAGA 
ACAAGATCGG 
GTTATCAAGG 
CGAAGTCAAA 
CAGGTGACAA 
AGAAGAGGTA 
TGAATCTTTC 
CTGCTGGTTA 
AAATTCGATG 
AGAAGACGAA 
TGGTCCCAAC 
TTAGGTAGAT 



CCATTGATAT 
GTTATTGATG 
TACTTCTCAA 
AATTCGAAGC 
TTGGCTTACA 
GGACTCTGTT 
CCTCTAACTT 
GTYCCAATTT 
CTGTCCATGG 
CTGGTAAGAC 
CCAACCGAAA 
TGGTATTGGT 
GTGGTATGGT 
TCCGTTGAAA 
TGTTGGTTTC 
ACGTTTGTGG 
ACCGCTCAAG 
CTCTCCAGTC 
CTTTACTCGA 
CCAAAATTCG 
CAAACCAATG 
TCGCTGTTAG 



CGCTTTATGG 50 

CTCCAGGTCA 100 

GCTGATTGTG 150 

CGGTATCTCT 200 

CCTTAGGTGT 250 

AAATGGGACA 300 

CGTCAAGAAG 350 

CTGGATGGAA 400 

TACAAGGGTT 450 

TTTGTTAGAA 500 

AGCCATTGAG 550 

ACTGTGCCAG 600 

TGTTACCTTT 650 

TGCACCACGA 700 

AACGTCAAGA 750 

TGACTCCAAG 800 

TTATTGTCTT 850 

TTAGATTGTC 900 

AAAGATTGAC 950 

TCAAGTCCGG 1000 

TGTGTTGAAG 1050 

AGAT 1094 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 417 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

251 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 

TTTGGACAAG TTGAAGGCTG AAAGAGAAAG AGGTATCACT ATCGATATCG 50 

CTTTGTGGAA GTTCGAAACT CCAAAGTACC ACGTYACCGT TATCGATGCY 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

10 TGACTGTGCT ATCTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCYG 200 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTCTATT GGCTTTCACC 250 

" CTAGGTGWA^GACAATTGAT~TGTYGCTGTC-AACAAGATGG-ACTCTGTGAA- 300 

GTGGGATGAA TCCAGATTCG CTGAAATCGT TAAGGAAACC TCCAACTTCA * 350 

TCAAGAAGGT CGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

15 GGTTGGAACG GTGACAACAT GATTGAAGCC ACCACCAACG CTTCCTGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAGGCTGG TGTCGTCAAG GGTAAGACCT 500 

TGTTGGAAGC CATTGACGCT ATCGAACCAC CAACCAGACC AACTGACAAG 550 

CCATTGAGAT TGCCATTGCA AGATGTCTAC AAGATCGGTG GTATCGGTAC 600 

GGTGCCAGTC GGTAGAGTCG AAACCGGTGT CATCAAGCCA GGTATGGTTG 650 

20 TTACCTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCACGAAC AATTGACTGA AGGTTTGCCA GGTGACAACG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AGGAAATCAG AAGAGGTAAT GTCTGTGGTG 800 

ACTCCAAGAA CGACCCACCA AAGGCTGCTG CTTCTTTCAA CGCTACCGTC 850 

ATTGTCTTGA ACCACCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTTTT 900 

25 GGACTGTCAC ACCGCCCACA TTGCTTGTAA GTTCGAAGAA TTGTTGGAAA 950 

AGAACGACAG AAGATCCGGT AAGAAGTTGG AAGACTCTCC AAAGTTCTTG 1000 

AAGTCCGGTG ACGCTGCTTT GGTTAAGTTC GTTCCATCCA AGCCAATGTG 1050 

TGTCGAAGCT TTCTCCGACT ACCCACCATT GGGTAGATTC GCTGTCAGAG 1100 

A 1101 

30 



2) INFORMATION FOR SEQ ID NO: 418 

35 ( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
45 (B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 

AGAGAAAGAG GTATCACCAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

50 AAAGTACCAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100 

AGAACATGAT CACTGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 200 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 250 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 300 

55 GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 3 50 

TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 400 

TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 450 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGCCAT 500 

TGAGCCACCT CAAAGACCAA CCGACAAGCC ATTGAGATTG CCATTGCAAG 550 

60 ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600 

252 
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ACCGGTATCA TYAAGGCCGG 
YACCACTGAA GTCAAGTCCG 
GTGTTCCAGG TGACAATGTT 
GAAATTAGAA GAGGTAACGT 
5 GGGTTGTGAC TCTTTCACCG 
AAATCTCTGC TGGTTACTCT 
GCTTGTAAAT TCGACACCTT 
GAAGATGGAG GACAACCCCA 
TCAAGATGGT GCCATCCAAG 
10 CCACCATTGG GAAGATTCGC 



TATGGTTGTT ACCTTTGCCC 
TGGAAATGCA CCACGAACAA 
GGTTTCAACG TTAAGAACGT 
TTGTGGTGAC TCCAAGAACG 
CTCAAGTTAT TGTGTTGAAC 
CCAGTTTTGG ACTGTCACAC 
GTTGGAGAAG ATTGACAGAA 
AGTTTGTCAA GTCCGGTGAC 
CCAATGTGTG TTGAGGCTTT 
CGTCAGAGAC 
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2) INFORMATION FOR SEQ ID NO: 419 



15 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 

30 TCTGTCAAGT GGGACAAGGC CAGATACGAG GAAATCGTCA AGGAGACCTC 50 

TAACTTCGTC AAGAAGGTTG GTTACAACCC TAAGACTGTT CCATTCGTCC 100 

CAATCTCTGG TTGGAACGGT GACAACATGA TTGAGGCTTC TACCAACTGT 150 

GACTGGTACA AGGGTTGGGA GAAGGAGACC AAGTCTGGTA AGTCCACCGG 200 

TAAGACCTTG TTGGAGGCCA TTGACGCCAT TGAGCCACCA ACCAGACCAA 250 

35 CCGACAAGCC ATTGAGATTG CCATTGCAGG ATGTCTACAA GATTGGTGGT 300 

ATCGGAACTG TGCCAGTCGG CAGAGTTGAG ACCGGTGTTA TCAAGGCCGG 350 

TATGGTTGTC ACCTTCGCCC CAGCTGGTGT CACCACTGAA GTCAAGTCTG 400 

TCGAGATGCA CCACGAGCAG TTGCCAGAGG GTGTCCCAGG TGACAACGTT 450 

GGTTTCAACG TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT 500 

40 CTGTGGTGAC TCCAAGCAGG ACCCACCAAA GGGCTGTGAC TCTTTCACCG 550 

CTCAGGTTAT TGTGTTGAAC CACCCAGGTC AGATCTCTTC TGGTTACTCT 600 

CCAGTTTTGG ACTGTCACAC TGCCCACATT GCTTGTAAGT TCGACACCTT 650 

GGTTGAGAAG ATCGACAGAA GAACCGGTAA GAAGTTGGAA GATGAGCCAA 700 

AGTTCATCAA GTCCGGTGAC GCTGCTATCG TCAAGATGGT CCCAACCAAG 750 

45 C 751 



2) INFORMATION FOR SEQ ID NO: 420 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Candida inconspicua 

253 
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(B) STRAIN: ATCC 16783 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 

5 TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

AGATTGTGCT ATTTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATTTCCAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 250 

10 TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 300 

GTGGG ATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350 



TCAAGAAGGT TGGTTACAAC CCAA7fGA"CTG~TTCCATTCGT~TCCAATTTCT~ — 400- 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

15 TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGATAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

20 CGTCAAGAAC GTCTCTGTTA AGGATATCAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 900 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTCGATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

25 AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

AT 1102 



30 



40 



2) INFORMATION FOR SEQ ID NO: 421 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1099 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 

TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCACCAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGTACCAA GTTACCGTTA TCGATGCTCC 100 

AGGTCACAGA GATTTCATCA AGAACATGAT TACTGGTACT TCTCAAGCTG 150 

50 ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 200 

ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 250 

GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 300 

GGGATGAATC TCGTTACCAA GAAATTGTTA AGGAAACCTC CAACTTCATC 350 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 4 00 

55 TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT CCATGGTACA 450 

AGGGTTGGGA AAAGGAAACC AAGGCTGGTA CCGTCAAGGG TAAGACCTTG 500 

TTGGAAGCTA TTGACGCTAT CGAACCACCA ACCAGACCAA CTGACAAGCC 550 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 600 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 650 

60 ACCTTCGCCC CAGCCGGTGT CACTACCGAA GTTAAGTCCG TCGAAATGCA 700 

254 
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10 



25 
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CCACGAACAA TTGGAAGAAG GTCTACCAGG TGACAACGTC GGTTTCAACG x 750 

TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT CTGTGGTGAC 800 

TCCAAGAACG ATCCACCAAA GGCTGCTGCT TCTTTCAACG CCACTGTTAT 850 

CGTCTTGAAC CACCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTTTTGG 900 

ATTGTCACAC TGCTCACATT GCTTGTAAGT TCGACGAATT GTTGGAAAAG 950 

AACGACAGAA GATCCGGTAA GAAGTTGGAA GACTCTCCAA AGTTCTTGAA 1000 

GTCTGGTGAC GCTGCTTTGG TTAAGTTCGT TCCATCTAAG CCAATGTGTG 1050 

TTGAAGCATT CTCTGACTAC CCACCATTGG GTAGATTCGC TGTCAGAGA 1099 



2) INFORMATION FOR SEQ TD~NO~: 422 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1095 bases < 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 



AAGTTAAAGG CAGAAAGAGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 

GAAGTTYGAA ACTCCAAART ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

3 0 ACAGAGATTT CATCAAGAAC ATGATTACCG GTACTTCTCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTGTCGGT GAATTCGAAG CTGGTATCTC 200 

CAAGGATGGT CAAACTAGAG AACACGCTCT ATTGGCTTTC ACCTTAGGTG 250 

TTAGACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAARTGGGAT 300 

GAAAACAGAT TTGAAGAAAT TGTCAAGGAA ACCCAAAACT TCATCAAGAA 350 

35 GGTTGGTTAC AACCCAAAGA CTGTTCCATT CGTTCCAATY TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCATCCACCA ACTGTCCATG GTACAAGGGT 450 

TGGACTAAGG AAACCAAGGC AGGTGTTGTT AAGGGTAAGA CCTTATTAGA 500 

AGCAATCGAT GCTATTGAAC CACCTGTCAG ACCAACCGAA AAGCCATTAA 550 

GATTACCATT ACAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

40 GTCGGTAGAG TCGAAACCGG TGTCATTAAG CCAGGTATGG TTGTCACTTT 650 

TGCTCCAGCA GGTGTCACCA CCGAAGTCAA GTCCGTTGAA ATGCACCATG 700 

AACAATTAGA ACAAGGTGTT CCAGGTGATA ACGTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TCAAGGATAT CAAGAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGACCCA CCAATGGGTG CAGCTTCCTT CAATGCTCAA GTCATTGTCT 850 

45 TGAACCACCC TGGTCAAATT TCCGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCCC ACATTGCATG TAAGTTCGAC GAATTAATCG AAAAGATTGA 950 

CAGAAGAACT GGTAAGTCTG TTGAAGACCA TCCAAAGTCY GTCAAGTCTG 1000 

GTGATGCAGC TATCGTCAAG ATGGTCCCAA CCAAGCCAAT GTGTGTTGAA 1050 

GCTTTCACTG AATAYCCACC ATTAGGTAGA TTCGCAGTCA GAGAT 1095 

50 



2) INFORMATION FOR SEQ ID NO: 423 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

255 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida Iambi ca 
5 (B) STRAIN: ATCC 24750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 



CTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

10 CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100 

CC AGGTC ACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

AGATTGTGCT ATTTTRATYA - TTGCTGGTGG TGTCGGTGAA^TTCGAAGeTG 200- 

GTATCTCTAA GGATGGTCAA ACCAGAGAAC ACGCTCTTCT TGCATTCACT 250 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 300 

15 GTGGGACGAA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

GGTTGGAACG GTGACAACAT GATTGAACCA TCTGCTAACT GTCCATGGTA 450 

CAAGGGATGG ACTAAGGAAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 500 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

2 0 AAGGCTTTGA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600 

TACTGTGCCA GTCGGTAGAG TTGAAACCGG TATCATCAAG CCAGGTATGA 650 

TTGTCGTTTT CGCTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATGCACCATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 750 

CAACGTCAAG AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 800 

25 GTGACTCCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850 

GTCATTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTACTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGCTA CTGAAGACCA TCCAAAGTCT 1000 
GTCAAGTCTG GTGATGCAGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT ' 1050 

30 GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100 

GAGA 1104 



35 2 ) INFORMATION FOR SEQ ID NO: 424 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 



GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATCACCATC GATATCGCTT 50 

TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150 

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 200 

55 TCTCTAAGGA CGGTCAAACC AGAGAGCACG CTTTGTTGGC TTACACCTTG 250 

GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 300 

GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACTGTTC CATTCGTCCC AATCTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 450 

60 GGGTTGGGAG AAGGAGACCA AGTCYGGTAA GTCCACCGGT AAGACCTTGT 500 

256 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TGGAGGCCAT TGACGCCATT GAGCCACCTT 
TTGAGATTGC CATTGCAAGA TGTCTACAAG 
GCCAGTCGGT AGAGTTGAGA CCGGTGTCAT 
CCTTTGCTCC AGCTGGTGTC ACCACTGAAG 
5 CACGAACAAT TGGCTGAGGG TGTCCCAGGT 
CAAGAACGTT TCCGTCAAGG AAATCAGAAG 
CCAAGAACGA CCCACCAAAG GCTGCTGCTT 
GTCTTGAACC ACCCAGGTCA AATCTCCTCY 
CTGTCACACT GCYCACATTG CTTGTAAGTT 
10 TCGACAGAAG AACYGGTAAG AAGTTGGAAG 
TCYG GTGACG CTGCTATCGT CAAGATGGTC 
YGAAGCTTTC ACCGACTACC "CACCATTGGG 



PCT/CA00/01150 



ppapappa ap 


PPA p a a pf^fi* 




a t vpptppt a 


TTPPT A PTPT 


DUU 


PA APPPPPPT 


ATPPTTPTPA 


ccn 
DOU 


TP A APTPPPT 


pp a a atppap 


inn 


PA P A A fflTTn 


PTTTPA 7A PPT 
bill LA/iLb 1 


7 1; n 


21 PPT A A PPTP 


TPTPPTP APT 


OUU 


PVTTP APTPP 


TP A APTVATV 


Q c; rv 


PPTT A PTPTP 


PAPTVTTPP A 




PP 7A P7A PPTTP 




ybu 


AAGAGCCAAA 


GTTCATCAAG 


1000 


CCAACCAAGC 


CAATGTGYGT 


1050 


TAGATTCGCT 


GTCAGAGA 


1098 



15 



20 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 425 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425 



35 



40 



45 



50 



55 



TCTTGACAAG 
CTTTATGGAA 
CCAGGTCACA 
TGATTGTGCT 
GTATCTCCAA 
TTAGGTGTTA 
GTGGGATGAA 
TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TATTAGAAGC 
CCATTAAGAT 
TGTGCCAGTC 
TTGTTTTCGC 
CACCATGAAC 
CGTCAAGAAC 
ACTCCAAGAA 
ATTGTCTTGA 
AGATTGTCAC 
AGATTGACAG 
AAGTCTGGTG 
TGTTGAAGCT 
A 



TTAAAGGCTG 
ATTCGAAACT 
GAGATTTCAT 
ATCTTAATTA 
GGATGGTCAA 
AGCAATTAAT 
AAGAGATTTG 
TGGTTACAAC 
GTGACAACAT 
ACTAAGGAAA 
TATTGATGCT 
TACCATTACA 
GGTAGAGTTG 
ACCATCTGGT 
AATTAGAACA 
GTCTCTGTTA 
CGACCCACCA 
ACCACCCTGG 
ACTGCCCACA 
AAGAACTGGT 
ATGCAGCTAT 
TTCACTGAAT 



AAAGAGAAAG 
CCAAAATACC 
TAAGAACATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTATC 
AAGAAATTGT 
CCAAAGACTG 
GATTGAACCA 
CCAAGGCAGG 
ATTGAACCAC 
AGATGTTTAC 
AAACCGGTGT 
GTTACCACTG 
AGGTGTCCCA 
AGGATATTAA 
CAAGGTTGTG 
TCAAATTTCT 
TTGCATGTAA 
AAGTCCGTTG 
CGTTAAGATG 
ACCCACCATT 



AGGTATCACT 
ACGTTACCGT 
ATTACTGGTA 
TGTCGGTGAA 
ACGCTTTATT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCTACTAACT 
TGTTGTTAAG 
CTGTCAGACC 
AAGATTGGTG 
TATTAAGCCA 
AAGTCAAGTC 
GGTGACAATG 
GAGAGGTAAC 
CTTCCTTCAA 
GCAGGTTACT 
GTTCGATGAA 
AAGACCATCC 
GTTCCAACCA 
AGGTAGATTC 



ATTGATATTG 50 

TATTGATGCT 100 

CTTCCCAAGC 150 

TTCGAAGCTG 200 

AGCATTCACC 250 

ACTCTGTTAA 300 

CAAAACTTCA 350 

TCCAATTTCT 400 

GTCCATGGTA 450 

GGTAAGACCT 500 

AACTGACAAG 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCTCAAGTC 850 

CTCCAGTTTT 900 

TTAATCGAAA 950 

AAAGTCTGTT 1000 

AGCCAATGTG 1050 

GCAGTCAGAG 1100 
1101 



60 



2 ) INFORMATION FOR SEQ ID NO: 426 
(i) SEQUENCE CHARACTERISTICS: 



257 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 
10 (B) STRAIN: ATCC 90018 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 

CAAATTGAAG GCTGAAAGAG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

15 GGAAATTCGA AACTCCAAAA TACCATGTTA CTGTTATTGA TGCTCCAGGT 100 

CACAGAGATT TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CTAAGGATGG TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 250 

GTTAAGCAAT TGATTGTTGC CATCAACAAG ATGGACTCAG TCAAATGGGA 300 

20 CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 350 

AGGTTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 400 

AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 500 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

25 AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 600 

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

TTGCCCCAGC TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC 700 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 750 

GAACGTTTCA GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 800 

30 AGAACGATCC ACCAAAGGGA TGTGAYTCCT TCAATGCTCA AGTTATTGTC 850 

TTGAACCACC CAGGTCAAAT CTCTGCTGGT TACTCACCAG TCTTGGATTG 900 

TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

ACAGAAGAAC CGGTAAGAAA TTGGAAGATG AACCAAAATT CATCAAGTCC 1000 

GGTGATGCTG CYATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA 1050 

35 AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 1095 



2) INFORMATION FOR SEQ ID NO: 427 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 

55 CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 200 

GTAAGACTCT TCTCGAGGCC ATTGACGCCA TCGAGCCCCC CTCCCGTCCT 250 

60 ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 300 

258 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TATCGGTACG GTACCTGTCG 
GCATGGTCGT CACTTTCGCC 
GTCGAGATGC ACCACGAGCA 
CGGTTTCAAC GTCAAGAACG 
5 TCTGCGGTGA CTCCAAGAAC 
GCCCAGGTCA TCGTTTTCAA 
CCCCGTTTTG GACTGCCACA 
TCCAGTCCAA GATGGACCGT 
AAGTTCATCA AGGCTGGTGA 
10 GC 



GCCGTGTCGA GACCGGTATC 
CCCGCTGGTG TCACCACTGA 
GATCCCCGAG GGTCTCCCCG 
TTACCGTCAA GGATATCCGC 
GACCCCCCCA AGGGCTGCTC 
CCACCCCGGT CAGATCTCCA 
CCGCCCACAT TGCCTGCCGC 
CGTACTGGTA AGACCCTTGA 
CTCCGCTATC GTCAAGATGG 



PCT/CAOO/01150 

ATCAAGCCCG 350 

AGTGAAGTCC 400 

GTGACAACGT 450 

CGTGGTAACG 500 

TTCCTTCACT 550 

ACGGTTACTC 600 

TTCGACGAGA 650 

GGAGAACCCC 700 

TTCCCTCCAA 750 
752 



2) INFORMATION FOR SEQ ID NO: 428 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

3 0 AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 50 

AAGTTCGAAA CTCCAAAGTA CCAAGTTACC GTTATCGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGACTGTG 150 

CTATCTTGAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCC 200 

AAGGATGGTC AAACCAGAGA ACACGCTTTG TTGGCTTTCA CCTTGGGTGT 250 

35 TAGACAATTG ATTGTTGCTG TTAACAAGAT GGATTCCGTT AAGTGGGATG 300 

AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 350 

GTCGGTTACA ACCCAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 400 

CGGTGACAAC ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 450 

GGGAAAAGGA AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 500 

40 GCTATTGACG CTATCGAACC ACCATCCAGA CCAACTGACA AGCCATTGAG 550 

ATTGCCATTG CAAGATGTCT ACAAGATTGG TGGTATCGGA ACTGTGCCAG 600 

TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTACCTTT 650 

GCCCCAGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 700 

ACAATTGGAA GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGTCAAGA 750 

45 ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 800 

AACGATCCAC CAAAGGCTGC TGCTTCTTTC AACGCCACTG TTATCGTCTT 850 

GAACCATCCA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ACACTGCTCA CATTGCTTGT AAGTTCGACG AATTGTTGGA AAAGAACGAT 950 

AGAAGATCCG GTAAGAAGTT GGAAGACTCT CCAAAGTTCT TGAAGTCCGG 1000 

50 TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 1050 

CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 1093 



55 2) INFORMATION FOR SEQ ID NO: 42 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

259 



SUBSTITUTE SHEET (RULE 2b) 



10 



35 



45 



WO 01/23604 PCT/CA00/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 
AAATTGAAGG CTGAAAGAGA AAGAGGTATC ACCATTGATA TCGCTTTGTG 50 



GAAATT'CGAA - ACTCCAAAAT _ ACCACGTTAC _ CGTTATTGAT _ GCTCCAGGTC 1-0 0- 

ACAGAGATTT CATCAAGAAC ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTACTGGT GAATTCGAAG CTGGTATTTC 200 

15 TAAAGATGGT CAAACCAGAG AACACGCTTT GTTGGCTTAC ACCTTGGGTG 250 

TCAAACAATT GATTGTTGCT GTCAACAAGA TGGACTCTGT TAAATGGGAC 300 

AAAAACAGAT TTGAAGAAAT TATCAAGGAA ACTTCTAACT TCGTCAAGAA 350 

GGTTGGTTAC AACCCTAAGG* CTGTTCCATT CGTTCCAATC TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCTTCTACCA ACTGTCCATG GTACAAGGGT 450 

2 0 TGGGAAAAAG AAACCAAGGC TGGTAAGGTT ACCGGTAAGA CTTTGTTGGA 500 

AGCCATTGAT GCTATTGAAC CACCTTCAAG ACCAACTGAC AAGCCATTGA 550 

GATTGCCATT GCAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

GTCGGTAGAG TTGAAACTGG TGTCATCAAA GCCGGTATGG TTGTTACTTT 650 

CGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCCGTCGAA ATGCACCACG 700 

25 AACAATTGGC TGAAGGTGTC CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TTAAAGAAAT TAGAAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGATCCA CCAAAGGGTT GTGACTCTTT CAACGCTCAA GTTATTGTCT 850 

TGAACCACCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCTC ATATTGCTTG TAAATTCGAC ACCTTGGTTG AAAAGATTGA 950 

30 CAGAAGAACT GGTAAGAAAT TGGAAGAAAA TCCAAAATTC GTCAAATCCG 1000 

GTGATGCTGC TATTGTCAAG ATGGTTCCAA CCAAACCAAT GTGTGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTCA GAGA 1094 



2 ) INFORMATION FOR SEQ ID NO: 430 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 

CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT CACTATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCAAAG TACCACGTTA CTGTCATTGA TGCCCCAGGT 100 

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 150 

55 TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 200 

CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 250 

GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 300 

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 350 

AGGTTGGTTA CAACCCAAAG ACTGTTCCAT TTGTCCCAAT TTCYGGTTGG 400 

60 AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 450 

260 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
TTGGGAGAAG GAGACCAAGG CTGGTGTTGT 
ATGCCATTGA CGCCATTGAG CCACCAACAA 
AGATTGCCAC TCCAGGATGT CTACAAGATT 
AGTCGGTAGA GTCGAGACCG GTGTCATCAA 
5 TTGCCCCATC CGGTGTCACC ACTGAGGTTA 
GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC 
GAACGTCTCT GTTAAGGAGA TCAGAAGAGG 
AGAACGACCC ACCACAGGGT GCTGAGTCCT 
TTGAACCACC CAGGTCAGAT CTCTGCTGGT 
10 TCACACCGCC CACATTGCTT GTAAGTTCTC 
AC AGAAGAT C CGGTAAGTCC CTTGAGGCCT 
GGTGATGCCG CTATCGTCAAT "GATGGTTCCA" 
GGCCTTCACT GACTACCCAC CACTCGGTAG 



PCT/CAOO/01150 



CAAGGGTAAG 


ACCTTGCTCG ' 




GACCAACTGA 


CAAGCCATTG 


550 


GGTGGTATCG 


GAACTGTTCC 


600 


GCCAGGTATG 


GTTGTTACCT 


650 


AGTCCGTCGA 


GATGCACCAC 


700 


AACGTTGGTT 


TCAACGTTAA 


750 


TAACGTTGCC 


GGTGACTCCA 


800 


TCAACGCTCA 


GGTCATTGTC 


850 


TACTCTCCAG 


TTTTGGACTG 


900 


TGAGCTTTTG 


GAGAAGATTG 


950 


CTCCAAAGTT 


CGTCAAGTCT 


1000 


TCCAAGCCAT 


TGTGTGTTGA — 


—1-050- 


ATTCGCTGTC 


AGAGA 


1095 



15 



2) INFORMATION FOR SEQ ID NO: 431 

(i) SEQUENCE CHARACTERISTICS; 
20 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 



GCTGAAAGAG AAAGAGGTAT CACCATCGAT ATCGCTTTGT GGAAATTCGA 50 

AACTCCAAAR TACCACGTTA CCGTCATTGA YGCTCCAGGT CACAGAGATT 100 

35 TCATCAAGAA CATGATYACT GGTACTTCTC AAGCTGATTG TGCTATYTTG 150 

ATTATCGCTG GTGGTACTGG TGAATTCGAA GCTGGTATYT CTAAGGATGG 2 00 

TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 250 

TGATTGTTGC TGTCAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 300 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 350 

40 CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 400 

ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 4 50 

GAAACCAAGG CTGGTAAGGT TACCGGTAAG ACTTTGTTGG AAGCCATTGA 500 

CGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG AGATTGCCAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 600 

45 GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 700 

CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 750 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 800 

ACCAAAGGGT TGTGASTCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 850 

50 CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 900 

CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

CGGTAAGAAG TTGGAAGAAA ACCCAAAGTT TGTCAAGTCC GGTGACGCTG 1000 

CTATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 

55 



2) INFORMATION FOR SEQ ID NO: 432 
60 (i) SEQUENCE CHARACTERISTICS: 



261 



SUBSTITUTE SHEET (RULE 26) 



5 



WO 01/23604 PCT/CAOO/01150 

(A) LENGTH: 1072 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 
10 (B) STRAIN: ATCC 7351 

n^rSEQUWCE "DESCRIPTION": ~SEQ "ID-NO : -4-32 



AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 50 

15 AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG 100 

ATCACTGGTA CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 200 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGCTGTC 250 

AACAAGATGG ACTCCGTCAA GTGGGACAAG AACAGATTCG AGGAGATTGT 300 

20 CAAGGAGACC TCCAACTTCG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 350 

TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 400 

TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 450 

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 500 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 550 

25 AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGACCGGCAT 600 

CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 700 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGATCAG 750 

AAGAGGTAAC GTTTGCGGTG ACTCCAAGAA CGACCCCCCC AAGGCTGCTG 800 

30 CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 850 

GCTGGTTACT CTCCGGTTTT GGATTGCCAC ACTGCCCACA TTGCTTGCAG 900 

ATTCGACCAG TTGATTGAGA AGATCGACAG AAGAACCGGT AAGAAGATGG 950 

AGGACGACCC TAAGTTCATC AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 1000 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 1050 

35 GGGTCGTTTC GCTGTCAGAG AC 1072 



2) INFORMATION FOR SEQ ID NO: 433 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 

55 AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA AGGAAGTCTC 50 

CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT CCATTCGTCC 100 

.CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC CACCAACGCT 150 

CCTTGGTACA AGGGCTGGAA CAAGGAGACC GCCTCTGGCA AGCACACTGG 2 00 

CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA ACCCGCCCCA 250 

60 CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA GATCTCTGGT 300 

262 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ATCGGAACAG TCCCAGTCGG 
TATGGTTGTG ACCTTCGCTC 
TCGAAATGCA CCACCAGCAG 
GGCTTCAACG TCAAGAACGT 
5 CGCTGGTGAC TCCAAGAACG 
CCCAGGTCAT CGTCCTCAAC 
CCAGTCCTTG ACTGCCACAC 
CCTCGAGAAG ATCGACCGCC 
AGTTCATCAA GTCTGGTGAT 
10 C 



CCGTGTCGAA ACCGGTGTTA 
CTTCCAACGT CACCACTGAA 
CTCACCCAGG GTAACCCTGG 
CTCTGTCAAG GAAGTCCGCC 
ACCCACCAAA GGGCTGCGAC 
CACCCTGGTC AAGTCGGTGC 
TGCCCACATT GCTTGCAAGT 
GTACCGGTAA ATCCGTTGAG 
GCCGCTATCG TCAAGATGGT 



PCT/CA00/01150 

TCAAGCCTGG * ■■ J50 

GTCAAGTCCG 400 

TGACAACGTT 450 

GCGGTAACGT 500 

TCCTTCAACG 550 

TGGTTATGCC 600 

TCTCCGAGCT 650 

AACAACCCCA 700 

TCCATCCAAG 750 
751 



2) INFORMATION FOR SEQ ID NO: 434 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY : Linear 

( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 

30 AAGCTCAAGG CCGAGCGAGA GCGAGGTATC ACCATCGACA 
GAAGTTCGAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC 
ACCGAGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 
GCCATCCTCA TCATCGCCTC CGGTATCGGA GAGTTCGAGG 
CAAGGACGGT CAGACCCGAG AGCACGCCCT TTTGGCCTTC 

35 TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG 
TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT 
CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG 
AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA 
GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC 

4 0 GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA 
GAGGCCATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG 
GCGATTGCCC CTCCAGGATG TCTACAAGAT CGGTGGTATC 
CCGTCGGTCG AGTCGAGACC GGTGTCATCA AGGCTGGTAT 
TTCGCCCCTG CCAACGTCAC CACCGAAGTC AAGTCCGTCG 

45 CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT 
AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG 
AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC 
CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC 
GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT 

50 GACCGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT 
CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC 
AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT 



TCGCCTTGTG 


50 


GGCCCCGGTC 


100 


GGCCGACTGT 


150 


CTGGTATCTC 


200 


ACCCTCGGTG 


250 


CAAGGTTAGT 


300 


TTACAGTGGT 


350 


TTTCATCAAG 


400 


TCTCCGGATG 


450 


TGGTACAAGG 


500 


GACCTTGCTC 


550 


ACAAGCCCTT 


600 


GGTACGGTGC 


650 


GGTCGTCACC 


700 


AAATGCACCA 


750 


TTCAACGTCA 


800 


CTCCGACTCG 


850 


AGGTCATTGT 


900 


GTTTTGGACT 


950 


TGAGAAGATC 


1000 


TCGTCAAGTC 


1050 


ATGTGTGTCG 


1100 


CCGAGA 


1146 



55 

2) INFORMATION FOR SEQ ID NO: 435 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
60 (B) TYPE: Nucleic acid 

263 



SUBSTITUTE SHEET (RULE 26) 



5 



35 



50 



WO 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala jeanselmei 

(B) STRAIN: ATCC 64755 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 



CAAGCTGAAG GCCGAGCGTG" AGCGTGGTAT" "CACCATCGAT-ATCGCCTTGT— 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 100 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 150 

15 CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATTCGAA GCCGGTATCT 200 

CCAAGGATGG TCAGACTCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 2 50 

GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 3 00 

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA 3 50 

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 4 00 

20 AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 550 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 600 

CGTCGGTCGT GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 650 

25 TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC 700 

GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 800 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGTCAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 900 

30 CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950 

ACCGCCGTAC CGGTAAATCC ATCGAAAACA ACCCCAAGTT CATCAAGTCT 1000 

GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CTCTTGGTCG TTTCGCCGTC CGTGA 1095 



2) INFORMATION FOR SEQ ID NO: 436 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 



AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 100 

55 TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 150 

CGGTCACCGT GATTTCATCA AGAACATGAT CACTGGTACT TCCCAGGCCG 200 

ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTTCTTG CCTACACCCT 300 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC ACCACCAAGT 350 

60 GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC CTCTTTCATC 400 

264 



SUBSTITUTE SHEET (RULE 26) 
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AAGAAGGTCG GCTACAACCC CAAGGCTGTC GCTTTCGTCC CCATCTCCGG 450 

TTTCAACGGT GACAACATGC TTACCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGTTGGGA GCGTGAGATC AAGTCCGGCA AGCTCACTGG CAAGACCCTC 550 

CTCGAGGCCA TTGACTCCAT CGAGCCCCCC AAGCGTCCCG TTGACAAGCC 600 

5 CCTTCGTCTT CCCCTTCAGG ATGTCTACAA GATCGGTGGT ATTGGAACGG 650 

TTCCCGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG TATGGTCGTT 700 

ACCTTCGCTC CTTCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACGAGCAA CTCACTGAGG GCCAGCCCGG TGACAACGTT GGTTTCAACG 800 

TGAAGAACGT CTCCGTCAAG GACATCCGAC GTGGTAACGT CGCTGGTGAC 850 

10 TCCAAGAACG ACCCCCCTAT GGGTGCCGCT TCTTTCACCG CCCAGGTCAT 900 

_ CGTCCTCAAC CACCCCGGTC AGGTCGGTGC TGGTTACGCT CCCGTCCTCG 950 



ATTGTCACAC TGCCCACATT GCCTGCAAGT~TCGCCGAGAT~CCAGGAGAAG 1000- 

ATCGACCGCC GAACCGGTAA GGCTACTGAG GCCGCCCCCA AGTTCATCAA 1050 
GTCTGGTGAC TCCGCCATCG TCAAGATGGT TCCCTCCAAG CCCATGTGTG 1100 
15 TTGAGGCTTT CAC • 1113 



2) INFORMATION FOR SEQ ID NO: 437 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Geotrichum spp. 

(B) STRAIN: LEV-4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 

35 GGTCCGAGGA CAGATTCAAC GAGATTGTCA AGGAGACTTC CAACTTCATC 50 

AAGAAGGTTG GTTAYAACCC CAAGACTGTT GCTTTCGTCC CCATCTCTGG 100 

TTGGAACGGT GACAACATGA TTGAGCCCTC CACCAACTGC CCCTGGTACA 150 

AGGGATGGCA GAAGGAGACC AAGGCTGGTG TCACTAAGGG TAAGACCCTC 200 

CTTGAGGCCA TCGATGCCAT TGAGCCCCCT GTCAGACCTT CCGACAAGCC 250 

40 CCTCCGTCTT CCCCTCCAGG ATGTCTACAA GATCGGTGGT ATCGGAACTG 3 00 

TGCCCGTCGG CCGTGTCGAA ACCGGTGTCA TCAAGGCCGG TATGGTCGTC 3 50 

ACCTTCGCCC CCGCTGGTGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 400 

CCACGAGCTC CTCACTGAGG GTCTCCCCGG TGACAACGTT GGTTTCAACG 450 

TCAAGAACGT CTCCGTTAAG GATATCAGAC GTGGTAACGT CTGCGGTGAC 500 

45 TCCAAGAACG ATCCCCCCAA GGCTTGCGCT TCTTTCAACG CCCAGGTCAT 550 

TATCTTCAAC CACCCTGGTC AGATCTCTGC TGGATACTCT CCCGTCCTTG 600 

ATTGCCACAC CGCCCATATT GCTTGCAAGT TCGACACTTT GATCGAGAAG 650 

ATTGACCGTC GTACTGGTAA GAAGACTGAG GACTCCCCCA AGTTCGTCAA 700 

GGCCGGTGAT GCTGCTATCG TCAAGA 726 

50 



2) INFORMATION FOR SEQ ID NO: 438 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

265 



SUBSTITUTE SHEET (RULE 26) 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 
5 (B) STRAIN: G186A5 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438 



CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATCATC AAGGAGGTTT 50 

10 CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 100 

CC A ATCTCTG GTTTCGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 150 

CACATGGTAC^AAGGGCTGGA-ACAAGGAGAC TGCCTCTGGC - AAGTCTTGTG 2-O-0 

GTAAAACCCT TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 250 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 300 

15 TATTGGCACT GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 350 

GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 450 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 500 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

20 GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 750 

GCCC 754 

25 



2) INFORMATION FOR SEQ ID NO: 439 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Issatchenkia orientalis 
40 (B) STRAIN: ATCC 6258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 

45 CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG 100 

GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TCCATGGTAC 150 

AAGGGTTGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 200 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA ACCGAAAAGC 250 

CATTAAGATT ACCATTACAA GATGTTTACA AGATTGGTGG TATTGGTACT 300 

50 GTGCCAGTCG GTAGAGTCGA AACCGGTGTC ATTAAGCCAG GTATGGTTGT 350 

CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 400 

ACCATGAACA ATTAGAACAA GGTGTTCCAG GTGATAACGT TGGTTTCAAC 450 

GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 500 

CTCCAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

55 TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 600 

GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATCGAAAA 650 

GATTGACAGA AGAACTGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 700 

AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 743 

60 

266 
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2) INFORMATION FOR SEQ ID NO: 440 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1091 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 



CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 50 

GGAAGTTCGA GACCCCTAAG TACCACGTTA CCGTCATTGA CGCTCCTGGT 100 

20 CACCGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 150 

CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 200 

CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 250 

GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 300 

GGAGGACCGC TTCAACGAGA TTGTCCGCGA AGTGTCGAAC TTCATCAAGA 350 

25 AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 400 

CACGGTGACA ACATGATCGA GGCCACCACC AACATGCCTT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGT CGGGCAAGGT CACTGGTAAG ACTCTGCTGG 500 

ACGCCATCGA CGCCATCGAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 550 

CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 600 

30 TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 650 

TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTCGGTTGA GATGCACCAC 700 

GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 750 

GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 800 

AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 850 

35 ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 900 

CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 950 

ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 1000 

GGTGACGCTG CCATGGTGGA GATGATCCCC ACCAAGCCCA TGTGTGTGGA 1050 

GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 1091 

40 



2) INFORMATION FOR SEQ ID NO: 441 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 
55 (B) STRAIN: ATCC 42756 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

ACCACCAAGT ACTCGGAGGA CCGCTTCAAC GAGATTATTC GCGAGACTTC 50 
60 GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 100 

267 
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CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC CACCAACATG 150 

CCGTGGTACA AGGGCTGGGA GAAGGAGACC AAGTCGGGCA AGGCCACTGG 200 

TAAGACCCTT CTGGACGCTA TTGACGCCAT TGAGCCGCCG ACGCGCCCGA 250 

CCGACAAGCC TCTCCGTCTT CCTCTCCAGG ATGTGTACAA GATCGGTGGT 3 00 

5 ATCGGTACYG TCCCGGTCGG CCGTGTTGAG ACCGGTGTTA TCAAGCCCGG 350 

TATGGTTGTG ACCTTCGCTC CGTCGAACGT CACSACTGAA GTTAAGTCGG 400 

TCGAGATGCA CCACGAGCAG ATCCCTGAGG GTCTTCCGGG TGACAACGTT 450 

GGTTTCAACG TGAAGAACGT GTCGGTCAAG GACATTCGCC GTGGTAACGT 500 

CGCCTCGGAC TCGAAGAACG ACCCGGCTCA GGAGGCTGCC TCGTTCAATG 550 

10 CTCAGGTCAT TGTGATGAAC CACCCTGGTC AGATCAGCAA CGGTTACTCG 600 

CCRGTGCTCG ACTGCCACAC TGCTCACATT GCCTGCCGCT TCAACAACAT 650 



15 



30 



CCTCCAGAAG ATCGACCGTC~GTTCGGGTAA~GGTTCTYGAA GAGAACCeCA -700- 

AGTTCATCAA GTCGGGTGAC GCTGCCATGG TTGAGATGAT CCCGACCAA 74 9 



2) INFORMATION FOR SEQ ID NO: 442 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 



ACTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

35 TTCGACCTTT TGCCTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 150 

TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 200 

ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 300 

CGGTGTTAGG CAGCTCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 350 

4 0 TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 4 50 

TTTCGAGGGT GACAACATGA TCGAGGCCTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA CAAGGAGACC GCCAGTGGCA AGAGCACCGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGCCAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

45 CCTYCGTCTC CCTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 650 

TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 700 

ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

50 TCCAAGAACG ACCCCGCCAA GGGCTGCGAC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

ATTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTACCGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 1050 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

55 TTGAGGCTTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150 



60 



2) INFORMATION FOR SEQ ID NO: 443 

268 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Metschnikowia pulcherrima 
(B)_ STRAIN: DS M 70336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 



15 GGACAAGTTG AAGGCTGAGA GAGAGAGAGG TATCACCATC GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 200 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 250 

20 GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 300 

GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGGCYTCC ACTAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 500 

25 TGGAGGCCAT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

TTGAGATTGC CTTTGCAGGA TGTCTACAAG ATCGGTGGTA TCGGAACGGT 600 

GCCAGTCGGC CGTGTCGAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 700 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 750 

30 CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 850 

GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG CCTGTAARTT CGACACCTTG TTGGAGAAGA 950 

TTGACAGAAG AACTGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 1000 

35 TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 1050 

TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 



40 2) INFORMATION FOR SEQ ID NO: 444 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 

55 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 
GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 100 
CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 
CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 200 
60 GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 250 

269 
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TGGTATCTCC AAGGATGGCC AGACCCG' 

CCCTCGGTGT TAAGCAGCTC ATCGTCG( 

AAGTGGTCTG AGGCCCGTTT CCAGGAGi 

CATCAAGAAG GTCGGCTACA ACCCCAAGAC 
5 CTGGTTTCCA CGGCGACAAC ATGCTTTCCC 

TACAAGGGCT GGGAGAAGGA GACCAAGGCT 

CCTCCTTGAG GCCATCGACT CCATCGAGCC 

AGCCCCTCCG CCTTCCCCTT CAGGATG"! 

ACAGTCCCTG TCGGCCGTAT CGAGACTC 
10 CGTGACCTTC GCTCCTTCCA ACGTCACCAC 
^TGCACCACGA GCAGCTCTCC GAGGGTGTCC 

AACGTCAAGA - ACGTCTCCGT~CAAGGAGi 

TGACTCCAAG AACGACCCCC CTCTGGGTGC 

TCATCGTCCT CAACCACCCC GGCCAGGTCG 
15 CTCGACTGCC ACACCGCCCA CATTGCC' 

GAAGATCGAC CGCCGTACCG GCAAGTCTGT 

TCAAGTCTGG CGACTCTGCC ATCGTCAi 



TGA 



GCACGCTCTG 


CTCGCCTffCJP 


3(W" 


TCAACAAGAT 


GGACACCACC 


350 


! ATCAAGGAGA 


CCTCCAACTT 


400 


' CGTCGCTTTC 


GTCCCCATCT 

X ^m>> V«t> ^^«X X X 


450 


' CCTCCACCAA 


CTGCCCCTGG 


500 


' GGCAAGTCCA 


CCGGCAAGAC 


550 


' CCCCAAGCGC 


CCCAGCGACA 


600 


' ACAAGATCGG 


CGGTATCGGC 


650 


' GTCATCAAGC 


CCGGCATGGT 


700 


1 CGAAGTCAAG 


TCCGTTGAGA 


750 


' CCGGTGACAA 


CGTCGGCTTC 


800 


'"CGTCGTGGCA- 


-ACGTCGCCGG 


850 


1 CGCTTCTTTC 


GATGCCCAGG 


900 


1 GTGCTGGCTA 


CGCGCCCGTC 


950 


' AAGTTCGCCG 


AGATCAAGGA 

nun x \<nnu\jn 


1000 




V_ \_ \^r\rA\D x x V-/"V 


X v J v 


. TGATTCCCTC 


CAAGCCCATG 


1100 


1 CTGGGCCGCT 


TCGCCGTCCG 


1150 






1153 



20 



2) INFORMATION FOR SEQ ID NO: 445 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNE SS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
35 (B) STRAIN: ATCC 32071 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 



TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

40 CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 300 

45 TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 4 00 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

50 GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 

GCCCATGTGC GTC 763 

55 



2) INFORMATION FOR SEQ ID NO: 446 
60 (i) SEQUENCE CHARACTERISTICS: 

270 
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(A) LENGTH: 1346 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
10 (B) STRAIN: ATCC 64101 

_ (xirSEQUENCE""DESCRTPTIONT-SEQ-I-D-NO :-446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

15 GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 100 

ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 150 

GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 200 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 250 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 300 

20 CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TACTAATGCA 350 

CCATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 400 

AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 450 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 500 

GTTGGGAGAA GGAGACCAAG GCCGGTAAGG TCACTGGTAA GACCCTCCTC 550 

25 GAGGCCATCG ACGCCATTGA GCCCCCTACC CGTCCCGCCA ACAAGGTTAG 600 

TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 700 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 750 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 800 

30 TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 850 

CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 900 

ACAACGTTGG TTTCAACGTG AAGAACGTCT CCGTCAAGGA AATCCGTCGT 950 

GGTAACGTTG CTGGTGACAG CAAGAACGAC CCCCCTGCCG GTGCTGCCTC 1000 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

35 GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100 

GCTGAGCTCC TCGAGAAGAT TGACCGTCGT ACCGGAAAGT CTGTTGAGGA 1150 

CCACCCCAAG TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC 1200 

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 1250 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTCCATCC 1300 

4 0 TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 1346 



2) INFORMATION FOR SEQ ID NO: 447 



45 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 

60 AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 
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GAAATTCGAA ACTCCAAAAT ACCATGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAAAAT ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

GCTATYTTAA TTATTGCCGG TGGTATTGGT GAATTCGAAG CTGGTATCTC 200 

AAAAGATGGT CAAACCAGAG AACACGCTTT NTTAGCTTAC ACCTTAGGTG 250 

5 TTAAACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAAATGGGAT 300 

GAATCTCGTT TCGAAGAAAT TGTCAAGGAA ACYTCAAACT TTATCAAGAA 350 

AGTTGGTTAC AACCCAAAAA CTGTTCCATT CGTTCCAATC TCAGGTTGGA 400 

ATGGTGATAA CATGATTGAA CCATCAWCTA ACTGTCCATG GTACAAAGGT 450 

TGGAAAAAAG AAACCAAAGC TGGTGAAGCT AAAGGTAAAA CTTTATTAGA 500 

10 AGCCATTGAT GCTATTGACC CACCATCAAG ACCAACTGAT AAACCATTAC 550 

GTTTACCATT ACAAGATGTT TACAARATTG GTGGTATTGG TACTGTGCCA 600 



GTCGGTAGAG TTGAAACCGG" TGTTATCAAA~CCAGGTATGG -TTGTTAeGTT— —650- 

TGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCTGTTGAA ATGCATCATG 700 

AACAATTGAC TGAAGGTTTA CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

15 AATGTTTCTG TTAAAGAAAT CCGTCGTGGT AACGTCTGTG GTGACTCTAA 800 

AAACGATCCA CCAAAAGCTG CTGAATCATT CAATGCTCAA GTTATTGTCT 850 

TAAACCATCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT TTTAGATTGT 900 

CACACTGCTC ACATTGCTTG TAAATTTGAC ACTTTAATTG AAAAAATTGA 950 

CAGACGTACT GGTAAGAAAT TAGAAGAAGC TCCAAAATTC ATCAAATCAG 1000 

20 GTGATGCTGC TATTGTTAAA TTTGTTCCAT CAAAACCATT ATCAGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTCGT TTCGCTGTCA GAGA 1094 



25 2) INFORMATION FOR SEQ ID NO: 448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 



CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ATGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGCCGGTGG TATTGGTGAA TTCGAAGCTG 200 

45 GTATCTCAAA AGATGGTCAA ACCAGAGAAC ACGCTTTRTT AGCTTACACC 250 

TTAGGTGTTA AACAATTGAT TGTTGCTATC AACAAGATGG ATTCCGTTAA 300 

ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 350 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 400 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 450 

50 AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 600 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAACCAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAATCT GTTGAAATGC 700 

55 ATCATGAACA ATTGACTGAA GGTTTACCAG GTGACAATGT TGGTTTCAAC 750 

GTTAAGAATG TTTCTGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

CTCTAAAAAC GATCCACCAA AAGCTGCTGA ATCATTCAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGTCACA CTGCTCACAT TGCTTGTAAA TTTGACACTT TAATTGAAAA 950 

60 AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 
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AATCAGGTGA TGCTGCTATT GTTAAATTTG TTC CATC AAA ACCATTATCA ' 1050 
GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA 1100 



2) INFORMATION FOR SEQ ID NO: 449 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1201 bases 
10 (B) TYPE: Nucleic acid 
CC) S.TRANDEDNESS : Doubl e 



15 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTCTTTG 100 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GCACAATTCC AGCCCTCGAT 150 

25 AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 200 

CCCGGCCATC GTGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC 250 

TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 300 

GCATCTCCAA GGATGGCCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 350 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG ACACCAACAA 4 00 

30 CTGGTCCGAG GACCGATACA AGGAGATCAT CAAGGAGACC TCCAACTTCA 450 

TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

GGTTTCCACG GAGACAACAT GCTTACCCCC TCCACCAACT GCCCCTGGTA 550 

CAAGGGTTGG GTCCGTGAGG TCAAGGGTAA CACCCTTACC GGCAAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

35 CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGATCGGTG GTATTGGCAC 700 

TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGGTCG 750 

TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 800 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 850 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

40 ACTCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGACCG CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

45 TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 1200 

A 1201 



50 2) INFORMATION FOR SEQ ID NO: 450 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

273 
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10 



WO 01/23604 

(A) 
(B) 



PCT/CA00/01150 



ORGANISM: Rhizopus oryzae 
STRAIN: ATCC 56015 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 

CAAGCTTAAG GCTGAACGTG AACGTGGTAT CACCATCGAT 
GGAAGTTCGA AACCCCCAAG TACCAAATTA CCGTTATTGA 
CACCGTGATT TCATCAAGAA CATGATTACC GGTACTTCTC 
TGCTATTCTT ATCATTGCTG GTGGTACTGG TGAATTCGAA 
CCAAGGATGG TCAAACCCGT GAACACGCCC TTTTGGCTTT 
GTCCGTCAAT TGATTGTTGC TGTCAACAAG ATGGATACCA 



CGAAGCTCGT TTCAACGAAA" TYGTCAAGGA~AGTTTCTTCC 
AGATTGGTTA CAACCCCAAG TCTGTTCCCT TCGTCCCCAT 
CACGGTGACA ACATGTTGGA AGAATCTACC AACATGCCCT 

15 ATGGAACAAG GAAACCAAGG CTGGTGCCAA- GTCTGGTAAG 
ATGCCATTGA CAACATTGAC CCTCCTACCC GTCCTGTTGA 
CGTCTTCCTC TTCAAGATGT TTACAAGATT GGTGGTATCG 
CGTCGGTCGT GTCGAAACTG GTGTCATCAA GGCTGGTATG 
TCGCTCCTGC TGCTGTCACC ACTGAAGTTA AGTCCGTCGA 

20 GAAACCCTCA CTGAAGGTCT CCCCGGTGAC AACGTCGGTT 
GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 
AGAACGACCC CGCCAAGGAA GCCGGTTCCT TCACCGCTCA 
TTGAACCACC CTGGTCAAAT TGGTGCTGGT TACGCTCCYG 
TCACACTGCT CACATTGCCT GTAAGTTCGC TGAATTGATC 

25 ACAGACGTTC CGGTAAGTCC TTGGAAGCTA CTCCCAAGTT 
GGTGACTCTG CCATCGTCAA GATGATCCCC TCCAAGCCCA 
AGCTTACACT GACTACCCTC CTCTCGGTCG TTTCGCTGTT 



ATTGCTCTCT 50 

TGCTCCCGGT 100 

AAGCCGATTG 150 

GCTGGTATCT 200 

CACTCTCGGT 250 

CCAAGTGGTC 300 

TTCATCAAGA 350 

CTCTGGTTGG 400 

GGTACAAGGG 450 

ACCCTCTTGG 500 

CAAGCCTCTC 550 

GTACTGTCCC 600 

GTTGTCACCT 650 

AATGCACCAC 700 

TCAACGTCAA 750 

TCTGACTCCA 800 

AGTCATTATC 850 

TTTTGGATTG 900 

GAAAAGATTG 950 

CGTCAAGTCT 1000 

TGTGTGTTGA 1050 

CGTGA 1095 



30 



40 



2) INFORMATION FOR SEQ ID NO: 451 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1092 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 

GCTGAAGGCC GAGCGAGAGC GTGGTATCAC CATCGATATC GCTCTATGGA 50 

AGTTCGAGAC CCCCAAGTAC AACGTCACCG TCATTGACGC TCCAGGACAT 100 

CGTGATTTCA TCAAGAGTGA GTTAACCATA ACATCAAACA GTGTTGCAAA 150 

50 CATCAGCTAA TGCATGTTAT GCGTCCAGAC ATGATTACTG GTACTTCCCA 200 

GGCCGATTGC GCTATTCTCA TCATCGCCAC CGGTGTTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACCCGAG AGCACGCCCT TCTCGCCTTC 300 

ACCCTCGGTG TCAGACAGCT CATCGTTGCC TTGAACAAGA TGGACTCGGT 350 

CAAGGTAGGC TAACTTCACA ACGTCGGCTT CCCATCATTC ATTCACTTAC 400 

55 CTGTCTTGTC TTCCACCCTC CAGTTCTCCG AGTCCCGATA CGATGAAATC 450 

GTCAAGGAGA CATCCGGTTT CATCAAGAAG GTCGGATTCG ACCCCAAGGG 500 

TGTTCCCTTC GTCCCCATCT CAGGATGGCA CGGAGACAAC ATGATCGAGG 550 

AGTCCACCAA CATGCCTTGG TACAAGGGAT GGAAGAAGAC CACCAAGACC 600 

GGCGAGTACA AGGGAAAGAC CCTGCTCGAG GCCATCGACT CCATCGAGCC 650 

60 CCCCACCCGT CCTACCGACA AGCCTCTCCG ACTTCCCCTC CAGGATGTCT 700 

274 
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ACAAGATTGG TGGTATCGGA ACAGTGCCAG 
ACCATCAAGG CTGGTATGGT CGTCACCTTC 
CGAAGTCAAG TCTGTTGAGA TGCACCACGA 
CAGGTGACAA CGTCGGATTC AACATCAAGA 
5 CGAAGAGGAA ACGTCTGCGG TGACTCCAAG 
TGCTTCCTTC ACCGCCCAGG TCATTGTCCT 
GTAACGGATA CTCTCCAGTT CTCGATTGCC 
CATATTAGTT TGAACTCTTT TGAACAATAC 
TTCAGCCCAC ATTGCATGCA AGTTCGACAC 
10 GACGATCCGG AAAGTCCATC GAAGATACCC 

GACGCCGCCA TCGTCAAGAT GGTCCCCACC 

TTTCACCGAC TACCCACCTC "TTGGACGATT 



Tprjp a apt 


PP AP APTPPT 


7cr\ 

/ jU 


pptppttpap 


PTPTPZiPPAP 


0 u u 


ppapptppap 


PPTPPTPTTP 


fl c: n 


apptttpa^t 


PA 21 PP A TA TP 


cm n 


a a pp a tp pp p 






PA APPAPPPP 


PPTP AAA TPP 


i nn.fi 

1UUU 


a papTPPTPa 


PTP A »'i "PPM'TP 


XU jU 


TA A PPTPZ\ AT 


PA 1 1" 1 1 ATA PTT 
v-Al 1AJ.AL 1 1 


11UU 


pptpptap ap 


A A PA TTP A PP 




CCAAGTTCGT 


CAAGTCTGGT 


1200 


AAGCCAATGT 


GCGTTGAGGC 


1250 


CGCCGTCCGT" 


GA 


1-29-2 



15 

2) INFORMATION FOR SEQ ID NO: 452 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1289 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



( i i ) MOLECULE TYPE : Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salwonicolor 

(B) STRAIN: ATCC 32311 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 

AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTTCGAG ACCCCCAAGG TGCGTTCTCA CCCCGGCTGA GGAGCACGCA 100 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

35 GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 200 

GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTTCGAGG 250 

CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 300 

ACCCTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 350 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 400 

40 TCATCAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 450 

TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

GTACAAGGGA TGGAAGAAGG AGACCAAGGC CGGTGAGGTC AAGGGCAAGA 550 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

45 CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 700 

GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA 750 

TCAAGGCCGG TATGGTCGTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

50 CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

TTCCGTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 1000 

ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 1050 

CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGCAAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

55 GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 1200 

GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 1250 

CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 1289 



60 
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PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISMySpofothrijf- schenckii 

(B) STRAIN: WSA-148 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

GTGAGCGCGG TATCACCATC GATATTGCTC TGTGGAAGTT CGAGACCCCC 50 

AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG ATTTCATCAA 100 

GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT CTCATCATTG 150 

20 CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACT 200 

CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC AGCTGATCGT 250 

CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT CGTTACCAGG 300 

AGATCATCAA GGAGACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 350 

AAGACTGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG ACAACATGCT 400 

25 TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGCTGGGAG AAGGAGGGCA 450 

AGAGCGGCAA GGTTACCGGT AAGACTCTGC TGGACGCCAT TGACGCCGTC 500 

GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC CCCTCCAGGA 550 

TGTCTACAAG ATCGGCGGTA TCGGCACTGT . CCCTGTCGGC CGTATCGAGA 600 

CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC GTCCAACGTC 650 

30 ACCACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC TTGTTGAGGG 700 

TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC TCCGTCAAGG 750 

AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA CCCCCCCTCG 800 

GGCGCCGCCA CCTTCAACGC CCAGGTCATT GTCCTGAACC ACCCCGGCCA 850 

GGTCGGCAAC GGCTACGCCC CGGTTCTGGA CTGCCACACC GCCCACATTG 900 

35 CCTGCAAGTT CACCGAGATC CTTGAGAAGA TCGACCGCCG TACCGGCAAG 950 

TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTGACG CCGCCATTGT 1000 

CAAGCTGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC ACTGACTACC 1050 

CCCCTCTGGG CCGTTTCGCC 1070 



40 



2) INFORMATION FOR SEQ ID NO: 454 

( i ) SEQUENCE CHARACTERI STI CS : 
45 (A) LENGTH: 1092 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 



CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 50 

ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 100 

60 GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 150 

276 
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ATCCTTATCA TTGCTGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA ' 23d 1 

GGACGGTCAG ACCAGAGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 250 

GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 300 

GATCGTTTCA ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 350 

5 TGGTTACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 400 

GTGACAATAT GATTGAGCCA TCCACCAACT GCCCATGGTA CAAGGGTTGG 450 

GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 

CATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC 550 

TTCCTCTTCA GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

10 GGTCGTGTTG AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 650 

C CCAGC TGGT GTCACCA CTG AAGTCAAGTC TGTCGAGATG CACCACGAAC 700 

AGATCCCAGA AGGTACCCCA~~GGTGACAACG~TTGGTTTCAArCGTCAAGAAC 750 - 

GTCTCCGTCA AGGAAATCAG ACGTGGTAAC GTTACTGGTG ACTCCAAGAA 800 

CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTCA 850 

15 ACCACCCTGG TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 

ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 950 

ACGTTCCGGT AAGAAGGTCG AAGACTCTCC TAAGTTCGTC AAGGCCGGTG 1000 

ATGCCGCCAT TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 1092 

20 



2) INFORMATION FOR SEQ ID NO: 455 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 
35 (B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 

GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTTCGA 50 

40 GACCCCCAAG TACAATGTCA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 200 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 300 

45 TGTCAAGCAG CTCATCGTTG CCATCAACAA GATGGACACC ACCAACTGGT 350 

CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTCACCAA CTTCATCAAG 4 00 

AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

50 CTCGAGGCCA TCGATGCCAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

55 TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 850 

TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 950 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTACCGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

60 GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

277 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
TTGAGGCTTT CACTGACTAC CCCCCACTTG GTCGTTTCGC" CGTCCGTGA' tl%9 



5 2) INFORMATION FOR SEQ ID NO: 456 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 



2) INFORMATION FOR SEQ ID NO: 457 



(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 

60 

278 



TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 

CTCTCTGGAA GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 100 

CCAGGTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

CGACTGCGCC ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 200 

25 GTATCTCCAA GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTT.CACC 250 

CTTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 300 

GTGGGACCAA AAGAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 4 00 

GGTTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 450 

30 CAAGGGATGG ACCAAGGAAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 500 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG . 550 

CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 

TGTGCCAGTC GGCCGTGTCG AAACCGGTAT CATCAAGGCC GGTATGGTCG 650 

TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 700 

35 CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 750 

CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 8 00 

ACTCCAAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 850 

ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 

CGATTGCCAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

40 AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 1000 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGCCAATGTG 1050 

CGTTGAAACC TTCACTGAAT ACCCACCACT TGGTCGTTTC GCCGTCCGTG 1100 

A 1101 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 



PCT/CA00/01150 



GTTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 100 

5 CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATCTTGATC ATTGCCGCCG GTACCGGTGA ATTCGAAGCC GGTATCTCCA 200 

AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 250 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 300 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

10 TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 4 00 

GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 450 



GGAGAAlJGAG - TC~CAAGGCTG~GG 500 - 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATCGGAA CGGTTCCTGT 600 

15 CGGTCGTGTC GAGACCGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 700 

CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

20 AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 950 

GTCGTACCGG AAAGTCCATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 1085 

25 



2) INFORMATION FOR SEQ ID NO: 458 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
4 0 (B) STRAIN: DAL- 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 

TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 50 

45 TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACTCGTGAG 150 

GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 200 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 250 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 300 

50 CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTCCG 350 

TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 4 00 

CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 4 92 

55 

2) INFORMATION FOR SEQ ID NO: 459 

(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 1154 bases 

279 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CAOO/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 



GTCCGTGGTC AAGAAGTTAT TGACACTGGT GCCCCAATTA CCATTCCTGT 50 

TGGTCGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

15 ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 200 

TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 250 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 300 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 350 

20 CGGTGAAAGA ACCCGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 4 00 

CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 450 

CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 500 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 600 

25 CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 650 

CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 700 

TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT GACTGATCCT 750 

GCCCCAGCCA CCACTTTCGC TCACTTGGAC GCCACCACTG TCTTGTCTCG 800 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 

30 AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 900 

GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TCCAAGATAT 950 

CATTGCCATT TTGGGTATGG ATGAATTGTC TGAAGCTGAT AAGCTTACTG 1000 

TCGAACGTGC CCGTAAGATC CAAAGATTCC TTTCCCAACC ATTCGCTGTT 1050 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 1100 

35 CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTACGAT CACCTTCCAG 1150 

AAGC 1154 



40 2) INFORMATION FOR SEQ ID NO: 460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 



CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCTATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 150 

TACCGAAGTC AACGATACCG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 200 

60 GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 250 

280 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

GGTCCAATTG AATGTAAGGA AAAGAAACCA ATTCACGCTG AACCACCATC 300 

ATTCGTTGAA CAATCCACTG CTGCCGARAT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAARAT TGGTTTATTC 4 00 

GGTGGTGCTG GTGTCGGTAA GACCGTCTTT ATCCAAGAAT TGATTAACAA 450 

5 CATTGCCAAA GCCCATGGTG GTTTCTCTGT CTTTACCGGT GTYGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAAGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTCTTCG GTCAAATGAA 600 

CGAACCACCA GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CTGAATACTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTCATTGAT 700 

10 AACATTTTCA GATTCACCCA AGCTGGTTCC GAAGTGTCTG CTTTGTTAGG 750 

TCGTATTCCA TCTGCTGTCG GTTATCAACC AACCTTAGCC ACTGATATGG 800 



GTCTTTTGCA AGAACGTATT~ ACCACCACCA - AGAAAGGTTC - CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCTGATGAT TTGACCGATC CTGCTCCAGC 900 

CACTACATTC GCCCATTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

15 CTGAATTGGG TATCTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000 

TTATTGGACG CTTCTGTTGT TGGTCAAGAA CATTACGATG TCGCTACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAATC CTTACAAGAT ATCATTGCTA 1100 

TTTTGGGTAT GGATGAATTG TCTGAAGCTG ATAAATTGAC TGTCGAAAGA 1150 

GCCCGTAAGA TCCAAAGATT CTTGTCTCAA CCATTCGCTG TTGCTGAAGT 1200 

20 TTTCACTGGT ATCCCAGGTA GATTAGTCAG ATTGCAAGAC ACTGTCAAAT 1250 

CATTCAAGGA TGTTTTGGAA GGTAAATACG ATAACTTGCC AGAAA 1295 



25 2) INFORMATION FOR SEQ ID NO: 461 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 



TAACTTGCCA GCTATTTTGA ATGCTTTGAC TTTGAAGAAC GGTGACCAAG 50 

ATTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC CGTCAGAGCT 100 

ATTGCTATGG ATGGTACTGA AGGTTTAGTC AGAGGTACTG AAGTCAACGA 150 

TACCGGTGCC CCAATCTCCG TTCCAGTCGG TAGAGGTACC TTAGGTAGAA 2 00 

45 TCATCAATGT TGTTGGTGAA CCAATTGATG ACAGAGGTCC AATTGAATGT 250 

AAGGAAAAGA AACCAATTCA TGCAGAACCA CCATCCTTCG TTGAGCAATC 300 

CACTGCTGCC GAAATTTTGG AAACCGGTAT CAAGGTTGTC GACTTATTGG 350 

CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG TGCTGGTGTC 4 00 

GGTAAGACCG TCTTTATCCA AGAATTGATT AACAACATTG CTAAAGCCCA 450 

50 TGGTGGTTTC TCCGTCTTTA CCGGTGTCGG TGAAAGAACC AGAGAAGGTA 500 

ACGATTTGTA CCGTGAAATG AAAGAAACCG GTGTCATCAA CTTAGAAGGT 550 

GACTCCAAGG TCGCTTTGGT CTTTGGACAA ATGAACGAAC CACCAGGTGC 600 

TAGAGCTAGA GTTGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCAGAG 650 

ATGAAGAAGG TCAAGATGTC TTGTTGTTCA TCGATAACAT TTTCAGATTC 700 

55 ACCCAAGCTG GTTCCGAAGT GTCTGCTTTG TTAGGTCGTA TTCCATCTGC 750 

CGTCGGTTAT CAACCAACCT TAGCTACTGA TATGGGTCTT TTGCAAGAAC 800 

GTATTACCAC CACCAAGAAA GGTTCCGTCA CCTCTGTCCA AGCTGTCTAT 850 

GTCCCAGCTG ATGATTTGAC CGATCCTGCT CCAGCCACCA CATTCGCCCA 900 

TTTGGATGCC ACTACTGTCT TGTCTAGAGG TATTTCTGAA TTGGGTATTT 950 

60 ACCCAGCTGT CGATCCATTG GATTCCAAAT CCAGATTATT GGACGCTGCC 1000 

281 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
GTTGTTGGTC AAGAACATTA TGATGTCGCT 
GCAAGCTTAC AAATCCTTAC AAGATATCAT 
AATTGTCTGA AGCTGATAAA TTGACTGTCG 
AGATTCTTGT CTCAACCATT CGCCGTTGCT 
5 AGGTAGATTA GTCAGATTGC AAGACACTGT 
TGGAAGGTAA ATACGATCAC TTGCCAG 



PCT/CAOO/01150 

ACTGGTGTTC AACAAACT^f 1050 
TGCTATTTTG GGTATGGATG 1100 
AAAGAGCCCG TAAGATTCAA 1150 
GAAGTTTTCA CTGGTATTCC 1200 
CAAATCATTC AAGGATGTTT 1250 

1277 



10 2) INFORMATION FOR SEQ ID NO: 462 



25 



(l) SEQUENCE CHARACTERISTICS": ~ 

(A) LENGTH: 1278 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double- 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



GTAACTTGCC AGCTATTTTG AACGCTTTGA CCTTGAAGAA CGGTGAAAAC 50 

GACTTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 

TATTGCTATG GATGGTACTG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 

ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 200 

30 ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 250 

CAAGGAAACC AGACCAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 300 

CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 350 

GCCCCTTACG CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGCCGGTGT 400 

CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 450 

35 ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 

AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 550 

TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 

CTAGAGCTAG AGTTGCTTTA ACCGGTTTAA CCATTGCCGA ATACTTCAGA 650 

GACGAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 700 

40 CACCCAAGCC GGTTCCGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 750 

CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 800 

AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 850 

CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 

ACTTGGATGC TACCACTGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

45 TACCCAGCTG TCGATCCATT GGATTCCAAA TCCAGATTGT TAGATGCTGA 1000 

TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTCTG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC ACCGGTATCC 1200 

50 CAGGTAGATT AGTTAGATTG CAAGACACTG TTAAATCTTT CAAGGAAGTC 1250 

TTAGAAGGTA AATATGATCA CTTACCAG 1278 



55 2) INFORMATION FOR SEQ ID NO: 463 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

282 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 

10 

TCTGGTC AGA GGCGAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 50 

CTGTCGGCAG~AG^ACCCTG _ GGCAGAATCA"~TCAACGTTAT^CGGTGAACCT 3:00 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 200 

15 CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 250 

ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 300 

ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTTCACAG 350 

GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 4 00 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 4 50 

20 CGGCCAAATG AACGAACCAC CAGGAGCCAG AGCCAGAGTC GCCTTGACCG 500 

GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 550 

CTGTTCGTCG ACAACATTTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

CCACCGATAT GGGTCTGTTG CAAGAAAGAA TTACCACCAC AAAGAAGGGT 700 

2 5 TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 750 

TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTGCTAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 900 

CGTCGCCACA AAGGTCCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

3 0 ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 1000 

ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 1050 

TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 1100 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 1150 

CCAG 1154 

35 



2) INFORMATION FOR SEQ ID NO: 464 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
50 (B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 

CCACTACGAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 50 

55 AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

TGCCAGCGTC ACTGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 200 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 2 50 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 300 

60 GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 350 

283 



40 



45 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TTGTCGACTT GTTGGCTCCA TACGCCAGAG GTGGTAAGAT TGGATTGTTCT "400 

GGTGGTGCCG GTGTCGGTAA GACTGTGTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCTCACGGTG GTTTCTCCGT GTTCACCGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTC 550 

5 ATCAACTTGG AAGGTGAATC CAAGGTGGCC TTGGTGTTCG GTCAAATGAA 600 

CGAACCTCCA GGAGCTAGAG CCAGAGTTGC CCTTACCGGT TTGACCATCG 650 

CTGAATACTT CAGAGATGAG GAGGGTCAAG ATGTGTTGTT GTTCGTCGAC 700 

AACATTTTCA GATTCACTCA AGCTGGTTCT GAAGTGTCGG CTTTGTTGGG 750 

TCGTATTCCT TCGGCTGTCG GTTACCAACC TACTTTGGCC ACCGATATGG 800 

10 GTTTGTTGCA AGAGCGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTGCC AGCCGATGAT TTGACCGATC CTGCTCCTGC 900 



TACTACTTTT"GCTCACTTGG"ATGCTACCAC TGTGTTGTCT~AGAGGTATCT 950- 

CCGAGTTGGG TATTTACCCA GCTGTCGATC CTTTGGATTC CAAGTCGAGA 1000 

TTGTTGGATG CCTCTGTTGT CGGTGAGGAG CACTACTCGG TTGCTTCTAA 1050 

15 CGTTCAACAA ACCTTGCAAG CTTACAAGTC < TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGACGAATTG TCGGAAGCTG ACAAGTTGAC CGTCGAGAGA 1150 

GCCCGTAAGA TCGAGAGATT CTTGTCTCAA CCATTTGCTG TTGCCGAAGT 1200 

TTTCACTGGT ATCAGTGGTA AGTTGGTCAG ATTGGAGGAC ACTATCAGAT 1250 

CTTTCAAGGA AGTCTTGGAA GGTAAGTACG ATCACTTGCC AGA 1293 

20 



2) INFORMATION FOR SEQ ID NO: 465 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 
35 (B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 

CACTTTGACG ATGGTAACTT . GCCAGCCATT TTCAACGCCT TGAAGTTGAA 50 

40 GAACGGTGAC CAGGACTTGG TCTTGGAGGT CGCCCAGCAC TTGGGTGAGA 100 

ACACCGTCAG AACCATTGCC ATGGACGGTA CCGATGGTTT GGTCAGAGGC 150 

GAGGCTGTCA CTGACACTGG TGCTCCAATC TCCGTGCCTG TTGGTCGTGA 200 

GACTTTGGGT CGTATCATCA ACGTTATTGG TGAGCCAATT GACGAGAGAG 250 

GACCAATCAA GTCCAAGAAG AGAAACCCAA TCCACACTGA CCCACCAACC 300 

45 TTCGTTGAGC AGTCTACTTC TGCTGAGGTT TTGGAGACTG GTATTAAGGT 350 

TGTCGACTTG TTGGCCCCTT ACGCCAGAGG TGGTAAGATT GGTTTGTTCG 400 

GTGGTGCCGG TGTCGGTAAG ACCGTCTTTA TCCAAGAGTT GATTAACAAC 450 

ATTGCCAAGG CCCACGGTGG TTTCTCTGTC TTTACCGGTG TCGGTGAGAG 500 

AACCAGAGAA GGTAACGATT TGTACCGTGA AATGCAGGAG ACCGGTGTCA 550 

50 TCAACTTCGA GGGTGACTCC AAGGTCGCCT TGGTCTTCGG TCAGATGAAC 600 

GAGCCACCAG GAGCTAGAGC TAGAGTTGCT TTGACCGGTT TGACCATTGC 650 

CGAATACTTC AGAGATGAAG AAGGTCAGGA TGTGTTGTTG TTCGTTGACA 700 

ACATTTTCAG ATTCACTCAG GCTGGTTCCG AGGTGTCCGC CTTGTTGGGT 750 

CGTATTCCAT CTGCTGTCGG TTACCAGCCA ACCTTGGCCA CCGATATGGG 800 

55 TACCTTGCAA GAAAGAATTA CCACCACCAA GAAGGGTTCC GTCACCTCTG 850 

TCCAGGCCGT TTACGTGCCA GCTGATGATT TGACCGATCC TGCCCCAGCT 900 

ACCACTTTCG CTCACTTGGA TGCTACCACT GTGTTGTCTA GAGGTATTTC 950 

CGAGTTGGGT ATCTACCCAG CTGTCGACCC ATTGGACTCC AAGTCTAGAT 1000 

TGTTGGATGC CTCTGTTGTC GGTAAGGAGC ACTACGATGT TGCTACCAAC 1050 

60 GTCCAGCAGA CCTTGCAGGC CTACAAGTCC TTGCAGGATA TCATTGCCAT 1100 

284 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TTTGGGTATG GATGAATTGT CCGAAGCCGA 
CTAGAAAGAT TCAGAGATTC TTGTCCCAGC 
TTCACTGGTA TCGAGGGTAA GTTGGTCAGA 
TTTCAAGGAG GTCTTGGATG GTAAGTACGA 



PCT/CAOO/01150 

CAAGTTGACT GTCGAGAGAG 1150* 

CATTCGCCGT CGCCGAGGTT 1200 

TTGGAGGAGA CCGTCAAGTC 1250 

CCACTTGCCA GAG 1293 



2) INFORMATION FOR SEQ ID NO: 466 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1111 bases 



15 



(B) TYPET 'Nuclei"c — efc'id 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 
20 (B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 

AACGGTGGAT CTAAGTTAGT CTTAGAAGTT GCTCAACATT TGGGTGAAAA 50 

25 CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100 

AACCAGTTAA TGATACTGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 150 

ACTTTAGGTA GAATCTTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 200 

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TCATCAAGAT CCTCCTGCTT 250 

TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 300 

30 GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 350 

TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 4 00 

TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TCACTGGTGT CGGTGAAAGA 450 

ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA CTGGTGTTAT 500 

TAACTTAGAA GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 550 

35 AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 650 

CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 700 

GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 750 

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 800 

40 CCAAGCAGTT TATGTCCCAG CAGATGATTT AACTGATCCT GCTCCAGCAA 850 

CTACTTTCGC CCACTTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 900 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCTAGATT 950 

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 1000 

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050 

45 TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 1100 

AAGAAAGATT C 1111 



50 2) INFORMATION FOR SEQ ID NO: 4 67 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

285 



SUBSTITUTE SHEET (RULE 26) 



WO 01/2J604 
(A) 
(B) 



PCT/CA00/01150 



ORGANISM: Candida kefyr 
STRAIN: ATCC 28838 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 

5 

TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA 
CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG 
CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC 
AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG 

10 TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG 
AATCAAGTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT 
TTGAACAATC-CACTGCTGCT-GAAGTTTTGG--AAACCGGTAT- 
GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT 
TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT 

15 CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA. CCGGTGTCGG 
AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG 
CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA 
CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC 
TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA 

20 TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG 
TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA 
TTGCAAGAAA GAATTACCAC TACCAAGAAG GGTTCCGTTA 
AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT 
CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG 

25 TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTCCAAGT 
GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT 
AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT 
GGTATGGATG AATTGTCTGA ACAAGACAAG TTGACTGTCG 
AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC 

30 CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT 
AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 



GATTGAWACT 50 

GTGAAAACAC 100 

CGTGGTGAGA 150 

TAGAGAAACT 200 

AAAGAGGCCC 250 

CCATCCTTTG 300 

CAAGGTTGTC 350 

TGTTCGGTGG 4 00 

AACAACATTG 450 

TGAAAGAACC 500 

GTGTCATCAA 550 

ATGAACGAAC 600 

TATCGCTGAA 650 

TCGACAACAT 700 

TTGGGTCGTA 750 

TATGGGTTTG 800 

CCTCCGTCCA 850 

CCAGCTACYA 900 

TATCTCCGAA 950 

CTAGATTGTT 1000 

ACTCAAGTTC 1050 

TGCCATTTTG 1100 

AAAGAGCCAG 1150 

GAAGTTTTCA 1200 

CGCTTCCTTC 1250 
1283 



35 2) INFORMATION FOR SEQ ID NO: 468 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1287 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 



CTTCGAACAA GGCCAATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 50 

ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 100 

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 150 

AACCGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 200 

55 CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 250 

CCAGTTGACT GTAAGGAAAG AAAGCCAATT CACGCTGATC CTCCAGCTTT 300 

CGTTGAACAA TCCACTGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 350 

TCGATTTATT AGCACCTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 400 

GGTGCTGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 450 

60 YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500 

286 
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CCAGAGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550 
AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 600 
ACCACCAGGA GCTAGAGCAA GAGTTGCTTT AACTGGTTTG ACCATTGCAG 650 
AATATTTCAG AGATGAAGAA GGTCAAGATG TCTTGTTATT CATTGATAAC 700 
5 ATTTTCAGAT TCACCCAAGC AGGTTCTGAA GTCTCTGCAT TATTAGGTAG 750 
AATTCCATCT GCTGTCGGTT ATCAACCAAC TTTAGCAACC GATATGGGTC 800 
TTTTACAAGA AAGAATTACC ACCACCAAGA AGGGTTCCGT TACTTCTGTC 850 
CAAGCTGTTT ATGTCCCAGC CGATGATTTA ACCGATCCTG CTCCAGCTAC 900 
TACTTTCGCC CACTTGGATG CAACCACTGT CTTGTCYAGA GGTATTTCCG 950 

10 AATTAGGTAT CTACCCAGCT GTCGATCCAT TAGATTCTAA GTCTAGATTA 1000 

TTAGATGTTG CAGTTGTTGG TCAAGAACAT TATGAAGTTG CAACTCAAGT 1050 



CCAAGAAACT" "TTACASGCTT "ACAAGTCTTT~ACAAGftTATT-ATTGeTATTT HO 0 

TGGGTATGGA TGAATTATCT GAACAAGATA AGTTAACYGT TGAAAGAGCA 1150 
AGAAAGATCC AAAGATTCTT ATCTCAACCA TTCTCTGTTG CAGAAGTTTT 1200 
15 CACTGGTATT CCAGGTAAGT TAGTCAGATT AGAAGAAACC ATCAAGTCTT 1250 
TCAGGGATGT TCTTGCAGGT AAGTACGATC ACTTACC 1287 



20 2) INFORMATION FOR SEQ ID NO: 469 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida Iambic a 

(B) STRAIN: ATCC 24750 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469 



TCGAACAAGG TCAATTACCA CCAATTCTTA ACGCTTTGGT CATGGAAAAC 50 

GATGGTCAAA AGTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTAGTT AGAGGTCAAC 150 

CTGTTAACGA CACTGGTGCT CCAATCTCTG TCCCAGTTGG TAGAGGTACT 200 

40 CTTGGTAGAA TCTTGAACGT CACTGGTGAC CCAGTYGATG AAAGAGGTCC 250 

TGTCGAMTGT AAGGAGAGAA GACCAATTCA CCAAGACCCA CCTGCTTTCG 300 

TTGACCAATC CACTGAAGCT GAAGTTTTGG AAACCGGTAT TAAGGTTGTC 350 

GATTTATTAG CACCTTACGC TAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCTGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAATGTTG 4 50 

45 CTAAGGCCCA CGGTGGTTTC TCCGTTTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTATA CAGAGAAATG AAGGAAACCG GTGTTATTAA 550 

CCTTGAAGGT GAATCTAAGG TCGCCCTCGT TTTCGGTCAA ATGAACGAAC 600 

CACCAGGAGC AAGAGCTAGA GTTGCTCTTA CTGGTTTGAC CATTGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGCTTTTCA TTGACAACAT 700 

50 TTTCAGATTC ACCCAAGCAG GTTCCGAAGT GTCTGCTTTG TTAGGTAGAA 750 

TTCCWTCTGC CGTTGGTTAC CAACCAACCT TAGCAACCGA TATGGGTTTG 800 

TTACAAGAAA GAATTACCAC CACCAAGAAG GGTTCCGTCA CCTCCGTCCA 850 

AGCTGTTTAC GTCCCAGCTG ATGATTTRAC TGACCCTGCA CCAGCAACCA 900 

CTTTCGCCCA CTTGGATGCT ACCACCGTCT TGTCTAGAGG TATTTCCGAA 950 

55 TTAGGTATTT ACCCAGCTGT CGATCCATTA GATTCTAAGT CTAGATTATT 1000 

GGATGTCGCT GTTGTCGGTC AAGAACATTA TACCGTTGCA ACYCAAGTCC 1050 

AAGAAACTTT ACAAGCTTAC AAGTCCTTAC AAGATATCAT TGCTATTTTG 1100 

GGTATGGACG AATTATCTGA ACAAGATAAG CTTACTGTTG AAAGAGCAAG 1150 

AAAGATCCAA AGATTCCTTT CCCAACCTTT CTCCGTCGCA GAAGTTTTCA 1200 

60 CTGGTATCCC AGGTAAGCTT GTCAGATTAG AAGAAACCAT TATTTCTTTC 1250 

287 
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AGAGATGTCC TCGATGGTAA GTACGACCAC TTA 



PCT/CAOO/01150 
1283 



5 2) INFORMATION FOR SEQ ID NO: 470 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1140 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 



20 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470 



AGGTGCCTCT GTCACTGACA CTGGTTCTCC AATCTCTGTC CCTGTTGGTC 50 

GTGAAACCTT GGGTAGAATT ATCAACGTTG TTGGTGAGCC AATTGACGAG 100 

AGAGGCCCAA TCAACTCCAA GAAGAGAAAC CCAATTCACA CTGAGCCACC 150 

ATCGTTTGTT GAACAATCCA CTTCTGCTGA AGTTTTGGAG ACTGGTATCA 2 00 

25 AGGTTGTCGA CTTGTTGGCC CCTTACGCCA GAGGTGGTAA GATTGGTTTG 250 

TTCGGTGGTG CCGGTGTCGG TAAGACCGTT TTCATCCAAG AGTTGATTAA 300 

CAACATTGCC AAGGCCCACG GTGGTTTCTC TGTTTTCACT " GGTGTCGGTG 350 

AAAGAACCAG AGAAGGTAAC GATTTGTACC GTGAAATGCA AGAGACCGGT 4 00 

GTCATCAACT TCGAGGGTGA CTCCAAGGTC GCCTTGGTCT TCGGTCAAAT 450 

30 GAACGAACCA CCAGGAGCCA GAGCTAGAGT TGCTTTGACC GGTTTGACTA 500 

TTGCCGAGTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 550 

GACAACATTT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 600 

GGGTCGTATT CCATCCGCTG TCGGTTACCA ACCAACCTTG GCCACCGATA 650 

TGGGTGCTTT GCAAGAGAGA ATTACCACCA CCAAGAAGGG TTCCGTCACC 700 

35 TCTGTCCAAG CCGTTTATGT TCCAGCTGAT GACTTGACTG ACCCTGCTCC 750 

AGCCACCACC TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 800 

TCTCTGAATT GGGTATCTAC CCAGCTGTCG ACCCATTGGA CTCCAAGTCT 850 

AGATTGTTGG ACGCTTCTAT TGTTGGTAAG GAGCACTACG AAGTTGCTTC 900 

TAACGTTCAA CAAACTTTGC AAGCTTACAA GTCTTTGCAA GATATCATTG 950 

4 0 CCATTTTGGG TATGGATGAA TTGTCTGAGG CTGACAAGTT GACCGTTGAG 1000 

AGAGCCAGAA AGATCCAAAG ATTCTTGTCT CAACCATTCG CTGTTGCCGA 1050 

GGTTTTCACT GGTATCCCAG GTAGATTGGT CAGATTGGAG GACACTGTCA 1100 

GATCCTTCAA GGAAGTTTTG GACGGTAAGT ACGACCACTT 114 0 



2) INFORMATION FOR SEQ ID NO: 471 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 12 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 



288 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471 



PCT/CAOO/01150 



TCAATTCGAA CAAGGTCAAT TACCACCAAT TTTAAACGCT TTAGTCATGG 50 

ACAACGGTGG AACTAAGTTA GTTTTAGAAG TTGCTCAACA TTTAGGTGAA 100 

5 AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TCAACCAGTT AATGATACTG GTGCTCCAAT CTCTGTCCCA GTCGGTAGAG 200 

GTACTTTAGG TAGAATCTTA AACGTTATTG GTGATCCAGT CGATGAAAGA 250 

GGTCCAATCG ATTGTAAGGA AAGAAAACCA ATTCATCAAG ATCCTCCTGC 300 

TTTCGTTGAA CAATCAACTG AAGCTGAAGT TTTAGAAACT GGTATTAAAG 350 

10 TTGTCGATTT ATTAGCTCCT TACGCTAGAG GTGGTAAGAT TGGTTTATTC 400 

GGT GGT GCAG G TGTTGGTAA AACCGTTTTT ATTCAAGAAT TAATTAACAA 450 

TGTTGCAAAA" GCTCATGGTG GTTTCTCCGT"TTTCACTGGT~GTCGGTGAAA- 500 

GAACTAGAGA AGGTAACGAT TTATACAGAG AAATGAAGGA AACTGGTGTT 550 

ATTAATTTAG AAGGTGATTC TAAGGTCGCA TTAGTTTTCG GTCAAATGAA 600 

15 CGAACCACCT GGAGCAAGAG CAAGAGTTGC TTTAACTGGT TTAACTATTG 650 

CTGAATATTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT ATTCATTGAT 700 

AACATTTTCA GATTTACTCA AGCAGGTTCT GAAGTTTCCG CTTTGTTAGG 750 

TAGAATTCCA TCCGCTGTCG GTTATCAACC AACTTTAGCA ACTGATATGG 800 

GTCTTTTACA AGAAAGAATT ACTACTACTA AGAAAGGTTC TGTTACTTCC 850 

2 0 GTTCAAGCAG TTTATGTCCC AGCAGATGAT TTAACTGATC CTGCTCCAGC 900 

AACTACTTTC GCCCACTTAG ATGCAACTAC TGTCTTATCT AGAGGTATTT 950 

CCGAATTAGG TATTTACCCA GCTGTCGATC CATTAGATTC TAAATCTAGA 1000 

TTATTAGATG TTGCAGTTGT TGGTCAAGAA CATTATGATG TTGCAACTCA 1050 

AGTTCAAGAA ACTTTACAAG CTTACAAATC TTTACAAGAT ATTATTGCTA 1100 

25 TTTTAGGTAT GGATGAATTA TCTGAACAAG ATAAATTAAC CGTTGAAAGA 1150 

GCTAGAAAAA TTCAAAGATT CTTATCTCAA CCTTTCTCTG TCGCAGAAGT 1200 

TTTCACTGGT ATCCCAGGTA AGCTTGTTAG ATTAGAAGAA ACTATTTCTT 1250 

CATTCAGAGA TGTCTTAGCA GGTAAGTACG ATCACTTACC AGAAAA 1296 

30 

2) INFORMATION FOR SEQ ID NO: 472 



(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



4 0 (ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 



ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50 

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100 

50 AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TACTTCTGTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 200 

GTACTTTGGG TAGAATCATC AATGTTACTG GTGACCCAAT TGATGAAAGA 250 

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 300 

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 350 

55 TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGTAA AACCGTGTTT ATCCAAGAAT TGATCAACAA 450 

CATTGCCAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTT 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600 

60 CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

289 
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CTGAATACTT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC " TOO 

AACATTTTCA GATTCACCCA AGCTGGTTCA GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCCGCTGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACTACCA AAAAGGGTTC AGTTACTTCA 850 

5 GTCCAAGCCG TTTACGTGCC AGCTGATGAT TTAACCGATC CTGCTCCAGC 900 

TACCACTTTC GCTCACTTGG ATGCCACCAC TGTGTTGTCT AGAGGTATTT 950 

CGGAGTTGGG TATTTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000 

TTGTTGGATG CTGCCGTTGT TGGTCAAGAA CATTACGACG TTGCCACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAGTC CTTGCAAGAT ATCATTGCTA 1100 

10 TCTTGGGTAT GGATGAATTG TCCGAACAAG ATAAATTGAC TGTCGAAAGA 1150 

GCT C GTAAG A TTCA AAGATT CTTGTCTCAA CCATTTGCTG TCGCTGAAGT 1200 

TTTCACTGGT ATTCCAGGTA AATTGGTTAG" ATTGTCTGAA - ACTGTCAAGT 1-250- 

CATTCAAGGA AGTCTTGGAA GGTAAGTACG ATAACTTGCC AGAAAAT 12 97 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 473 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1285 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 



CCAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 50 

GTGAAGACGG TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 100 

35 AACACCGTCC GTACCATTGC TATGGACGGT ACCGAGGGTT TGGTGCGTGG 150 

CACTGGTGTC CACGACACCG GACACCCCAT CATGACTCCC GTCGGTGACG 200 

GTACCCTGGG ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 250 

GGTCCCGTCA AGACTGACAA GTTCCGCCCC ATCCACGCCG AGGCCCCTGC 300 

CTTCGATGAG CAGGCTACCA GTGCCGAGGT TCTTGAGACC GGTATCAAGG 350 

40 TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGCAA GACCGTCTTC ATCCAGGAGC TGATTAACAA 4 50 

CATCGCCAAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 500 

GTACTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 

ATCAACCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTTCG GTCAGATGAA 600 

45 CGAGCCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 650 

CTGAGTATTT CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC 700 

AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 750 

TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 800 

GTGCCCTTCA GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 850 

50 GTCCAGGCCG TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 900 

CACCACCTTC GCCCATTTGG ATGCCACCAC TGTCTTGTCT CGTGCCATCT 950 

CTGAGTTGGG TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 1000 

CTTCTTGACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 1050 

CGTTCAGCAG ACTTTGCAGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

55 TTCTTGGTAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 1150 

GCTCGTAAGA TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 12 00 

CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 1250 

CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 12 85 



290 
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WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 474 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



"( viTORTGINAU "SOURCE : 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 



TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGCTTTGGA AATCGACACT 50 

CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100 

20 TGTCAGAACC ATTGCTATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 150 

ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCCCAGTTGG TAGAGAAACC 200 

TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 250 

AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 300 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 350 

25 GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 450 

CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 550 

CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600 

3 0 CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 700 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACTAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 850 

35 AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 900 

CTTTCGCGCA TTTGGATGCC ACCACTGTGT TGTCCAGAGG TATCTCTGAA 950 

TTGGGTATCT ACCCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 1000 

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

40 GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAGAGCTAG 1150 

AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 1200 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 1250 

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 1283 

45 



2) INFORMATION FOR SEQ ID NO: 475 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 12 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

60 

291 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475 



PCT/CA00/01150 



CGACGAAGGT AACTTGCCAG CTATTTTGAA TGCTTTGACT TTGAAGAACG 50 

GTGACCAAGA CTTGGTTTTG GAAGTTGCTC AACATTTGGG TGAAAACACC 100 

5 GTCAGGGCTA TTGCTATGGA TGGTACCGAA GGTTTAGTCA GAGGTACTGC 150 

TGTCACTGAT ACCGGTGCTC CAATTTCTGT CCCAGTTGGT AGAGGTACCT 200 

TGGGTAGAAT TATCAACGTT GTTGGTGAAC CAATTGATGA CAGAGGTCCA 250 

ATTGAATGTA AGGAAAGAAA GCCAATTCAC GCTGAACCAC CTTCATTCGT 300 

TGAACAATCT ACTGCTGCCG AAATTTTGGA AACCGGTATT AAGGTTGTCG 350 

10 ATTTGTTGGC TCCTTACGCT AGAGGTGGTA AGATTGGTTT GTTCGGTGGT 400 

GCTGGTGTCG GTAAAACCGT CTTCATCCAA GAATTGATTA ACAACATTGC 450 

CAAAGCTCAT "GGTGGTTTCT~CTGTCTTTAC- CGGTGTCGGT GAAAGAAGCA— —500 

GAGAAGGTAA CGATTTGTAC CGTGAAATGA AAGAAACTGG TGTCATCAAC 550 

TTGGAAGGTG ACTCCAAGGT CGCCTTGGTC TTCGGTCAAA TGAACGAACC 600 

15 ACCTGGTGCT AGAGCTAGAG TTGCTTTGAG TGGTTTGACC ATTGCTGAAT 650 

ACTTCAGAGA TGAAGAAGGT CAAGATGTCT TGTTGTTCAT TGACAACATT 700 

TTCAGATTCA CCCAAGCTGG TTCCGAAGTG TCTGCTTTGT TGGGTCGTAT 750 

TCCATCTGCT GTCGGTTATC AACCAACCTT GGCCACTGAT ATGGGTCTTT 800 

TGCAAGAACG TATTACCACC ACCAAGAAAG GTTCTGTTAC TTCTGTCCAA 850 

20 GCTGTTTATG TCCCAGCCGA TGATTTAACC GATCCAGCTC CAGCTACCAC 900 

CTTCGCTCAC TTGGATGCCA CTACTGTCTT GTCTAGAGGT ATTTCTGAAT 950 

TGGGTATTTA CCCAGCTGTC GATCCATTGG ATTCTAAATC CAGATTATTG 1000 

GATGCTACTG TTGTTGGTCA AGAACATTAT GATGTTGCCA CTGGTGTTCA 1050 

ACAAACTTTA CAAGCTTACA AGTCCTTGCA AGATATCATT GCTATTTTGG 1100 

25 GTATGGATGA ATTGTCCGAA GCCGATAAAT TGACTGTCGA AAGAGCTCGT 1150 

AAGATTCAAA GATTCTTGTC CCAACCATTC GCTGTTGCCG AAGTTTTCAC 1200 

TGGTATCCCA GGTAGATTGG TTAGATTACA AGATACTGTT AAATCTTTCA 1250 

AAGATGTCTT GGAAGGTAAA TACGATCACT TGCCAGAAAA 1290 

30 

2) INFORMATION FOR SEQ ID NO: 476 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN : Csp 388 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 



TTTGAACGCC CTTGAGATCA AGAACCACGA GTCTGGAAGA TTGGTCCTCG 50 

AGGTTGCCCA GCACTTGGGT GAGAACACTG TCAGAACTAT TGCCATGGAT 100 

50 GGTACCGAAG GTCTCGTCCG TGGTGAGTCT GTTGTTGACA CCGGTTCTCC 150 

AATTACTGTC CCAGTTGGTC GTGAGACCTT GGGTCGTATC ATCAACGTTA 200 

TTGGTGAGCC AATTGATGAG CGTGGTCCAA TCAACACCAA GCACAGAAAC 250 

CCAATTCACG CTGAGCCACC ATCCTTCGCT GAGCAGTCTA CTGCTGCTGA 3 00 

GGTTTTGGAG ACTGGTATCA AGGTTGTCGA CCTTCTCGCC CCATACGCCA 350 

55 GAGGTGGTAA GATTGGTCTC TTTGGTGGTG CCGGTGTCGG TAAGACCGTT 400 

TTCATCCAGG AGTTGATTAA CAACATTGCC AAGGCCCACG GTGGTTTCTC 450 

TGTTTTCACC GGTGTCGGTG AGAGAACCAG AGAGGGTAAC GATTTGTACC 500 

GTGAGATGAA GGAGACTGGT GTCATCAATC TCGAGGGTGA GTCCAAGGTT 550 

GCTCTCGTCT TCGGTCAGAT GAACGAGCCA CCAGGTGCCC GTGCCCGTGT 600 

60 CGCTTTGACC GGTTTGACCA TTGCTGAGTA CTTCAGAGAT GAGGAGGGTC 650 
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AGGATGTCTT GTTGTTTATT GACAACATTT TCAGATTCAC CCAGGCCGGT" 700 

TCCGAGGTGT CTGCCTTGCT TGGTCGTATT CCATCCGCTG TCGGTTACCA 750 

GCCAACTTTG GCCACCGATA TGGGTTTGTT GCAGGAGAGA ATTACCACCA 800 

CCCAGAAGGG TTCCGTCACT TCTGTCCAGG CCGTTTACGT CCCAGCTGAT 850 

5 GATTTGACTG ATCCTGCTCC AGCCACCACT TTCGCCCACT TGGACGCCAC 900 

CACTGTGTTG TCCCGTGGTA TCTCTGAGTT GGGTATTTAC CCAGCTGTCG 950 

ACCCATTGGA CTCCAAGTCC AGATTGTTGG ACGCTGCCGT TGTTGGTGAC 1000 

GAGCACTACA ACACCGCCAC CGATGTCCAG CAGACCCTTC AGGCTTACAA 1050 

GTCTCTCCAG GATATCATTG CTATTTTGGG TATGGATGAG TTGTCTGAGG 1100 

10 CTGACAAGTT GACTGTCGAG AGAGCCAGAA AGATTCAGCG TTTCCTTTCC 1150 

CAGCCATTCG CTGTCGCTGA GGTTTTCACC GGTATCCCAG GTAGATTGGT 1200 

TAGACTTCAG~GACACCATCA~AGTCCTTCAG-AGAGGTTTTG-GAeGGTAAGT 1-250- 

ACGACCACTT GCCAGAG 1267 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 477 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1296 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 



CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 50 
AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCCCAACA CTTGGGTGAA . 100 

35 AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 150 

TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 200 

GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 250 

GGTCCAATTG AATGTAAGGA AAGAAAGCCA ATTCACGCTG AACCACCTTC 300 

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 350 

40 TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 600 

45 CGAACCACCT GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CCGAATACTT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

50 GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 900 

CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 1000 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

55 TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TCCAAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 1200 

TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 1296 
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2) INFORMATION FOR SEQ ID NO: 478 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



45 



60 



(vi) OR I G I NAL "SOURCES 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 



TCAATTCGAG CAAGGCAACC TCCCTGCCAT CTTGAACGCT CTCACCTTGA 50 

AGAATGGTGA CAACGACTTG GTTTTGGAAG TTGCCCAGCA CTTGGGTGAG 100 

20 AACACCGTCA GAGCCATTGC CATGGATGGT ACCGAGGGTT TGGTTAGAGG 150 

TGCGTCCGTC AAGGACACTG GCGCCCCTAT CTCGGTCCCC GTTGGCCGCG 200 

GGACTTTGGG TCGTATCATC AACGTCACCG GTGACCCCAT TGACGAGAGA 250 

GGTCCCATCG AGCTGACCCA GAGAAACCCT ATCCACGCCG ACCCCCCCTC 300 

GTTCGTTGAG CAGTCCACCA ACGCTGAGGT TTTGGAGACT GGTATCAAGG 350 

25 TTGTCGATTT GTTGGCTCCC TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACCGTCTTC ATTCAGGAGT TGATCAACAA 450 

CATCGCCAAG GCCCACGGTG GGTTCTCGGT CTTCACTGGT GTCGGTGAGA 500 

GAACTAGAGA GGGTAACGAC TTGTACCGTG AGATGAAGGA GACCGGTGTC 550 

ATCAACTTGG AGGGTGACTC CAAGGTGGCC TTGGTGTTCG GTCAGATGAA 600 

30 CGAGCCCCCT GGAGCCAGAG CCAGAGTCGC CTTGACCGGG TTGACCATTG 650 

CCGAATACTT TAGAGACGAG GAGGGTCAGG ATGTGTTGTT GTTCGTCGAC 700 

AACATCTTCA GATTCACCCA AGCTGGTTCG GAGGTGTCGG CCTTGTTGGG 750 

TCGTATTCCC TCTGCCGTCG GTTACCAGCC CACCTTGGCA ACTGATATGG 800 

GATTGTTGCA GGAGCGTATC ACCACGACCA AGAAGGGTTC CGTCACCTCA 850 

35 GTGCAGGCCG TCTACGTCCC CGCTGATGAC TTGACTGACC CTGCTCCCGC 900 

CACCACCTTT GCCCACTTGG ACGCCACCAC CGTGTTGTCC AGAGGTATCT 950 

CTGAGTTGGG TATCTACCCC GCCGTCGACC CCTTGGACTC CAAATCGAGA 1000 

TTGTTGGACG CTGCCGTGGT CGGTCAGGAG CACTACGATG TTGCCTCGAA 1050 

CGTCCAGCAG ACCTTGCAGG CCTACAAGTC CTTGCAGGAT ATCATTGCCA 1100 

4 0 TTTTGGGTAT GGATGAGTTG TCCGAGGCTG ACAAGTTGAC CGTTGAGAGA 1150 

GCCAGAAAGA TCCAGAGATT CTTGTCGCAG CCCTTCGCTG TTGCCGAGGT 1200 

TTTCACTGGT ATCAAGGGTA GATTGGTCAG ATTGGAGGAC ACCGTCAGAT 1250 

CTTTCAAGGA GGTTTTGGAG GGTAAGTACG ACCACTTGCC CGAGA 1295 



2) INFORMATION FOR SEQ ID NO: 479 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479 



PCT/CA00/01150 



TTCAGGAATT GATTGTAAGT TCTGTTATCA ACTAAAGCCG ACAGCGGTTG 50 

CTGATATGCT CTAGAACAAC ATTGCTAAGG CTCACGGTGG TTACTCCGTG 100 

5 TTCACTGGTG TCGGTGAGCG TACCCGTGAG GGTAACGATT TGTACCATGA 150 

AATGCAAGAG ACCCGTGTCA TTCAACTCGA CGGAGAGTCC AAGGTCGCTC 200 

TTGTCTTCGG TCAAATGAAC GAGCCCCCTG GTGCCCGTGC CCGTGTTGCC 250 

CTTACCGGTT TGACCATTGC TGAATACTTC CGTGACGAGG AAGGCCAAGA 300 

CGGTAGGCTT CATGCTTCTA TCGCTAGGGG CGTGTGATAC AGGAGGCTAA 350 

10 TCGCTTTTCT AGTGCTTCTC TTTATTGACA ACATTTTCCG TTTCACTCAA 400 

GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT CGTATTCCTT CCGCTGTCGG 450 



15 



30 



TTACCAACCT ACTCTCGCCG TCGACATGGG" TGTTATGCAG GAACGTATCA 500- 

CCACCACCAC CAAGGGATCC ATTACTTCAG TGCA 534 



2) INFORMATION FOR SEQ ID NO: 480 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 494 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 



CGTCTTGATT CAAGAATTGA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGTGAGCGAA CTCGAGAGGG TAACGATCTG 100 

35 TACCACGAAG TGAGTTGCGC CGTCCGAGTG TTTCCCGGGG AATCGCAAGA 150 

CTGATGTTGT CCCTTCTTCT CAGATGAGGG AAACCGGTGT CATCAACCTC 200 

GAGGGTGACT CCAAGGTCGC CTTGGTCTTC GGTCAGATGA ACGAGCCCCC 250 

TGGAGCCCGA GCCCGAGTCG CCTTGACCGG TTTGACCATT GCCGAATACT 300 

TCCGAGACGA GGAGGGTCAG GATGTCTTGT TGTTCATTGA CAACATTTTC 350 

4 0 CGATTCACCC AAGCCGGTTC CGAAGTGTCC GCCTTGTTGG GTCGTATCCC 400 

CTCCGCCGTC GGTTACCAGC CCACTCTGTC CACCGACATG GGTACCATGC 450 

AGGAGCGAAT TACCACCACC AAGAAGGGTT CCATCACTTC CGTC 494 



45 



55 



2) INFORMATION FOR SEQ ID NO: 481 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 415 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481 
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TTCAGGAGCT 
TTCACTGGTG 
AATGCAGGAG 
TGGTCTTCGG 
CTTACCGGTC 
CGTGCTGCTC 
AGGTGTCTGC 
ACCCTCGCCG 
CAAGGGTTCC 



TATCAACAAC ATCGCCAAGG CCCACGGTGG TTACTCCGTC 
TCGGTGAGCG AACTCGTGAG GGTAACGATC TGTACCACGA 
ACTTCCGTCA TTCAGCTTGA TGGCGAGTCC AAGGTCGCCC 
TCAGATGAAC GAGCCCCCTG GAGCTCGTGC CCGTGTCGCC 
TTACTGTAGC TGAATACTTC AGAGATGAGG AGGGTCAGGA 
TTCATTGACA ACATTTTCCG ATTCACTCAG GCCGGTTCCG 
CCTTCTCGGT CGTATCCCCT CTGCCGTCGG TTACCAGCCC 
TCGACATGGG TGGTATGCAA GAGCGTATTA CCACCACCAC 
ATTAC 



100 
150 
200 
250 
300 
350 
400 
415 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 482 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1281 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Geotrichum spp. 

(B) STRAIN: Lev-4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 

AGGACAACCT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

GGTGAGAAGC TCGTTCTCGA GGTTGCCCAG CATCTGGGTG AGAACACTGT 100 

CCGTACTATT GCTATGGACG GTACTGAGGG TCTCGTCCGT GGTCAGCCCG 150 

TTGTTGACAC CGGTGCCCCC ATTACCATTC CCGTTGGTCG TGGTACTCTT 200 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 250 

TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 300 

AGCAGTCTAC CTCCGCTGAG GTTCTTGAGA CTGGTATCAA GGTTGTCGAT 350 

CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 400 

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GCTGATTAAC AACATTGCCA 450 

AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 

GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGACCGGTG TCATCAACCT 550 

CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 600 

CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 650 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 700 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCGTATTC 750 

CCTCCGCTGT CGGTTACCAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 800 

CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 850 

CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 900 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 1000 

TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCCCGTAA 1150 

GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG 1200 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 1250 

GAGATCCTTG AGGGCAAGTA CGACCACCTC C 1281 



2) INFORMATION FOR SEQ ID NO: 483 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 586 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G185A5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 

15 ATTCAAGAAT TGATCGTACG TTCCTCCGCC- CCACACACGA TCAATGGAGA 50 

AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTCTAATAA TTCGAATAGA 100 

ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 200 

TGTTATCCAG CTCGATGGAG AGTCCAAGGT CGCCCTCGTT TTCGGTCAGA 250 

20 TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 300 

GTTGCCGAGT ACTTCCGTGA CGAGGAAGGC CAAGACGGTA TGTATAAGTA 350 

TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 400 

TTCTCTTCAT CGACAACATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 450 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

25 CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 550 

GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 586 



30 2) INFORMATION FOR SEQ ID NO: 484 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484 



GCGTGGCCAG AAGGTCATTG ACACTGGTGC TCCCATCACC ATCCCCGTCG 50 

GTGGTGCCAC ACTGGGTCGT ATTCTGAACG TCACGGGTGA CCCTATTGAC 100 

GAGCGTGGCC CCGTTAAGAC TGACGTTTTC CGCCCCATTC ACCGTGACCC 150 

CCCTGCCTTT GTCGAGCAGT CGACTGATGC CGAGATTCTC GAGACTGGTA 200 

50 TCAAAGTCGT TGACCTGATT GCCCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTGTTCGGTG GTGCCGGTGT CGGTAAGACC GTGCTTATCC AGGAGCTCAT 300 

CAACAACATC GCCAAGGCCC ACGGTGGTTT CTCCGTGTTC ACTGGTGTCG 350 

GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT GATTGAAACC 400 

GGTGTCATTA ACCTCGAGGG TGACTCGAAG GTGGCTCTGG TGTTCGGTCA 450 

55 GATGAACGAG CCCCCGGGTG CCCGTGCGCG TGTCGCTCTT ACTGGTCTGA 500 

CTGTGGCCGA GTACTTCCGT GACGACGAGG GCCAGGATGT GCTGCTGTTC 550 

ATTGACAACA TTTTCCGTTT CACCCAGGCC GGTTCGGAGA CTTCGGCTCT 600 

GCTGGGTCGT ATCCCTTCGT CGGTCGGTTA CCAGCCCACT TTGTCGACCG 650 

ATATGGGTGC CATGCAGGAG CGTATCACCA CCACCAAGAA GGGTTCGATT 700 

60 ACGTCGGTGC AGGCCGTCTA CGTGCCCGCC GATGATGTCA CTGACCCTGC 750 
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CCCTGCCACT ACCTTCGCCC ACCTTGACGC TACCACTGTG CTTGACCGTT 800 

CGATCGCTGA GCTGGGTATC TACCCCGCTG TTGACCCGTT GAACTCGAAC 850 

TCGCGTATGC TTGACCCCGC TATTGTGGGT CAGGAGCACT ACGACGTGGC 900 

CTCTGGCGTG CAGAAGCTGC TCCAGGACTA CAAGTCGCTT CAAGATATCA 950 

TTGCCATTCT GGGTATGGAT GAGCTTTCTG AGGAGGACAA GCTCACTGTC 1000 

GAGCGTGCCC GTAAGATGCA GCGTTTCATG TCGCAGCCTT TCGCTGTCGC 1050 

CCAGGTCTTT ACTGGTATCG AAGGTCGTCT TGTTGCCCTG AAGGACACGA 1100 

TCAAGGCCTG CAAGGAGATC CTGTCGGGCA AGCACGACAA CCTCC 1145 



2) INFORMATION FOR SEQ ID NO: 485 ~ ~ 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1261 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY:- Linear 

20 ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 



TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50 

GCCTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 100 

30 ATTGCTATGG ACGGTACCGA GGGTCTTGTC CGTGGTCAGA AGGTCCTTGA 150 

CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 200 

TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 250 

GACGTCTACC GTCCAATCCA CCGTGAGCCC CCGGCGTTCG TTGAGCAGTC 3 00 

GACTGATGCT GAGATTCTTG AGACTGGTAT CAAGGTGGTC GACCTGCTCG 3 50 

35 CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 4 00 

GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 450 

CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 500 

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550 

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 600 

40 CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 650 

ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 700 

ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750 

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 800 

GTATYACCAC CACCAAGAAG GGTTCGATTA CCTCGGTGCA GGCYGTTTAC 850 

45 GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900 

CTTGGACGCC ACCACGGTGT TGGACCGTTC GATTGCYGAG CTGGGTATCT 950 

ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 1000 

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050 

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 1100 

50 AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 1150 

CGTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 1200 

GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250 

TGTCCGGTAA G 1261 



2) INFORMATION FOR SEQ ID NO: 486 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 12 82 bases 

298 



SUBSTITUTE SHEET (RULE 26) 



10 



40 



55 



WO 01/23604 PCT/CA00/01150 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 



— -(-x-i-)-SEQUENCE- DESCRIPTION :_SEQ_ID_ NO: 486 



AGGAGGGCAA CTTGCCAGCT ATCTTCAACG CTTTGACGTT GAAGAACGGC 50 

GACCAGAAGT TGGTCTTGGA GGTGGCCCAG CACTTGGGTG AGAACACCGT 100 

15 CAGAACCATT GCCATGGACG GTACCGAGGG- TTTGGTCAGA GGCGCCTCTG 150 

TCACCGACAC YGGTGCCCCT ATCTCCGTGC CTGTCGGCCG TGAGACCTTG 2 00 

GGTCGTATTA TCAATGTTGT TGGTGAGCCA ATCGACGAGA GAGGCCCAAT 250 

CAACACCAAG AAGAGAAACC CTATTCACAC CGACCCACCT TCGTTTGTCC 300 

AGCAATCCAC TTCCGCCGAG GTCTTGGAGA CTGGTATCAA GGTTGTCGAC 350 

20 TTGTTGGCCC CTTACGCCAG AGGTGGTAAG ATTGGTTTGT TCGGTGGTGC 400 

CGGTGTCGGT AAGACCGTGT TCATCCAGGA GTTGATTAAC AACATTGCCA 4 50 

AGGCCCACGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCAGA 500 

GAGGGTAACG ATTTGTACCG TGAAATGCAG GAGACTGGTG TCATCAACTT 550 

CGAGGGTGAC TCCAAGGTCG CCTTGGTTTT CGGTCAGATG AACGAGCCTC 600 

25 CAGGAGCTAG AGCYAGAGTT GCCTTGACCG GTTTGACCAT CGCCGAGTAC 650 

TTCAGAGACG AGGAGGGTCA GGATGTGTTG TTGTTCGTCG ACAACATTTT 700 

CAGATTCACC CAAGCCGGTT CTGAGGTGTC TGCCTTGTTG GGTCGTATTC 750 

CATCCGCTGT CGGATACCAG CCAACCTTGG CCACCGATAT GGGTGCCTTG 800 

CAGGAGAGAA TTACCACCAC CAAGAAGGGT TCCGTCACCT CCGTCCAGGC 850 

30 CGTCTACGTG CCAGCCGATG ACTTGACTGA CCCTGCGCCA GCCACCACTT 900 

TCGCCCACTT GGACGCCACC ACTGTGTTGT CCAGAGGTAT CTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA CCCCTTGGAC TCCAACTCCA GATTGTTGGA 1000 

CGCCACCGTT GTTGGCCAGG AGCACTACGA CGTCGCCACC AACGTCCAGC 1050 

AAACTTTACA AGCTTACAAG TCCTTGCAGG ATATCATTGC CATTTTGGGT 1100 

35 ATGGATGAGT TGTCCGAGAC CCGACAAGTT GACCCGTCGA GAGAGCCAGA 1150 

AAGATCCAGA AGTTCTTGTC CCAGCCATTT GCCGTCGCCG AGGTTTTCAC 1200 

CGGTATTGAG GGTAGATTGG TTAGATTGGA GGACACCGTT AGATCCTTTA 1250 

AGGAGGTTTT GGAAGGTAAG TACGACCACT TG 1282 



2 ) INFORMATION FOR SEQ ID NO: 487 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 4 82 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 



TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 50 

TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGCCAAGG CTCACGGTGG 100 

60 TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 150 
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TGTACCACGA AATGCAGGAA ACTGGTGTCA 
AAGGTCGCCC TCGTGTTCGG TCAGATGAAC 
CCGTGTCGCT CTTACTGGTT TGACCATTGC 
AAGGTCAGGA CGTGCTTCTC TTCATTGACA 
5 GCCGGTTCTG AGGTGTCTGC CCTTCTGGGT 
TTACCAGCCC ACCCTTGCCG TCGACATGGG 
CCACCACCAC CAAGGGTTCC ATCACCTCCG 



PCT/CA00/01150 



TTP h P PT HCZ ZV 


nnnTPA atpp 

OUU ± \jr\ri. ILL 


000 








P(7Aptapttp 




inn 


nLn I X J. ILLu 


TTTPAPTPZVfJ 
111 LriL, 1 L*rHJ 


J -? V 


CGTATCCCCT 


CTGCCGTCGG 


400 


TATCATGCAG 


GAGCGTATTA 


450 


TC 
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— 2-)-INFGRMATI0N-E0R--SEQ JD NO : 488 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

t 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 



TCGAACAAGG 
CCAAGTGGTA 
CACTGTCAGA 

30 AACAAGTTAC 
ACTTTAGGTC 
TCCAATTAAC 
TCAGTGAACA 
GTYGATTTAT 

35 TGGTGCCGGT 
TTGCTAAAGC 
ACCAGAGAAG 
TAACTTGGAA 
AACCACCAGG 

40 GAATACTTCA 
CATTTTCAGA 
GTATTCCATC 
TTGTTACAAG 
CCAAGCTGTT 

45 CCACTTTCGC 
GAATTAGGTA 
ATTAGATGCT 
TTCAACAAAC 
TTAGGTATGG 

50 AAGAAAAATC 
TCACTGGTAT 
TTCAAAGATG 



TAACTTACCA 
CTCCATTAGT 
ACTATTGCTA 
TGATACTGGT 
GTATTATCAA 
ACCAAACAAA 
ATCAACTGCT 
TAGCTCCATA 
GTCGGTAAAA 
TCATGGTGGT 
GTAACGATTT 
GGTGATTCTA 
TGCTAGAGCT 
GAGATGAAGA 
TTCACCCAAG 
TGCTGTCGGT 
AACGTATTAC 
TATGTCCCAG 
CCATTTGGAT 
TTTACCCAGC 
TCAGTTGTTG 
TTTACAAGCT 
ATGAATTGTC 
CAAAGATTCT 
YCCAGGTAGA 
TTTTGGAAGG 



GCTATCTTGA 
TTTAGAAGTT 
TGGATGGTAC 
TCTCCAATCA 
CGTTGTTGGT 
GAAACCCAAT 
GCTGAAGTTT 
CGCTAGAGGT 
CTGTCTTTAT 
TTCTCAGTTT 
ATACCGTGAA 
AGGTCGCTTT 
CGTGTTGCTT 
AGGTCAAGAT 
CCGGTTCAGA 
TATCAACCAA 
CACCACACAA 
CTGATGATTT 
GCTACTACTG 
TGTCGATCCA 
GTCAAGAACA 
TACAAATCTT 
TGAACAAGAT 
TATCTCAACC 
TTGGTTAGAT 
TAAATATGAT 



ATGCTTTAGA 
GCTCAACATT 
TGAAGGTTTA 
CTGTCCCAGT 
GAACCAATTG 
TCACGCTGAA 
TAGAAACTGG 
GGTAAAATTG 
CCAAGAATTG 
TCACCGGTGT 
ATGAAAGAAA 
AGTTTTCGGT 
TAACTGGTTT 
GTCTTGTTAT 
AGTTTCTGCC 
CTTTAGCAAC 
AAAGGTTCAG 
AACAGATCCT 
TCTTGTCTCG 
TTAGATTCTA 
TTATGATGTT 
TACAAGATAT 
AAATTGACTG 
ATTTGCTGTT 
TAAAAGACAC 
CACTTACCAG 



AATCAAAACC 50 

TAGGTGAAAA 100 

GTTCGTGGTG 150 

TGGTCGTGAA 200 

ATGAACGTGG 250 

CCACCTTCAT 300 

TATCAAAGTT 350 

GTTTATTCGG 400 

ATTAACAACA 450 

TGGTGAAAGA 500 

CTGGTGTTAT 550 

CAAATGAATG 600 

GACCATTGCT 650 

TCGTTGATAA 700 

TTATTAGGTC 750 

TGATATGGGT 800 

TTACTTCTGT 850 

GCTCCAGCTA 900 

TGGTATTTCA 950 

AATCAAGATT 1000 

GCTACCAACG 1050 

TATTGCTATT 1100 

TCGAAAGAGC 1150 

GCCGAAGTTT 1200 

TATCAAATCA 1250 
1290 



55 

2) INFORMATION FOR SEQ ID NO: 489 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1291 bases 
60 (B) TYPE: Nucleic acid 

300 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 214 9 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489 

C^AATTCGAA^CAAGGTAACT ~TA~CCAGCTAT~ CTTGAATGCT~TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 100 

GAAAACACTG TCAGAACTAT TGCTATGGAT GGTACTGAAG GTTTAGTCCG 150 

15 TGGTGAACAA GTTACTGATA CTGGTTCTCC AATCACTGTC CCAGTTGGTC 200 

GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

CGTGGTCCAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 300 

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 350 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 4 00 

2 0 TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 450 

CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

AAAGAACCAG AGAAGGTAAC GATTTATACC GTGAAATGAA AGAAACTGGT 550 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 600 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGACCA 650 

25 TTGCTGAATA CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 700 

GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 800 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 900 

30 AGCTACCACT TTCGCCCATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATCCATTAGA TTCTAAATCA 1000 

AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 1050 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 1100 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGTCGAA 1150 

35 AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 12 00 

AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 1250 

AATCATTCAA AGATGTTTTG GAAGGTAAAT ATGATCACTT A 1291 



40 



50 



2) INFORMATION FOR SEQ ID NO: 490 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 508 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490 

CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 50 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGTCGGAGA 150 

60 GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 2 00 

301 
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TCATTCAGCT CAAGAACGAC AAGTCCAAGG 
ATGAACGAGC CCCCCGGAGC TCGTGCCCGT 
CATCGCCGAG TACTTCCGTG ACGTCGAAGG 
TCGACAACAT TTTCCGATTC ACCCAGGCCG 
CTCGGACGTA TCCCATCTGC TGTCGGATAC 
TATGGGTGGT ATGCAAGAGC GAATCACAAC 
CCTCCGTC 



CCGCTCTGGT CTTCGGACAG 
GTCGCTCTGA CCGGTCTCAC 
ACAGGATGTG CTACTCTTCA 
GTTCAGAGGT ATCTGCCCTG 
CAGCCCACAC TCTCAACCGA 
CACCAAGAAG GGTTCGATTA 



25T) 
300 
350 
400 
450 
500 
508 
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_2.LlNF0RI^nDN_F0R^SEQ ID NO: 491 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 686 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 

TGTCCTCATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

TCTCCCCCTT CAAACTTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 150 

CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 200 

CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGCCAGATGA 250 

ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 300 

TCTCTCTCTC GAGCGTCCTG GCTTGATACG GAACGCTGAC ACGTCACGCA 350 

GTCTCACTAT TGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGACGTGCTC 400 

CTCTTCATCG ACAACATCTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 450 

GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTCCCGTA TAGCCGGTTC 500 

GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTCCGCTGTC GGATACCAGC 550 

CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 600 

TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 650 

CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 686 



2) INFORMATION FOR SEQ ID NO: 492 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 
TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 50 
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ACTCCGTTTT CACCGGTGTC GGTGAGCGTA 
CACACTTTCG CCGACCTGGC CATTCCGGAA 
ACCTCGAACA GATCTCTACC ACGAGATGAT 
TCGACAACGA CAAGTCGAAG ACTGCTCTCG 
CCCCCTGGCG CCCGTGCCCG TGTCGCTCTT 
GTACTTCCGT GACGACGAGG GCCAGGACGT 
TCTTCCGTTT CACCCAGGGT ACGTTCGATC 
TCGTGGTGAC TGACAACCTG TTGCGCGTGC 
CCCTTCTCGG TCGTATCCCG TCCGCTGTCG 
ACCGACATGG GTGGCATGCA GGAGCGTATC 
TCTCTCGTTT CGCTCTGCAT CGCTCACGCA 



CCCGTGAGGG 
CGTGCGGACT 
CGAGACCGGT 
TCTTCGGCCA 
ACTGGTCTCA 
GCTTCTCTTC 
CCGCCCGTCC 
AGCCGGTTCG 
GATACCAGCC 
ACGTACGCCC 
TGTTCGCCCG 



TAACGGTGAG 
GACGAGGACC 
GTCATTCAGC 
GATGAACGAG 
CCATCGCGGA 
ATCGACAACA 
AACACGAATG 
GAGGTGTCTG 
CACTCTCTCG 
TCTTCTGCTT 
ACAGCACCAC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
"625 



CAAGAAGGGT TCGATCACCT GTGTC 
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2) INFORMATION FOR SEQ ID NO: 493 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1211 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493 

TCTCGTCCGT GGTGCTAAGG CCACTGACAC TGGTTCCCCC ATTACCATCC 50 

CCGTCGGCCC CGGTACCCTC GGTCGCATCA TGAACGTCAC CGGTGACCCG 100 

ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC CCATCCACGC 150 

TGAGGCTCCC GAGTTCGTTG ACCAGTCGAC CACCGCTGAG GTTCTCGTGA 200 

CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG TGGTGGTAAG 250 

ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGACCGTGT TCATCCAGGA 300 

GCTCATCAAC AACATCGCCA AGGCCCACGG TGGTTACTCC GTCTTCACCG 350 

GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG 4 00 

GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG CCCTGGTGTT 4 50 

CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC GCCTTGACCG 500 

GTTTGACTGT CGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGATGGTATG 550 

TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA ATTACTAACT 600 

TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC CCAGGCCGGT 650 

TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG TCGGTTACCA 700 

GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAGCGT ATTACCACCA 750 

CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT GCCCGCTGAC 800 

GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC TGGACGCCAC 850 

CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC CCCGCTGTCG 900 

ACCCCCTCGA CTCCAAGTCG CGTATGCTGG ACCCCCGTAT TGTCGGTGAC 950 

GACCACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC AAGAGTACAA 1000 

GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG CTGTCTGAGG 1050 

CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG TTTCCTGAGC 1100 

CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG GCCAGCTGGT 1150 

CGATCTGAAG GACACTATCG CTTCGTTCAA GGCTATCCTG AGCGGTGAGG 1200 

GTGACAGCCT T 1211 
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2) INFORMATION FOR SEQ ID NO: 494 



PCT/CA00/01150 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 



TCTTGTTAGA GGCACCCCAG TCAAGGACAC TGGTGCTCCA ATTACCATTC 50 

CAGTTGGTAA CGGCACTTTG GGCCGTATCG TCAACGTTCT CGGTGAGCCA 100 

ATTGATGAGC GTGGACCAGT CAAGGCTGAC AAGTTCAGAC CTATTCACGC 150 

20 TGAGCCACCA ACCTTCGCTG ACCAGTCCAC CTCTGCCGAG GTTCTTGAGA 200 

CCGGTATTAA GGTTGTCGAC TTGCTTGCCC CTTATGCCAG AGGTGGTAAG 250 

ATTGGTCTTT TCGGTGGTGC CGGTGTCGGT AAGACTGTGT TCATTCAGGA 300 

GCTTATTAAC AACATTGCTA AGGCCCACGG TGGTTACTCT GTCTTCACTG 350 

GTGTCGGTGA GCGAACTCGT GAAGGTAACG ATTTGTACCA CGAAATGATG 400 

25 GAGACCGGTG TCATCAACCT TGAGGGTGAC TCCAAGGTGT CTCTTGTGTT 450 

CGGTCAGATG AACGAGCCTC CAGGAGCCCG TGCCCGTGTT GCCTTGACCG 500 

GTTTGACCAT TGCCGAGTAC TTCAGAGATG AGGAGGGCCA GGATGTCTTG 550 

TTGTTCATTG ACAACATTTT CCGATTCACC CAGGCCGGTT CTGAGGTCTC 600 

TGCCTTGTTG GGTCGTATCC CATCTGCCGT CGGTTACCAA CCAACCTTGG 650 

30 CTACTGATAT GGGTGGTCTT CAAGAACGTA TTACCACCAC TCAAAAGGGT 700 

TCCGTCACCT CTGTCCAGGC TGTCTACGTC CCAGCTGACG ATTTGACTGA 750 

TCCTGCCCCA GCTACCACCT TCGCCCATTT GGACGCCACC ACCGAATTGT 800 

CCCGATCTAT CTCTGAGTTG GGTATCTACC CAGCTGTCGA CCCTCTTGGT 850 

TCCAAGTCCC GTCTTTTGGA TGCCTCCGTC GTCGGCCAAG AGCACTACGA 900 

3 5 CGTTGCCGCC AACGTCCAAC AGACCTTGCA GGCCTACAAG TCTCTCCAGG 950 

ATATCATTGC CATTTTGGGT ATGGACGAAT TGTCTGAGGC TGATAAGCTC 1000 

ACTGTCGAGC GTGCTCGTAA GATGCAGAGA TTCCTTTCTC AGCCATTCAC 1050 

CGTCGCTGAG GTCTTCACTG GTCTCGAGGG TAGACTCGTT TCTTTGAAGG 1100 

ACACCATCCG ATCCTTCAAG GAGATCCTTG ACG 1133 

40 



2) INFORMATION FOR SEQ ID NO: 495 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 
55 (B) STRAIN: WSA-225 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 



GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 
60 GCGAAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 100 
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AGTATTAATT GTTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAt! ' VB9 

TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 200 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 300 

5 GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350 

TCAAGACGGT GAGTTTCTTA TGGATAAAAA AAAATTTTTT TTTTTTTTTT 4 00 

TTTTTTTCAA GAAATTCATG TTCTAACAAA GTGTATTCTA GTGCTTCTCT 450 

TCATCGACAA CATTTTCCGT TTCACTCAGG CTGGTTCCGA AGTGTCTGCC 500 

CTGCTTGGTC GTATCCCATC TGCCGTCGGT TACCAACCCA CTCTTGCCGT 550 

10 CGACATGGGT GGTATGCAGG AACGTATTAC CACCACCAAG AAGGGATCCA 600 

TTACCTCC 608 



30 



15 2) INFORMATION FOR SEQ ID NO: 496' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 



GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 50 

AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT 100 

CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 150 

CACGAAATGC AGGAGACCTC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 200 

35 CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 250 

TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 300 

CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 350 

CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 400 

TTACAAGATT ATGCTAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 450 

40 GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 500 

AATATCGACG GCGTACTCTT CGCATCAAAA AACCAAAGAG GTGGTTTGGT 550 

GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 600 

GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 650 

TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 700 

45 CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 750 

GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 794 



50 2) INFORMATION FOR SEQ ID NO: 497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1148 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

305 
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(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 

5 

TCTTGTCCGA GGCACCGCCG TCGCTGACAC CGGTGCTCCC ATCACTATCC 50 

CCGTCGGCCG AGGTACCCTT GGTCGAATCA TCAACGTCTG TGGTGAGCCC 100 

ATTGACGAGC GAGGACCCAT CGAGGCTTCC AAGTACCTCC CCATCCACGC 150 

TGACCCCCCT ACCTTCGCTG AGCAGTCTAC CTCCGCTGAG GTTCTCGAGA 200 

10 CCGGTATTAA GGTCGTCGAC CTCCTCGCCC CTTACGCCCG AGGTGGTAAG 250 

ATTGGTCTCT TCGGTGGTGC CGGTGTCGGT AAGACTGTCT TCATCCAGGA 300 



GCTGATTAA~C - AACATTGCCAT"AGGCC 3-50- 

GTGTCGGTGA GCGAACCCGA GAGGGTAACG ATCTTTACCG AGAGATGAAG 4 00 

GAGACTGGTG TCATCAACCT CGAGGGTGAG TCTAAGGTCA CCCTCGTCTT 450 

15 CGGTCAGATG AACGAGCCTC CCGGAGCCCG TGCCCGAGTC GCCCTTACTG 500 

GTCTGACCAT TGCCGAGTAC TTCCGAGACG AGGAGGGTCA GGATGTGTTG 550 

CTCTTCGTTG ACAACATTTT CCGATTCACC CAGGCCGGTT CCGAGGTGTC 600 

CGCTCTGCTT GGTCGAATTC CCTCCGCTGT CGGTTACCAG CCCACTCTGG 650 

CCACCGATAT GGGTGCCCTC CAGGAGCGAA TTACCACCAC CCAGAAGGGT 700 

20 TCCGTCACTT CCGTCCAGGC CGTCTACGTG CCTGCCGATG ATTTGACCGA 750 

TCCTGCTCCC GCCACCACCT TCGCCCATCT TGACGCCACC ACCGTCCTGT 800 

CCCGAGGTAT TTCCGAGCTG GGTATCTACC CCGCTGTCGA TCCCCTTGAT 850 

TCCAAGTCTC GACTTCTGGA TATCGATGTT GTCGGAAAGG AGCACTACGA 900 

TGTTGCTTCC AACGTCCAGC AGACCCTCCA GGCTTACAAG TCTCTCCAGG 950 

25 ATATCATTGC CATTCTTGGT ATGGATGAGC TGTCCGAGCA GGACAAGCTG 1000 

ACCGTCGAGC GAGCTCGAAA GATCCAGCGA TTCCTGTCTC AGCCCTTCAC 1050 

CGTCGCCGAG GTTTTCACCG GTATTGAGGG ACGACTTGTC TCTCTCAAGG 1100 

ACACTGTCCG ATCCTTCAAG GAGATCCTTG ACGGTAAGCA CGATGCTC 1148 



30 



45 



2) INFORMATION FOR SEQ ID NO: 498 



(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498 



GCGCTATTGT CGTTGTTGCT GCCTCCGACG GTCAGATGTA GGTGGAACAT 50 

CTTGGGAAAT ACGTCGTAAA ACACGTCGCT TACGTTTTCG CGAATAGGCC 100 

50 CCAGACTCGT GAGCATTTGC TGCTCGCCCG CCAGGTTGGT GTCCAGAAGA 150 

TCGTTGTCTT CGTCAACAAA ATCGATGCTA TTGATGATCC GGAGATGCTG 200 

GAACTGGTCG AACTCGAGAT GCGTGAGCTG CTGAACAGCT ACGGTTTCGA 250 

GGGTGAAGAG ACTCCGATCA TTTTCGGTTC CGCTCTCTGT GCTCTCGAAG 300 

GACGCCGTGA CGACATCGGT AAAGACAGAA TTGAGCAGCT TATGAACGCT 350 

55 GTCGACACCT GGATCCCCAC TCCTCAGCGT GACCTCGACA AACCTTTCTT 400 

GATGTCTGTC GAGGAAGTGT TCTCTATCGC CGGCCGTGGT ACCGTGGCTT 4 50 

CTGGTCGTGT CGAGCGTGGT ATCTTGAAGA AGGACTCTGA GGTTGAGATT 500 

GTTGGAGGCT CCTTCGAACC CAAGAAGACC AAAGTCACCG ACATTGAAAC 550 

CTTCAAGAAG AGCTGTGATG AATCGCGTGC TGGTGACAAC TCTGGTCTCC 600 

60 TCCTGCGTGG TATCCGACGT GAAGACGTCA AGCGTGGTAT GGTCATTGCT 650 
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PCT/CA00/01150 



GTTCCCGGCA GCACCAAGGC 
CCTGACCGAG GCGGAGGGTG 
GTCCCCAAGT CTTCATCCGT 
AGATCTTCCG AGAGATTAGC 
TGCTGACCTC AGCTTCCCTG 
GTGACAACGT CGAGATGATC 
GCTGGTCAAC GCTTCA 



TCACGACAAG 
GTCGTCGTAC 
ACTGCAGGTA 
GATATATGCT 
ACGGCGACCA 
CTGAAGACCC 



TTCCTCGTCT CCATGTACGT 
TGGCTTCGGT GCCAACTACC 
AGTTCCCGCA CACCGTGTCC 
AATGATTCAT CAGACGAGGC 
ATCTCGCAGA GTTATGCCTG 
ACCACCCTGT TGCTGCTGAG 



700 
750 
800 
850 
900 
950 
966 
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2) INFORMATION FOR SEQ ID NO: 499 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid - 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 

TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 50 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 150 

CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT GATGGCGATA 200 

ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTCTTGA AGGTCGTGAA 250 

CCAGAAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 300 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 350 

TTGAAGGTGT TTTCTCTATT CCAGGTCGTG GTACTGTTGC CACTGGACGT 400 

GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGCTA 450 

CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 500 

AGGAACTTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 550 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAACCAGG 600 

TACCGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 650 

AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 750 

TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 

GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGCCAACG TTTCAA 846 



2) INFORMATION FOR SEQ ID NO: 500 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

» 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 



PCT/CA00/01150 



GGTGCCATTA TTGTTGTTGC TGCCTCTGAT GGACAGATGC CCCAGACCCG 
TGAGCATCTT TTGCTTGCCC GCCAAGTCGG TATGCAAAAG GTCGTTGTGT 
5 TTGTTAACAA GATTGATACC ATTGATGACC CTGAAATGCT TGATCTTGTC 
GAGATGGAGA TGCGTGAACT GTTGAATGAA TATGACTTCG ATGGAGATAA 
CTCTCCTGTC ATTATGGGCT CTGCTCTTGC TGCTCTTGAG GACAAGAACC 
CCGAGATTGG TAAGGACCGT ATCATGCAGC TCTTGGACGC TGTTGATGAA 
TGGATCCCTA CCCCCGAGCG TGACCTTGAC AAGCCTTTCA TGATGCCTAT 

10 TGAGGCCTCT TTCTCCATTT CTGGTCGTGG TACTGTTGCC ACTGGCCGTG 
TCGAGCGTGG TATTCTCAAG AAGGGTGAGG AAGTCGAGAT CGTTGGTTTC 
AACAAGCAGC - CCCTGAAATC "TGTTGTTACT" "GGTATTGAAA - TGTTCAAGAA 
GGAACTTGAT CAGGCCCAGG GCGGTGATAA TGCTGGTATC TTGCTTCGTG 
GTATTCGTCG TGAGGACTTG CAGCGTGGTA TGGTTTTGGC CAAGCCTGGA 

15 ACTGTTAAGG CTCACACCAA GTTCCTTTCC TCCATCTACG TTCTCTCCAA 
GGAAGAGGGC GGCCGTCACT CTCCTTTCGG TATGAACTAT CGTCCCCAGA 
TGTTCGTTTC TGCAGCTGAT GTCACCGTTA CTCTTACTTT CCCTGAGGGT 
GTTGAACAGC ACACTCAGGT CTTCCCTGGT GAGAACACCG AGATGGTTGG 
CGAGCTCGTT CACCCTACTG CTATTGAGGT TGGTCAACGC TTCAAC 

20 



2) INFORMATION FOR SEQ ID NO: 501 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 
35 (B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 

AGTTGTCGTC GTTGCTGCTT CAGACGGTCA AATGTATGCA ACCGAGAGCA 50 

40 CTCCCGGATC TTGGTTTAAA TGGCACTAAT ATAAGACAGG CCTCAAACTC 100 

GAGAGCATTT ACTTCTCGCC CGTCAGATCG GTATCCAAAA AATCGTCGTC 150 

TTCGTGAACA AGGTTGATGC CATCGAGGAC AAAGAGATGT TGGAGCTTGT 200 

TGAATTGGAG ATGCGTGAAC TCCTAACCAG CTACGGTTTC GAGGGTGAAG 250 

AAACTCCCAT CATTTTTGGC TCTGCTCTCT GTGCCCTCGA AGGAAGACAA 300 

45 CCCGAGATCG GTGTTACCAA GATTGATGAG CTCTTGCAGG CCGTCGACAC 350 

CTGGATTCCC ACTCCTCAGC GTGAGACTGA CAAGCCCTTC TTGATGTCCA 400 

TTGAGGAAGT GTTCTCTATT TCCGGACGAG GAACCGTTGT CTCCGGCCGT 450 

GTGGAGCGTG GTATCCTCAA GAAGGACTCC GAAGTTGAAA TTGTCGGCGG 500 

TTCGCCCGAG CCAATCAAAA CCAAGGTTAC CGATATCGAA ACCTTTAAGA 550 

50 AGTCTTGCGA CGAGTCTCGC GCTGGTGATA ACTCCGGCTT GCTCCTACGA 600 

GGCGTTAAGC GTGAAGATAT TAGCCGTGGC ATGGTCGTCG CTGTACCAGG 650 

AAGTGTCAAG GCCCATACTG AATTCTTAGT TTCGCTTTAC GTCCTCACCG 700 

AAGCTGAGGG TGGGCGCAAA TCTGGATTCA GCAGCAAGTA CCGCCCACAG 750 

ATGTTCATTC GCACTGCCGG TATGTAATAC TGTGATAATT TCGTTGACAT 800 

55 GGTACTGATT GAATTCTATA GACGAAGCGG CTCAGCTCAG CTGGCCCGGA 850 

GAAGATCAAG ACAAGATGGC TATGCCAGGA GACAATATCG AAATGATTTG 900 

CACCACCTTG CACCCAGTTG CCGCCGAGGC TGGCCAGCGA TTCA 944 



60 
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2) INFORMATION FOR SEQ ID NO: 502 



PCT/CAOO/01150 



( i ) SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE^ 

(A) ORGANISM: FusaTlum~oxysporum- 

(B) STRAIN: WSA-212 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 

GCTATCATCG TCGTTGCTGC CTCCGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGCCATT GATGACCCCG AGATGCTTGA GCTCGTCGAG 150 

20 ATGGAGATGC GCGAGCTTCT TAACACCTAC GGCTTCGAAG GCGACGACAC 200 

TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTCCAGAAC CAGCGCCCCG 250 

AGATTGGCAC CGAGAAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 300 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 350 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 400 

25 AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 450 

CAGGAGGTTA TCAAGACCAA GGTTACCGAC ATTGAGACCT TCAAGAAGTC 500 

TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATCCGAGGTG 550 

TTCGCCGTGA GGATGTCCGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACCCAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

30 GGAGGGTGGC CGACACACCG GTTTCCAGGA GCACTACCGA CCCCAGCTCT 700 

ATCTCCGAAC TGCAGATGAG TCCATTGACC TGACTTTCCC CGAGGGTACT 750 

GAGGATGCCT CCAGTAAGAT GGTCATGCCT GGTGACAACA CCGAGATGGT 800 

TGTCACCATG GGTCACCCCA ATGCCATCGA GGTTGGTCAG CGATTCAAC 84 9 



35 



2) INFORMATION FOR SEQ ID NO: 503 

(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 1064 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 



TGGTGCTATT GTCGTTGTTG CTGCTGCTGA CGGCCAAATG TAAGACGCCG 50 

CGAGGGACTG CTGAGGGTTT TATGCTTTTT AGGCCCCCTT GTTTCTGAGA 100 

55 GCATGATGAT ACTAATATTC GGAAACGTAT CTATTAGGCC TCAAACACGT 150 

GAGCATTTGC TCCTTGCCCG ACAGGTCGGT GTCCAAAAGA TCGTCGTTTT 200 

CGTGAACAAA GTCGACGCCC TTGAGGACAA GGAGATGTTG GAGCTTGTCG 250 

AGTTAGAAAT GAGAGAGCTC TTAAACACCT ACGGCTTCGA GGGTGAAGAG 300 

ACACCCATCA TCTTTGGTTC TGCCCTTTGC GCCATGGAAG GCCGTGAGCC 350 

60 TGAGTTGGGA GAAAAGAAAA TTGATGAATT GCTGGAGGCT GTTGATACTT 4 00 

309 
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GGATCCCCAC 
GAGGAAGTAT 
TGAGCGCGGT 
GCTCCACCCC 
5 TCCTGTGACG 
TATCAAGCGT 
GCGTCAAGGC 
GCTGAGGGTG 
GTTCATCCGC 
10 TTCTTTTTTT 

GCCTGATTCA 

ATCTCAGCTT 
GACAACGTCG 
CGGCCAGCGA 

15 



PCT/CAOO/01150 



ACCACAACGT 

TCTCTATCTC 

GTCCTCAAGA 

CATCAGGACG 

AGTCTAGAGC 

GAAGATATCC 

CCACGACAAG 

GTCGCCGAAC 

ACAGCTGGTA 

CCTTTTCCTC 

CGAAATTAAC 

CCCTAGTGGA" 

AGATGATCCT 

TTCA 



GATACCGAAA 

CGGCCGTGGA 

AGGATTCAGA 

AAGGTAACTG 

TGGGGACAAC 

GCCGTGGTAT 

TTCTTGGTGT 

CGGATTCGGC 

TGTCAAAATG 

TCTATCTCTC 

TAACCCGTTT 

"GCAGATGAAAT 

CCAGACACAC 



AACCTTTCTT 
ACCGTTGCCT 
AGTCGAGCTA 
ATATCGAAAC 
TCCGGTCTTT 
GGTAGTTGCC 
CGATGTATGT 
CAGAACTATC 
GGACCCCTTT 
TTTCTGTTTC 
GATTATAGAC 
GCAAACTCGT 
CGCCCCGTGG 



GATGTCCGTT1 

CCGGTCGTGT 

ATTGGGGGCG 

TTTCAAGAAA 

TATTGCGTGG 

GTTCCTGGCA 

CCTGACCGAA 

GTCCTCAAAT 

TCATAATCCT 

CTTTCAACTC 

GAAGCCGCCC 

TATGCCTGGT 

CTGCTGAGGC 



4'5® 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
-1-000- 
1050 
1064 



2 ) INFORMATION FOR SEQ ID NO: 504 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
30 (B) STRAIN: ATCC 32071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 

TGGTGCTATC GTCGTTGTTG CTGCCTCTGA CGGCCAAATG TAGGGATTTT 50 

35 GCAAGACTGG TGAAAAAATC TAAAGAAAAT AGAAAAGATT GTGCTGATGT 100 

TTGGTATCAG GCCCCAAACA CGCGAGCATC TGCTTCTTGC CAGACAAGTC 150 

GGTGTTCAGA AAATCGTTGT TTTCGTCAAC AAGGTCGATG CTGTAGAGGA 200 

TAAGGAGATG TTGGAGCTTG TCGAATTGGA GATGAGAGAG CTCTTGACCA 250 

CCTATGGGTT CGAGGGTGAG AAGACACCTA TCATCTTTGG TTCTGCGCTC 300 

40 TGTGCTATGG AGGGCCGTCA GCCCGAGTTG GGAGAGCAGA AAATTGATGA 350 

ATTACTCGAG GCTGTGGATA CTTGGATCCC TACGCCACAG CGTGATACTG 400 

ACAAGCCCTT CCTGATGTCC ATTGAGGAGG TGTTCTCTAT CTCTGGACGA 450 

GGAACCGTTG CCTCCGGCCG CGTTGAGCGT GGTATCCTCA AGAAGGACTC 500 

CGAAGTTGAA ATTATTGGCG GCGGTGTTCC CACAATCCTG ACCAAGGTGA 550 

45 CTGATATCGA AACCTTCAAG AAGTCTTGCG ACGAGTCCAG AGCCGGGGAC 600 

AACTCCGGCC TCTTGTTGCG CGGTGTCAAG CGTGAGGATA TCCGCCGTGG 650 

TATGGTCGTT GCAGTTCCCG GAAGCGTCAA AGCACATGAC AGATTCTTGG 700 

TGTCGATGTA CGTTCTGACC GAGGCTGAGG GTGGTCGCCG CACTGGCTTC 750 

GGTCAGAACT ATCGTCCTCA AATGTTCATC CGCACAGCTG GTACGTTCAT 800 

50 TCTTTCACTA TATTCCTATA TGCATAGCCC GATCCTCCCA TTAACTAATT 850 

GACACAGACG AGGCTGCTGA ACTCAGCTGG CCTGATGGAG ACGACGAAGC 900. 

CAAAATGGTC ATGCCCGGTG ACAATGTTGA AATGGTCCTG AAGTCACACC 950 

GCCCGGGGCC GCTGAGGCTG GACAGCGATT CA 982 

55 

2) INFORMATION FOR SEQ ID NO: 505 

( i ) SEQUENCE CHARACTER I ST I CS : 
60 (A) LENGTH: 931 bases 

310 
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10 



WO 01/23604 PCT/CA00/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS r Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 



CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

15 CACTTGCTCC TCGCCCGTCA GGTCGGTGTT< CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

20 TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

25 CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

30 CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 
ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G .931 



35 2) INFORMATION FOR SEQ ID NO: 506 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 



TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCTCAAACCA 50 

GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100 

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150 

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 200 

55 ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 250 

GAAGAAATTG GTGTCAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 3 00 

ATATATCCCA ACCCCACAAA GAGATTTAGA AAAACCATTC TTGATGGGTG 350 

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTACCGTTGT TACTGGTCGT 4 00 

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 450 

60 AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTTCAAAA 500 

311 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AAGAATTGGA CCAAGCTATG GCTGGTGATA 
GGTATCAAAA GAGATGACAT YAAAAGAGGT 
TACCATCTCW GCTCACACTA AATTCTTAGC 
AAGAAGAAGG TGGTCGTCAC TCAGGTTTTG 
5 TTATTCATCA GAACTGGTGA TGTTACCGTT 
TGGTGATTCA TCTCAACAAA TCTTACCAGG 
GTGAATTGGT TCACCCAACT GCTTTAGAAG 



PCT/CAOO/01150 

ACTGTGGTAT CTTATTACGT 550 

ATGGTTATTG CTAAAACCGG 600 

CTCAATGTAT ATTTTGACTA 650 

GTGAACATTA CAGACCTCAA 700 

GTTTTAACCT TYCCAGAAGG 750 

TGACAATGTC GAAATGGTTT 800 

CTGGTCAAAG ATTCAA 84 6 



10 



20 



2 ) INFORM ATION FOR SEQ ID NO: 507 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 964 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 



GGAGCTGTTG 
CCGCCCAGAC 
CAGACCAGAG 

30 GGTCGTTTTC 
AGCTTGTCGA 
GGTGAGGAGA 
CCGTCGACCT 
TCGACACCTG 

35 ATGTCCATTG 
CGGTCGTGTT 
TTGGTGGCTC 
TTCAAGAAGT 
TCTCCGAGGT 

40 CCCCCGGATC 
CTGACTGAGG 
CCCTCAAATG 
TTACTAAGTA 
CATCTTTCAG 

45 GACAACGTCG 
TGGCCAACGA 



TCGTTGTCGC 
GGATGAAAGG 
AACATTTGCT 
GTTAACAAGG 
ACTTGAAATG 
CCCCCATCAT 
GAGCTTGGTG 
GATCCCCACC 
AGGAAGTGTT 
GAGCGTGGTA 
TACCACCCCT 
CCTGCGATGA 
ATCAAGCGTG 
CACCAAGGCT 
CTGAGGGTGG 
TTCATCCGTA 
GATCATTGCT 
CTGGCCTGGA 
AGATGATTTG 
TTCA 



AGCTTCTGAC 
ATTTGACGTT 
CCTTGCCCGC 
TCGATGCCGT 
CGTGAACTCC 
TTTTGGCTCT 
TCGAGAAGAT 
CCCGAGCGCG 
CTCTATCTCT 
TCCTCAAGAA 
ATCAAGACCA 
ATCTCGAGCT 
AGGACTTGAA 
CACACCGACT 
TCGTTCCAAC 
CTGCTGGTAT 
AACTTGTATT 
GAAGACCAAG 
CAAAACCCTC 



GGTCAAATGT 
TCTAACATCA 
CAGGTCGGTG 
TGAGGACCCA 
TCAGCCACTA 
GCTCTCTGTG 
TGACGAGCTA 
CCACTGATAA 
GGTCGTGGTA 
GGATTCCGAC 
AGGTCACAGA 
GGTGACAACT 
GCGTGGAATG 
TCATGGTCTC 
GGCTTCACCC 
GTAACCAAAG 
CCCTTCCGTA 
ACAAGAAGGC 
CACCCCATTG 



AATTGAATGC 50 

GTCTAGGCCT 100 

TCCAGAAGCT 150 

GAGATGTTGG 200 

CAGTTTTGAG 250 

CCCTCGAGTC 300 

TTGAACGCCG 350 

GCCTTTCCTC 4 00 

CCGTCGTCTC 450 

GTCGAAATTG 500 

TATCGAAACC 550 

CTGGTCTCCT 600 

GTTGTTGCTG 650 

CCTCTACGTC 700 

ACAAGTACCG 750 

TTTCCGCTAT 800 

GACGAAGCCG 850 

TATGCCTGGT 900 

CTGCCGAGGC 950 
964 



50 2) INFORMATION FOR SEQ ID NO: 508 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

312 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508 

5 

GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 150 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 200 

10 CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 250 

AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 300 



CACATCCCTA CCCCTAfiCCG~TGA"CCTTGAG" AAGCCCTTCC-TGATGCCCGT- 350 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 400 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 4 50 

15 AACAACAAGC CCATCAAGGC TGTTGTTACO GGTATTGAGA TGTTCAAGAA 500 

GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

GTATCAAGCG AGACGAGATC AAGCGAGGTA TGGTCATGTG CAAGCCTGGC 600 

ACCGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 700 

20. TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 750 

ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 800 

CGAGCTTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 



25 

2) INFORMATION FOR SEQ ID NO: 509 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1067 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 

CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 50 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTACGGGTA CCTCCCAGGC 150 

45 CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 200 

CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 250 

CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 300 

CTACAAGGAG GACCGTTACA GCGAAATCCA GAAGGAAGTT CAGGGTTACC 350 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400 

50 GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 450 

CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 500 

GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 550 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 600 

GGCCGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 650 

55 AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 700 

AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 750 

TCACGTCGCT TCTGACTCCA AGAACGACCC CGCCAAGGCC GCCGTGTCCT 800 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTACCAT CAAGGCCGGT 850 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 900 

60 CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 950 

313 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ACCCCAAGAC CATCAAGAAC GGCGACGCCG 
TGCAAGCCCA TGGTCGTCGA GGCCTTCACT 
TTTCGCCGTG CGTGACG 



PCT/CA00/01150 

CTATGGTCGT CCTGAAGCCG 1000 
GAATACGCTC CCCTTGGTCG 1050 

1067 



2) INFORMATION FOR SEQ ID NO: 510 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1049 bases 
(B) TYPE: Nucleic acid 

(C) ~ STRANDEDNESS "Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 1 Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 

GTGAACGTGA ACGTGGTATT ACTATTGATA TTACCTTATG GAAGTTCGAG 50 

ACCACCAAGT ACTACTACAC CGTCATTGAT GCCCCTGGTC ACCGTGACTT 100 

CATCAAGAAC ATGATTACGG GTACTTCTCA AGCCGATGTT GCTATGCTTG 150 

TTGTACCAGC TGAGGCTGGT GGTTTCGAGG CCGCTTTCTC CAAGGAAGGA 200 

CAGACCCGTG AGCACGCTCT TTTGGCTTTC ACCCTTGGTG TCAAACAGAT 250 

CATCTGTGCC ATTAACAAGA TGGACAAGTG CGACTACAAG GAGGACCGTT 300 

ACAGTGAAAT CCAGAAGGAA GTCCAGGGTT ACCTCAAGAA GGTCGGTTAC 350 

AATATTGAGA AGGTGCCCTT CGTTGCCATC TCCGGTTTCA TGGGAGACAA 400 

CATGGTTGAG CGTTCCACCA ACATGCCCTG GTATAAGGGA AAGACATTGG 450 

TCGAGGCCCT TGATCAGATG GAACCCCCAA AGAGGCCCGT TGACAAGCCA 500 

CTTCGTCTTC CCCTCCAGGG TGTCTACAAG ATCGGTGGTA TCGGTACCGT 550 

CCCCGTCGGT CGTGTTGAAA CTGGTATGTT GAAGGCTGGT ATGATTCTAA 600 

CCTTTGCTCC TAACCCAATC ACCACTGAAT GCAAATCCGT TGAAATGCAC 650 

CACGAAACCG TTGAGGTTGC TTACCCCGGT GACAACGTCG GTTTCAACGT 700 
AAAGAACGTT TCTACTTCTG ACATTCGCAG TGGTCACGTT GCCTCTGATT ' 750 

CTAAGAACGA CCCTGCCAAG GCTGCTGTTT CCTTCACTGC CCAGGTCATT 800 

GTGCTCAACC ACCCTGGTAC CATTAAGGCC GGTTACTGCC CCGTCGTCGA 850 

TTGCCACACC GCTCACATTT CATGTAAATT CGAAGAGATC ACCAGCCGTA 900 

TGGACAAGCG TACCGGTAAA TCTCTTGAGG AAAACCCCAA GACCATCAAG 950 

AACGGTGACG CTGCCATGGT TGTGCTCAAG CCAATGAAGC CCATGGTTGT 1000 

CGAATCCTTC ACTGAGTATG CTCCTCTTGG TCGTTTCGCT GTTCGTGAC 1049 



2) INFORMATION FOR SEQ ID NO: 511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithidia fasciculata 

(B) STRAIN: ATCC 11745 

314 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 

TGGACAAGCT GAAGGCGGAG CGCGAGCGCG GTATCACGAT CGATATTGCC SO 

CTGTGGAAGT TCGAGTCGCC CAAGTCCGTG TTCACGATCA TCGATGCCCC 100 

5 CGGCCACCGC GACTTCATCA AGAACATGAT CACCGGCACC TCCCAGGCCG 150 

ATGCCGCCAT TCTGATGATT GACTCGACCC AGGGTGGCTT CGAGGCTGGC 200 

ATCTCCAAGG ACGGCCAGAC CCGCGAGCAC GCCCTGCTTG CCTTCACGCT 250 

GGGCGTGAAG CAGATGGTTG TGTGCTGCAA CAAGATGGAC GACAAGACGG 300 

TGCAGTACGC CCAGGCCCGC TACGAGGAGA TCAGCAAGGA GGTCGGCGCG 350 

10 TACCTGAAGC GCGTGGGCTA CAACCCGGAG AAGGTGCGCT TCATCCCGAT 4 00 

CTCGGGCTGG CAGGGCGACA ACATGATCGA GAAGTCCGAC AACATGTCGT 450 



GGTACAAGGG TCCCACGCTG~CTGGAGGCGC _ TCGACCTGCT~GGAGGeCCCC 500- 

GTGCGTCCGG TGGACAAGCC GCTGCGCCTG CCCCTGCAGG ACGTGTACAA 550 

GATCGGCGGT ATCGGCACTG TGCCCGTGGG CCGTGTGGAG ACCGGCGTGA 600 

15 TGAAGCCGGG CGACGTTGTG GTGTTCGCGC' CTGCCAACGT GACGACCGAG 650 

GTGAAGTCGA TCGAGATGCA CCACGAGCAG CTGGCTGAGG CCGTGCCCGG 700 

CGACAACGTG GGCTTCAACG TGAAGAACGT GTCCGTGAAG GATATCCGCC 750 

GTGGTAACGT GTGCGGCAAC ACGAAGAGCG ACCCCCCGAA GGAGGCGGCC 800 

GACTTCACTG CCCAGGTGAT CGTGCTGAAC CACCCCGGCC AGATCAGCAA 850 

20 CGGCTACGCG CCGGTGCTGG ACTGCCACAC GAGCCACATC GCGTGCAAGT 900 

TCGCGGACAT CGAGTCCAAG ATCGACCGCC GCTCTGGCAA GGAGCTGGAG 950 

AAGAGCCCGA AGGCCATCAA GTCCGGCGAT GCGGCCATCG TGAAGATGAT 1000 

CCCGCAGAAG CCGATGTGCG TGGAGGTGTT CAACGACTAC CCGCCGCTGG 1050 

GCCGCTTCGC TGTCCGCGAT 1070 

25 



2) INFORMATION FOR SEQ ID NO: 512 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 
40 (B) STRAIN: HM1-IMMS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATTCGA 50 

45 AACATCTAAA TACTACTTCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100 

TCATTAAGAA CATGATTACT GGAACTTCAC AAGCTGATGT TGCCATCCTT 150 

ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 200 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA GTTAAACAAA 250 

TGATTGTTGG WGTTAACAAG ATGGATGCTA TTCAATATAA ACAAGAAAGA 300 

50 TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 350 

TAATCCAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 400 

ATATGATTGA ACCATCAACC AACATGCCAT GGTACAAAGG ACCAACATTA 450 

ATTGGAGCAC TTGATTCAGT CACACCACCA GAAAGACCAG TTGATAAACC 500 

ACTTAGACTT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

55 TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600 

CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 650 

TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 700 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 750 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 800 

60 TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850 

315 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
ATTGCCATAC ATCACACATT GCATGTAAAT 
ATTGATAGAA GAACAGGTAA ATCCATGGAA 
TAAGAATGGA GATTCAGCAC TTGTTAAGAT 
GTGTTGAAGA ATTTGCTAAA TTCCCACCAT 
5 GA 



PCT/CAOO/01150 

TCGAAGAATT ATTA^GCAAG 900^ 
GGAGGAGAAC CAGAATATAT 950 
TGTTCCAACT AAACCACTTT 1000 
TGGGAAGATT TGCTGTTAGA 1050 

1052 



10 



2) INFORMATION FOR SEQ ID NO: 513 
(i) SEQUENCE CHARACTERISTICS: 



TA) LENGTH!" "108 2" bases" 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Giardia lamblia 

(B) STRAIN: Faubert-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 

25 GACGAGCGCG AGCGCGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTCGA 50 

GACGAAGAAG TACATCGTCA CGATCATCGA CGCCCCGGGC CACCGCGACT 100 

TCATCAAGAA CATGATCACG GGGACGTCCC AGGCCGACGT CGCGATCCTC 150 

GTCGTCGCGG CGGGCCAGGG CGAGTTCGAG GCCGGGATCT CGAAGGACGG 200 

CCAGACGCGC GAGCACGCGA CCCTTGCGAA CACGCTCGGG ATCAAGACGA 250 

30 TGATCATCTG CGTCAACAAG ATGGACGACG GCCAGGTCAA GTACTCGAAG 300 

GAGCGCTACG ACGAGATCAA GGGCGAGATG ATGAAGCAGC TCAAGAACAT 350 

CGGCTGGAAG AAGGCCGAGG AGTTCGACTA CATCCCGACG TCCGGCTGGA 400 

CCGGGGACAA CATCATGGAG AAGTCCGACA AGATGCCCTG GTACGAGGGC 450 

CCGTGCCTGA TCGACGCGAT CGACGGGCTC AAGGCCCCGA AGCGCCCGAC 500 

35 CGACAAGCCC CTCCGCCTCC CGATCCAGGA CGTCTACAAG ATCTCGGGCG 550 

TCGGGACCGT CCCCGCGGGC CGCGTCGAGA CGGGCGAGCT CGCGCCCGGG 600 

ATGAAGGTCG TCTTCGCCCC GACGTCCCAG GTCTCGGAGG TCAAGTCCGT 650 

CGAGATGCAC CACGAGGAGC TCAAGAAGGC CGGGCCCGGG GACAACGTCG 700 

GCTTCAACGT CCGCGGGCTC GCCGTCAAGG ACCTCAAGAA GGGCTACGTC 750 

40 GTCGGGGACG TGACGAACGA CCCGCCCGTC GGCTGCAAGA GCTTCACCGC 800 

CCAGGTCATC GTCATGAACC ACCCGAAGAA GATCCAGCCC GGCTACACGC 850 

CCGTCATCGA CTGCCACACC GCGCACATCG CGTGCCAGTT CCAGCTCTTC 900 

CTCCAGAAGC TCGACAAGCG CACGCTCAAG CCCGAGATGG AGAACCCGCC 950 

CGACGCAGGC CGCGGCGATT GCATCATCGT CAAGATGGTC CCCCAGAAGC 1000 

45 CCCTGTGCTG CGAGACGTTC AACGACTACG CGCCCCTCGG CCGCTTCGCC 1050 

GTCCGCGACA TGCGCCAAAC CGTTGCCGTC GG 1082 



50 2) INFORMATION FOR SEQ ID NO: 514 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

316 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 

5 

ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG CGGCATCACG 50 

ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCCG TGTTCACGAT 100 

CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG ATCACGGGCA 150 

CGTCGCAGGC GGACGCCGCC ATCCTGATGA TCGACTCGAC GCATGGTGGC 200 

10 TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACCCGCGAGC ACGCGCTGCT 250 

TGCCTTCACT CTTGGCGTGA AGCAGATGGT GGTGTGCTGC AACAAGATGG 3 00 



ACGACAAG^^GTGACGTAC"GCGCAGTCGC-GCTACGATGA-GATeAGeAAG 350 

GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG AGAAGGTGCG 4 00 

CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC GAGAAGTCGG 4 50 

15 ACAACATGCC GTGGTACAAG GGTCCCACGC* TGCTGGACGC GCTCGACATG 500 

CTGGAGCCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC TGCCCCTGCA 550 

GGACGTGTAC AAGATCGGCG GTATCGGGAC GGTGCCCGTG GGGCGCGTGG 600 

AGACCGGCAT CATGAAGCCG GGCGACGTGG TGACGTTCGC GCCCGCCAAC 650 

GTGACGACTG AGGTGAAGTC GATCGAGATG CACCACGAGC AGCTGGCGGA 700 

20 GGCGCAGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC GTGTCGGTGA 750 

AGGACATCCG CCGTGGTAAC GTGTGCGGCA ACTCGAAGAA CGACCCGCCG 800 

AAGGAGGCGG CCGACTTCAC GGCGCAGGTG ATCGTGCTGA ACCACCCCGG 850 

CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC ACGAGCCACA 900 

TTGCGTGCCG CTTCGCGGAA ATCGAGTCCA AGATCGACCG CCGCTCCGGC 950 

25 AAGGAGCTGG AGAAGAACCC CAAGGCGATC AAGTCTGGCG ATGCCGCGAT 1000 

CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG TTCAACGACT 1050 

ACGCGCCGCT GGGCCGCTTT GCCGTGCGCG ACATGCGCCA AACCGTTG 1098 



30 



40 



2) INFORMATION FOR SEQ ID NO: 515 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1104 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

50 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGAGG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

55 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

60 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

317 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AGGCGCAGCC CGGCGACAAC GTCGGCTTC* 
AAGGACATCC GCCGTGGCAA CGTGTGCGGC 
GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 
GCCAGATCAG CAACGGCTAC GCGCCGGTGC 
5 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 
CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 
TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 
TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 
CGTC 



PCT/CAOO/01150 

kcSrdMiKX* c^rtfeSTG • -/gof 

AACTCGAAGA ACGACCCGCC BM 

GATCGTGCTG AACCACCCCG 850 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 950 

CAAGTCTGGC GATGCCGCGA 100 0 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTTGC 1100 

1104 



2) INFORMATION FOR SEQ ID NO: 516 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
25 (B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

30 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC KCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

35 GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 4 50 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

40 AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGGCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

45 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

50 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG 1106 



55 2) INFORMATION FOR SEQ ID NO: 517 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

318 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT^GCGCTGTGGA—AGTTCGAGTe-GCCeAAGTGG-GTGTTGAGGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

15 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

20 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

2 5 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

30 CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTG 1099 



35 



45 



2) INFORMATION FOR SEQ ID NO: 518 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1098 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

55 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

60 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

319 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
GACAACATGC CGTGGTACAA GGGTCCCACG 
GCTGGAGCCG CCGGTGCGCC CGGTGGACAA 
AGGACGTGTA CAAGATCGGC GGTATCGGGA 
GAGACCGGCA TCATGAAGCC GGGCGACGTG 
5 CGTGACGACT GAGGTGAAGT CGATCGAGAT 
AGGCGCAGCC CGGCGACAAC GTCGGCTTCA 
AAGGACATCC GCCGTGGCAA CGTGTGCGGC 
GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 
GCCAGATCAG CAACGGCTAC GCGCCGGTGC 
10 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 

CAAG GAGCTG GAGAAGAACC CCAAGGCGAT 

" TCGTGAAGAT GGTGCCG'CA'G" AAGCCGATGT 
TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 



PCT/CA00/01150 

CTGCTGGACG CGCTCGACAT 5Q/) 

GCCGCTGCGC CTGCCCCTGC 550 

CTGTGCCCGT GGGCCGCGTG 600 

GTGACGTTCG CGCCCGCCAA 650 

GCACCACGAG CAGCTGGCGG 700 

ACGTGAAGAA CGTGTCGGTG 750 

AACTCGAAGA ACGACCCGCC 800 

GATCGTGCTG AACCACCCCG 850 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 950 

CAAGTCTGGC GATGCCGCGA 1000 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTT 1098 



15 

2) INFORMATION FOR SEQ ID NO: 519 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 
<B) STRAIN: ATCC 50120 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 



CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

35 CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 150 

GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 250 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG ATCAGCAAGG AGGTGGGCGC 350 

40 GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400 

TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 450 

TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

45 ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGGTGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

50 ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900 

TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 

GGGCGCTTCG CTGTGCGCGA C 1071 

55 



2) INFORMATION FOR SEQ ID NO: 520 
60 (i) SEQUENCE CHARACTERISTICS: 

320 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 
10 (B) STRAIN: ATCC 50121 



(xi) SEQUENCE"DESCRI PTI 0N1~ SEQ- ID-NO": -520- 



CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

15 GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 150 

GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 250 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

20 GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 350 

GTACCTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400 

TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGTCGGA CAACATGCCG 450 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

25 AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

3 0 CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGATCAGCA 850 

ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGCCACAT TGCGTGCCGC 900 

TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 

3 5 GGCCGCTTTG CCGTGCGCGA C 1071 



2) INFORMATION FOR SEQ ID NO: 521 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1114 bases 

(B) TYPE: .Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 

55 TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50 

CGGTATCACG ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 100 

TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

ATCACCGGCA CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 200 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 250 

60 ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 300 

321 
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AACAAGATGG ACGACAAGAC GGTGCAGTAC GCGCAGGCGC GCTACGAGGA 350/ 
GATCAGCAAG GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG 400 
AGAAGGTGCG CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC 450 
GAGAAGTCCG ACAACATGTC GTGGTACAAG GGTCCCACGC TGCTGGAGGC 500 
5 GCTGGACATG CTGGAGGCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC 550 
TGCCCCTGCA GGACGTGTAC AAGATCGGCG GCATTGGCAC GGTGCCGGTG 600 
GGCCGTGTGG AGACCGGCAT CATGAAGCCC GGCGACGTGG TGACGTTCGC 650 
GCCCGCCAAC GTGACGACGG AGGTGAAGTC GATCGAGATG CACCACGAGC 700 
AGCTGCAGGA GGCTGTGCCC GGCGACAACG T.CGGCTTCAA CGTGAAGAAC 750 
10 GTGTCGGTGA AGGACATCCG CCGTGGTAAC GTGTGTGGCA ACTCGAAGAA 800 
CGACCCGCCG AAGGAGGCGG CTGACTTCAC GGCGCAGGTG ATCGTGCTGA 850 



ACCACCCCGG CCAGATCAGC AACGGCTACG~CGCCGGTGCT~GGACTGCCAC 900 

ACCAGCCACA TCGCGTGCCG CTTCGCGGAC ATCGAGTCGA AGATCGACCG 950 
CCGCTCCGGC AAGGAGCTGG AGAAGAACCC CAAGTCCATC AAGTCCGGCG 1000 
15 ACGCCGCCAT CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG 1050 
TTCAACGACT ACCCGCCGCT GGGCCGCTTT GCGGTGCGCG ACATGCGCCA 1100 
AACCGTTGCC GTCG 1114 



20 



30 



2) INFORMATION FOR SEQ ID NO: 522 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

40 ACGTCGCAGG CGGACGCCGC CATTCTGATG ATCGACTCGA CGCATGGCGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCR CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

45 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

50 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAT GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

55 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 110 0 

CGTCGG 1106 



60 



322 
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PCT/CAOO/01150 
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2 ) INFORMATION FOR SEQ ID NO: 523 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 
15 (B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

20 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

25 GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

30 AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

35 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

40 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCG 1105 



45 2) INFORMATION FOR SEQ ID NO: 524 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 

323 
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TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

5 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

10 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 



AGGACGTGTA CAAGATCGGC - GGTATCGGGA _ CGGTGCCCGT"~GGGCCGCGTG —600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

15 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

20 CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098 



25 



2 ) INFORMATION FOR SEQ ID NO: 525 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1081 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: II WT 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 

GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

45 CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 200 

AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 3 00 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 350 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC 400 

50 TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450 

GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 500 

CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 550 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 600 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

55 CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 700 

GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 750 

CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 850 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

60 CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 950 

324 
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CCAAGGCGAT CAAGTCCGGC GATGCCGCGA TCGTGAAGAT GGTGdCGCAG lOOOf 

AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 10^6 

TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081 

5 

2) INFORMATION FOR SEQ ID NO: 526 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

TO STRANDEDNESS;— Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:' Leishmania tropica 

(B) STRAIN: ATCC 50129 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526 

AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 

CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 100 

25 CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150 

GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGATCGACT CGACGCATGG 200 

TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 250 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 300 

ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGATCAG 350 

30 CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 400 

TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 450 

TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 

CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 

TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 

35 GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 650 

CAACGTGACG ACTGAGGTGA AGTCGATCGA GATGCACCAC GAGCAGCTGG 700 

CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 

GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 800 

GCCGAAGGAG GCGGCCGACT TCACGGCGCA GGTGATCGTG CTGAACCACC 850 

40 CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 900 

CACATTGCGT GCCGCTTCGC GGAAATCGAG TCCAAGATCG ACCGCCGCTC 950 

CGGCAAGGAG CTGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 1000 

CGATCGTGAA GATGGTGCCG CAGAAGCCGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 1100 

45 TG . 1102 



50 



2) INFORMATION FOR SEQ ID NO: 527 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Neospora caninum 

325 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(B) STRAIN: Suarez-4 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 

5 GGACRAACTT AAAGCTGAAC GTGAGCGTGG TATCACCATT GATATCTCCC 50 

TGTGGAAATT TGAGACCAGC AAGTACTATG TTACCATCAT TGATGCCCCA 100 

GGACACAGAG ACTTCATCAA AAACATGATT ACAGGCACAT CCCAGGCTGA 150 

CTGTGCTGTC CTGATTGTTG CTGCTGGTGT TGGTGAATTT GAAGCCGGTA 200 

TCTCCAAGAA CGGGCAGACC CGTGAGCATG CCCTTNTGGC TTACACCCTG 250 

10 GGTGTGAAAC AACTAATTGT TGGCGTTAAC AAAAKGGATT CCACTGAGCC 300 

ACCCTATAGC CARAAGAGAT ACGARGAAAT TGTTAAGGAA GTCAGCMCCT 350 



AYNTTAAAAA~ AATTGGYTAC" AACCCCGACA - CAGTANCATT _ TGKGCCAATT 400 

TNTGGCTGGA ATGGTGACAA CATGCTGGAN CCAAGTGCTA ATATGCCATG 450 

GTTCAAGGGA TGGAAAGTCM CCCGTAAGGA CGGCAATGCC AGKGGAACCM 500 

15 CCCTGCTTGA AGCTYTGGAT TGCATTYTGC CACCAAYTTG CCCAACTGAC 550 

AAACCCTTGC GTTTGCCTYT CCAGGATGTC TATAAAATTG GKGGTATTGG 600 

TACTGTCCCT GTGGGTCGTG TGGAGACTGG TGTTCTCAAA CCTGGCATGG 650 

TGGTCACCTT TGCTCCAGTC AATGTAACAA CTGAAGTGAA GTCTGTAGAA 700 

ATGCACCATG AAGCATTGAG TGAAGCCCTT CCTGGGGACA ATGTGGGCTT 750 

20 CAATGTCAAG AACGTGTCTG TCAAAGATGT CCGTCGTGGC AATGTGGCTG 800 

GTGACAGCAA AAATGATCCA CCCATGGAAG CTGCTGGCTT CACAGCTCAG 850 

GTGATTATTT TGAACCATCC AGGCCAAATC AGTGCTGGAT ATGCACCTGT 900 

GCTGGATTGT CACACAGCTC ACATTGCTTG CAAGTTTGCT GAGCTGAAGG 950 

AGAAGATTGA TCGTCGTTCT GGGAAAAAGC TGGAAGATGG CCCTAAATTC 1000 

25 TTGAAATCTG GTGACGCTGC CATCGTTGAT ATGGTTCCTG GCAAGCCCAT 1050 

GTGTGTCGAG AGCTTCTCTG ATTATCCTCC CCTGGGCCGT TTTGCTGTGC 1100 

GTGAC 1105 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 528 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 935 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 3 0001 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 

GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50 

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100 

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150 

50 TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CGAAATCACA 200 

GCCGAAATGA CACGCATCCT TACAGGCATC GGCTACAAGC CAGAAATGTT 250 

CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 300 

CTCCAAACAT GCCATGGTAC AATGGCCCAT ACCTTCTTGA AGCCCTCGAT 350 

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 400 

55 CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTCCA GTCGGCCGTG 450 

TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 500 

ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550 

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600 

CAGCTGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 650 

60 CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700 

326 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
AGGCAAGATC CACGCCGGCT 
ACATCGCCTG CAAGTTCGAC 
GGCAAGAAGG CTACAGAGAA 
TATCGTCGAG GTTGTCCCAT 
5 AGTACCCACC ACTCGGCCGT 



ACCAGCCAGT TTTCGACTGC 
AAGCTCATCC AGCGTATCGA 
CCCAGAATAC ATTCAGAAGG 
CCAAGCCACT CGTCGTCGAG 
TTCGCCATCC GTGAT 



PCT/CA00/01150 

CACAOAGCTC 750 J 

TCGTCGCCAC 8 00/ 

ATGATGCCGC 850 
TCCTTCCAGG 900 
935 



10 



2) INFORMATION FOR SEQ ID NO: 529 
(i) SEQUENCE CHARACTERISTICS: 



"(A) LENGTH": ~T0'6 5~bases - 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Trypanosoma brucei subsp. brucei 

(B) STRAIN: EATR0795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52-9 

25 AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGATCGACA TTGCACTGTG 50 

GAAATTCGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 100 

ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 250 

30 TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 300 

TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 350 

CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 400 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTCCAA CGCTCCTGGA GGCACTAGAC ATGCTGGAGC CACCAGTGCG 500 

35 TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 550 

GTGGTATTGG CACCGTGCCC GTTGGTCGTG TGGAGACCGG CGTGATGAAG 600 

CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 650 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

4 0 AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 800 

CACGGCACAG GTGATCATCC TGAACCACCC CGGACAGATT GGAAACGGTT 850 

ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 900 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 1000 

45 AGAAGCCTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050 

TTTGCCGTGC GTGAC 1065 



50 2) INFORMATION FOR SEQ ID NO: 530 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 



327 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Crithiaia rascicuiata 

(B) STRAIN: ATCC 11745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 

5 

TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG TGACGGAGGA 50 
CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG CACTTGGACG 100 
CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT GCTGAAGCTG 150 
AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC CGGTTGGCCG 200 
10 CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG ATCGACCAGC 250 
GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC CGAGGCGCCG 300 



AAGCTGGCGG ACCAGGCCGC GGAGGACGCG~~ATTCTGACGA — CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG ATCGGGCTGT 400 

TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

15 AACGTGGCCA AGGGCCACGG TGGTTTCTCC' GTGTTCGCCG GCGTTGGCGA 500 

GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA CGGCCAGATG 600 

AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGTGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

20 ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC CGCCCTGCTG 750 

GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG CCGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC TCGATTACGT 850 

CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA TCCGGCGCCG 900 

GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG ACCGCGCGGT 950 

25 TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

CGTGCTGGGT ATCGACGAGC TGAGCGAGGA GGACAAGCTT GTGGTGGACC 1150 

GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA GGTGGCCGAG 1200 

30 GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG ACACAGTGGA 1250 

GTCGTTCTCT GGCCTGCTGA TGGGCTCGTA CGACCAGATC CCGGAGA 1297 



35 2) INFORMATION FOR SEQ ID NO: 531 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 



CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACGG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

55 GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG 2 50 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

60 CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

328 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGT&3AAG 55J0 
GTGATTGACC TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT 600 
GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 
TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 
5 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 
GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 
TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 
TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 
CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 
10 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 
CGTATCATGG ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 



GGATATCGTG CAGATGCTGA CCAAGTACAjr~GGAGCTGCA"G~"GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC 1150 
CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 
15 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 
AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



20 2) INFORMATION FOR SEQ ID NO: 532 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 



TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

40 TGAGACGCTG GGCCGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 250 

GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

45 AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGCGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

50 ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CAGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

55 GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGACGTG ATCGATGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200 

60 GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 1250 

329 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGGA^SA 1297. 



5 2) INFORMATION FOR SEQ ID NO: 533 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 



CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

25 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

30 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

35 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

4 0 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

4 5 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



50 2) INFORMATION FOR SEQ ID NO: 534 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 



330 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 

5 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

10 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

AAAGCTGGCG""GACCAGGCCG -CGGAGGACAe-GATCGTGAGG-AGGGGGATGA 3 5 0- 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

15 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGG GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

20 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACTG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

25 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

30 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



35 2) INFORMATION FOR SEQ ID NO: 535 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1301 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 



GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

AGGACCTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 100 

GACGCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

55 GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGATCGAC 250 

CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 300 

CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 350 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 400 

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450 

60 CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 
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GCGAGCGCAC GCGCGAGGGC ACGGACCTGT ACCTGGAGAT GATGCAGTCG " ' SSt) 
AAGGTGATTG ACCTGAAGGG CGAGTCGAAG TGCGTGCTTG TGTACGGGCA 600/ 
GATGAACGAG CCCCCGGGTG CGCGCGCGCG CGTTGCGCAG TCTGCGCTGA 650 
CGATGGCGGA GTACTTCCGC GACGTGGAGG GCCAGAACGT GCTGCTGTTC 700 
5 ATCGACAACA TCTTCCGCTT CACGCAGGCG AACTCCGAGG TGTCCGCGCT 750 
GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA CCAGCCGACG CTTGCGGAGG 800 
ATCTTGGTAT GCTGCAGGAG CGCATCACGT CGACAACGAA GGGGTCGATC 850 
ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG GATGATATCA CGGATCCCGC 900 
GCCCGCGACG ACGTTCTCGC ACCTTGACGC GACGACTGTG CTGGACCGCG 950 

10 CGGTGGCGGA GTCGGGCATC TACCCTGCCG TGAACCCGCT GGAGTGCGCG 1000 
TCGCGTATCA TGGACCCCGA TGTGATCGAT GTGGACCACT ACAACGTTGC 1050 

GCAGGATATC" "GTGCAGATGC - TGACCAAGTA - CAAGGAGGTG-GAGGAGATGA 1-100- 

TTGCGGTGCT TGGCATCGAC GAGCTGAGCG AGGAAGACAA GGTTGTGGTG 1150 
GACCGCGCGC GCAAGGTGAC CCGGTTCCTG TCGCAGCCGT TCCAGGTTGC 1200 

15 GGAGGTGTTC ACGGGCATGA CGGGCCACTA. CGTGCAGCTG GTCGACACGG 1250 
TGGAGTCGTT CTCTGGCTTG CTGATGGGGT CGTACGACCA GATCCCGGAG 1300 
A 1301 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 536 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536 



CTTCGCGGAG 
ACCTCGGCCG 
GCGAACACCG 

40 GAAGTCGAAG 
GCGAGACGCT 
CGCGGCCCTG 
GAAGCTGGCG 
AGGTGATCGA 

4 5 TTCGGTGGTG 
TAACGTGGCG 
AGCGCACGCG 
GTGATTGACC 
GAACGAGCCC 

50 TGGCCGAGTA 
GACAACATCT 
GGGTCGCATT 
TGGGCATGCT 
TCTGTGCAGG 

55. CGCGACGACG 
TGGCAGAGTC 
CGTATCATGG 
GGATATCGTG 
CCGTGCTTGG 

60 CGCGCGCGCA 



GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT 
CGATGAGCCG CTGACGCTGG AGATTGTGCA 
GTCGCTGCAT TGCGATGCAG ACGACGGACC 
GTCGTGTCGA CCGGTGGCAA CATCTCTGTG 
GGGTCGCATC TTCAACGTGC TTGGCGATGC 
TGGGTGAGAA GATGCGCATG GCGATCCACG 
GATCAGGCGG CAGAGGACAC GATCCTGACG 
TCTTATTCTG CCGTACTGCA AGGGTGGTAA 
CCGGTGTAGG CAAGACTGTG ATTATTATGG 
AAGGGCCACG GTGGGTTTTC CGTGTTTGCT 
CGAGGGCACT GACCTGTACC TGGAGATGAT 
TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT 
CCGGGTGCGC GTGCGCGCGT TGCGCAGTCT 
CTTCCGCGAT GTGGAGGGCC AGAACGTGCT 
TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT 
CCTGCCGCCG TGGGCTACCA GCCGACGCTT 
GCAGGAGCGC ATTACGTCGA CGACGAAGGG 
CCGTGTACGT GCCTGCGGAT GATATCACGG 
TTCTCGCACC TGGACGCGAC GACTGTGCTG 
GGGCATTTAC CCTGCGGTGA ACCCGCTGGA 
ACCCCGATGT GATCGATGTG GACCACTACA 
CAGATGCTGA CCAAGTACAA GGAGCTGCAG 
TATCGACGAG CTGAGCGAGG AGGACAAGGT 
AGGTGACCCG GTTCCTGTCG CAGCCGTTCC 

332 



GTGACGGAGA 50 

GCACTTGGAC 100 

TGCTGAAGCT 150 

CCTGTTGGCC 200 

GATTGACCAG 250 

CCGAGGCGCC 300 

ACCGGCATCA 3 50 

GATCGGTCTG 400 

AGCTGATCAA 450 

GGCGTGGGCG 500 

GCAGTCGAAG 550 

ACGGACAGAT 600 

GCGCTGACGA 650 

GCTGTTCATT 700 

CTGCGCTGCT 750 

GCGGAGGATC 800 

CTCGATTACG 850 

ACCCGGCGCC 900 

GACCGTGCGG 950 

GTGCGCGTCG 1000 

ACGTTGCGCA 1050 

GATATCATTG 1100 

TGTGGTGGAC 1150 

AGGTTGCGGA 1200 
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GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACAdGGTGG 12 50 
AGTCGTTCTC TGGCCTACTG ATGGGGTCAT ACGACCAGAT CCCGGkGA 129B 



5 

2) INFORMATION FOR SEQ ID NO: 537 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1297 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

CD") TOPOLOGY" Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 

CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

25 GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

30 TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

35 TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

40 CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

45 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 1297 



50 

2) INFORMATION FOR SEQ ID NO: 538 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1297 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACCTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 250 

GCGGCCCCGT GGGTGAGAAG ATGCGCATGG - CGATCCACGC _ CGAGGCCCCG 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 450 

AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATATCACGGA TCCGGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 1250 

GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 1297 



2 ) INFORMATION FOR SEQ ID NO: 539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539 
CCITACATCC TBGTYGCICT IAACAAG 27 



2) INFORMATION FOR SEQ ID NO: 540 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA • 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540 
GGDGCITCYT CRTCGWAITC CTG 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 541 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 

(B) TYPE7~Nuclelcr acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 
GTKGAAATGT TCCGCAAGCT GCT 



2) INFORMATION FOR SEQ ID NO: 542 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 
CGGAARTAGA ACTGSGGACG GTAG 



2) INFORMATION FOR SEQ ID NO: 543 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 
ATCTTAGTAG TTTCTGCTGC TGA 



2) INFORMATION FOR SEQ ID NO: 544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544 
AYGTTGTCGC CMGGCATTMC CAT 23 



2)1 NFORMATI ON" FOR - SEQ~I D~NO :~ 54 5 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 
TACATCCTBG TYGCICTIAA CAAGTG 26 

25 

2) INFORMATION FOR SEQ ID NO: 546 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546 
40 CCRCGICCGG TRATGGTGAA GAT 23 



2) INFORMATION FOR SEQ ID NO: 547 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547 

55 

GTACAGTTGC TTCAGGACGT ATC 23 



60 2) INFORMATION FOR SEQ ID NO: 548 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548 

acgttcgatt tcatcacgtt— G 2 



2) INFORMATION FOR SEQ ID NO: 549 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 
GAACGTGATA CTGACAAACC TTTA 



2) INFORMATION FOR SEQ ID NO: 550 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550 
GAAGAAGAAC ACCAACGTTG 



2) INFORMATION FOR SEQ ID NO: 551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 

GAAGAAAAAA TCTTCGAACT GGCTA 2 
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2) INFORMATION FOR SEQ ID NO: 552 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

~ ( 1 1) - M0I7ECUL"E ~TYPET~DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552 
TACACGGCCG GTGACTACG 



2) INFORMATION FOR SEQ ID NO: 553 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553 
GGCCGTGTTG AACGTGGTCA AATCA 



2) INFORMATION FOR SEQ ID NO: 554 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554 



GTTCCTTACA TCGTTGTTTT TCTC 



2) INFORMATION FOR SEQ ID NO: 555 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
5 TCTCGAACTT TCTCTATGTA TGCA 24 



2) INFORMATION FOR SEQ ID NO: 556 

10 

(i)_^QUENCE_CHARACTE^RJCSTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 



(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556 

20 

CGGCGCNATC YTSGTTGTTG C 21 



25 2 ) INFORMATION FOR SEQ ID NO: 557 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557 

CCMAGGCATR ACCATCTCGG TG 22 



40 



2) INFORMATION FOR SEQ ID NO: 558 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



55 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
TCITTYAART AYGCITGGGT 20 

2) INFORMATION FOR SEQ ID NO: 559 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 
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WO 01/23604 PCT/CA00/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 
CCGACRGCRA YIGTYTGICK CAT 23 



2 ) INFORMATION FOR SEQ ID NO: 560 

15 (i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 
25 GAYTTCATYA ARAAYATGAT YAC 23 



2 ) INFORMATION FOR SEQ ID NO: 561 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561 

40 

ACIGTICGGC CRCCCTCACG GAT 23 



45 2 ) INFORMATION FOR SEQ ID NO: 562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 

CARATGRAYG ARCCICCIGG IGYIMGIATG 30 
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PCT/CAOO/01150 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(x i) S EQUENCE DESCRIPTION: SEQ ID NO: 563 



15 



50 



GGYTGRTAIC CIACIGCIGA IGGCAT 26 



2) INFORMATION FOR SEQ ID NO: 564 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 29 bases 

'(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 
TAYGGICARA TGAAYGARCC ICCIGGIAA 29 

30 

2) INFORMATION FOR SEQ ID NO: 565 

35- (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
45 GGYTGRTAIC CIACIGCIGA IGGDAT 26 



2) INFORMATION FOR SEQ ID NO: 566 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 

60 
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2) INFORMATION FOR SEQ ID NO: 567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ- ID NO: 567 
TCRTCIGCIG GIACRTAIAY IGCYTG 



2) INFORMATION FOR SEQ ID NO: 568 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568 
RTIATIGGIG CIGTIRTIGA YGT 



2) INFORMATION FOR SEQ ID NO: 569 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569 
RTIRTIGGIS CIGTIRTIGA TAT 



2) INFORMATION FOR SEQ ID NO: 570 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570 
5 RTIRYIGGIC CIGTIRTIGA YGT 23 



2) INFORMATION FOR SEQ ID NO: 571 

10 

SEQUENCE CHARACTERISTICS : 

(A) LENGTH : - 2"3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571 

20 

RTIRTIGGIC CIGTIRTIGA TGT 23 



25 2) INFORMATION FOR SEQ ID NO: 572 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572 

RTIRTIGGIS CIGTIRTIGA 20 



40 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 573 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573 
CCICCIACCA TRTARAAIGC 20 

2 ) INFORMATION FOR SEQ ID NO: 574 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 
ATIGCIATGG AYGGIACIGA RGG 



2) INFORMATION FOR SEQ ID NO: 575 

(i) SEQUENCE CHARACTERISTICS : • 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575 
TIACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 
AACTTCRTCA AGAAGGTYGG TTACAA 



2) INFORMATION FOR SEQ ID NO: 577 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577 
CATGATTGAA CCATCCACCA 



344 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
2 ) INFORMATION FOR SEQ ID NO: 578 



PCT/CAOO/01150 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 



15 



CATGATTGAA GCTTCCACCA 20 



2 ) INFORMATION FOR SEQ ID NO: 579 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579 
GAAGGCCGTG CTGGTGAGAA 2 0 

30 



2) INFORMATION FOR SEQ ID NO: 580 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580 
45 GCTAAACCAG CTACAATCAC TCCAC 25 



50 



2 ) INFORMATION FOR SEQ ID NO: 581 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 

60 
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ACATCGGTGC ATTATTACGT GG /22 



5 2) INFORMATION FOR SEQ ID NO: 582 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 

TTTCAACTTC GTCGTTGACA CGAACAGT 28 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 583 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 35 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 35 

2) INFORMATION FOR SEQ ID NO: 584 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
ACATGACACA TCTAAAACAA 20 

50 



2) INFORMATION FOR SEQ ID NO: 585 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
ACCACATACT GAATTCAAAG 



2) INFORMATION FOR SEQ ID NO: 586 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH~2~(r Eases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 
CAGAAGTATA CGTATTATCA 



2) INFORMATION FOR SEQ ID NO: 587 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 
CGTATTATCA AAAGACGAAG 



2) INFORMATION FOR SEQ ID NO: 588 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
TCTTCTCAAA CTATCGTCCA 



2) INFORMATION FOR SEQ ID NO: 589 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589 
GCACGAAACT TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 590 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
TATACGTATT ATCTAAAGAT 



2 ) INFORMATION FOR SEQ ID NO: 591 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 
TCCTGGTTCT ATTACACCAC 



2) INFORMATION FOR SEQ ID NO: 592 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 
CAAAGCTGAA GTATACGTAT 
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2) INFORMATION FOR SEQ ID NO: 593 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593 



TTCACTAACT ATCGCCCACA 



2) INFORMATION FOR SEQ ID NO: 594 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 



2) INFORMATION FOR SEQ ID NO: 595 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595 
TTAAAGCAGA CGTATACGTT 



2) INFORMATION FOR SEQ ID NO: 596 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:. Linear " 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 
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GAAATTATTG GTATCAAAGA 
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2) INFORMATION FOR SEQ ID NO: 597 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597 
ATTGGTATCA AAGAAACTTC 



2) INFORMATION FOR SEQ ID NO: 598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT 



2) INFORMATION FOR SEQ ID NO: 599 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599 
CGGTGAAGAA ATCGAAATCA 



2) INFORMATION FOR SEQ ID NO: 600 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600 
5 ATGCAAGAAG AATCAAGCAA 20 



2) INFORMATION FOR SEQ ID NO: 601 

10 

(i) SEQUENCE CHARACTERISTICS: 

" (A) LENGTH f~20 — bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 

20 

GTTTCACGTG ATGATGTACA 20 



25 2) INFORMATION FOR SEQ ID NO: 602 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602 

AAGTTGAAGT TGTTGGTATT 20 



(i) 

30 



40 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 603 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603 
GGTATTAAAG ACGAAACATC 20 

2) INFORMATION FOR SEQ ID NO: 604 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 20 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GGTGATGAAG TAGAAATCGT 20 



10 



2 ) INFORMATION FOR SEQ ID NO: 605 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single . 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 
25 GAAATGTTCC GTAAATTATT 20 



2 ) INFORMATION FOR SEQ ID NO: 606 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 

40 

ATTAGACTAC GCTGAAGCTG 20 



45 2) INFORMATION FOR SEQ ID NO: 607 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: ATCC 2 9212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 
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CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

GTGAACATAT CTTATTATCA CGTAACGTTG GTGTACCATA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 200 

5 ATGTTCCAGT TATCGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGACGAG 250 

TCTTATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG ACGTGTTGAA 400 

CGTGGTGAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTAAAGA 450 

10 CGAAACATCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTAT 50 0 

TAGACTACGC TGAAGCAGGC GACAACTTCG GTGCTTTATT ACGTGGTGTA 550 



GCACGTGAAG ATXTCGAA'C G ~TGGACAAGTA~TTAGCTAAAC-CAGCTACAAT 600" 

CACTCCACAC ACAAAATTCA AAGCTGAAGT ATACGTATTA TCAAAAGAAG 650 
AAGGCGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 700 
15 TTCCGTACAA CAGACGTTAC TGGTGTTGTA' GAATTGCCAG AAGGTACTGA 750 
AATGGTAATG CCTGGTGATA ACGTTGCTAT GGACGTTGAA TTAATTCACC 800 
CAATCGCTAT CGAAGACGGA A 821 



20 



30 



2) INFORMATION FOR SEQ ID NO: 608 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 

CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCT CGTCAAGTTG GTGTTCCTTA CATCGTTGTA 100 

TTCTTGAACA AAGTAGACAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

4 0 TGAAATGGAA GTTCGTGACC TATTAACAGA ATACRAATTC CCTGGTGRCG 200 

ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA AAGCTCTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATTCT TGAATTAATG GCTGCAGTTG ACGAATACAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 350 

ACGTGTTCTC AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

45 CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAGTTGTTG GTATTGCTGA 450 

AGAAACTTCA AAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACYACGC TGAAGCTGGA GACRACATTG GTGCTTTACT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

50 AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



55 

2) INFORMATION FOR SEQ ID NO: 609 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 4 9573 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609 



CGGTGCGATC TfAGTAGTAT ~CTGCTGCTGA~~CGGTCCTATG~CCTCAAACTC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GCGWACCATA CATCGTTGTT 100 

TTCTTGAACA AAATGGATAT GGTTGAYGAC GAAGAATTGC TAGAATTAGT 150 

15 TGAAATGGAA GTTCGTGACC TATTGTCTGA ATATGACTTC CCAGGCGACG 200 

ATGTTCCTGT AATCGCCGGT TCTGCTTTGA AAGCTCTTGA AGGAGATCCT 250 

TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 300 

TCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

20 CGTGGACAAG TTCGCGTTGG TGATGAAGTA GAAATCGTTG GTATTGCTGA 450 

CGAAACTGCT AAAACAACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACRAT 600 

CACACCTCAT ACAAAATTCA AAGCTGAAGT TTATGTTTTG ACAAAAGAAG 650 

25 AAGGTGGRCG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 610 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 891 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae Rd 

(B) STRAIN: KW20 

(C) ACCESSION NUMBER: extracted from U32739 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 



AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC 100 

50 GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 150 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 200 

TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC GTACGTGGTT 250 

CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 300 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT 3 50 

55 TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 4 00 

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 450 

GATGAAGTAG AAATCGTCGG TATCAAAGAT ACAGCGAAAA CTACTGTAAC 500 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 550 

ACATCGGTGC ATTATTACGT GGTACCAAAC GTGAAGAAAT CGAACGTGGT 600 

60 CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 
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AGAAGTGTAC GTATTATCAA AAGATGAAGG 
TCAAAGGTTA CCGTCCACAA TTCTATTTCC 
ACAATCGAAT TACCAGAAGG CGTGGAAATG 
CAAGATGACA GTAAGCTTAA TCCACCCAAT 
GTTTCGCAAT CCGTGAAGGT GGCCGTACAG 



PCT/CAOO/01150 

TGGTCGTCAT ACTCCA^TCT 700 

GTACAACAGA CGTGACTGGT 750- 

GTAATGCCAG GCGATAACAT 800 

TGCGATGGAT CAAGGTTTAC 850 

TAGGTGCAGG C 891 
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2) INFORMATION FOR SEQ ID NO: 611 
(i) SEQUENCE CHARACTERISTICS: 



~ (A) LENGTH f "81"8 "Bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14 990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 

25 CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 250 

30 GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATGCACGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

35 ACTACGCTGA- AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCTATTAC 600 

ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 750 

40 GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

TCGCTATCGA AGACGGAA 818 



45 2 ) INFORMATION FOR SEQ ID NO: 612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Parathyphi A 

(B) STRAIN: ATCC 9150 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 
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20 



35 



55 



2) INFORMATION FOR SEQ ID NO: 613 



(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613 



2) INFORMATION FOR SEQ ID NO: 614 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 653 bases 

356 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAbACCCG 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

5 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400 

10 GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 



(TGAAG"GCCGT~"GCCGGTGAGA — ACGTAGGTGT _ TCTGeTGCGT--GGT-A-TCAAAG 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

15 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTC 825 



GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GYCAGGTTGG CGTTCCTTTC ATCATCGTRT 100 

40 TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT GCTGTCCGCT TACGACTTCC CTGGCGATGA 200 

CCTGCCGGTG ATTCGCGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG AMCMSCTGGA TACTTACATC 300 

CCAGAACCAG AGCGCGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

45 CGTATTCTCC ATCTCCGGTC GTGGTACCGT GGTTACCGGT CGTGTTGAGC 400 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTCAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCCATCAAG 600 

50 CCGCACACCC AGTTCGATTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACKCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCTG GCGACAACGT GAACATGA 778 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

_(xi) SEQ UENCE DE SCRIPTION :SEQ ID NO: 614 



GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

15 CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150 

AAATGGAAGT ACGTGAATTA CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 200 

ATTCCAGTGC TTAAAGGTTC TGCTTTGAAA GCATTAGAAG GCGATCCAGA 250 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TACCGTTGAT GAATATATCC 300 

CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 350 

20 GTTTTCTCGA TCACAGGACG TGGTACTGTA GCTTCTGGTC GTATCGACCG 400 

TGGCGAAGTT AAAGTCGGCG ATGAAATTGA AATCATCGGG ATCAAACCTG 450 

AAGTTCAAAA AGCAATCGTT ACTGGACTTG AAATGTTCCG TAAAACATTG 500 

GATTATGGTG AAGCTGGCGA TAACGTTGGG GTTCTATTAC GTGGGATTAC 550 

ACGTGATGAA ATCGAACGTG GCCAAGTATT AGCTAAACCA GGTTCAATCA 600 

25 CACCACATAC TAAGTTCAAA GCCGAAGTAT ATGTGTTGAC GAAAGAAGAA 650 

GGT 653 



30 2) INFORMATION FOR SEQ ID NO: 615 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 

(B) STRAIN: ATCC 19432 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 



CCATTCTAGT TGTATCTGCA ACAGATGGAC CAATGCCACA AACACGTGAA 50 

CATATTTTAT TGTCACGTCA AGTAGGTGTT AAATATTTGA TCGTCTTCTT 100 

GAACAAAATC GACTTAGTAG ATGATGAAGA ATTGATTGAT CTTGTCGAAA 150 

TGGAAGTTCG TGAATTATTA AGCGAATATG GTTTCCCAGG TGACGATACA 200 

50 CCAGTCATCA AAGGTTCAGC ATTAAAAGCT TTACAAGGAG ATCCTGATGC 250 

AGAAGCAGCT ATCATGGAAT TGATGGATAC TGTTGATGAA TATATCCCAA 300 

CACCAGAACG TGATACAGAC AAACCATTAT TGTTACCAGT GGAAGATGTC 350 

TTCTCAATCA CAGGTCGTGG GACTGTTGCT TCAGGTCGTA TCGATCGTGG 400 

TGCAGTTCGT GTAGGTGATG AAATCGAAAT CGTCGGTATC AAACCTGAAA 450 

55 CACAAAAAGC TGTTGTAACT GGGGTCGAAA TGTTCCGCAA GACATTAGAC 500 

TATGGTGAAG CAGGAGATAA CGTTGGGGTA TTGTTACGTG GTATCCAACG 550 

TGAAGATATC GAACGTGGAC AAGTAATCGC AAAACCAGGT TCAATCACAC 600 

CACATACAAA ATTCAAAGCA GAAGTGTACG TATTGACAAA AGAAGAAGGT 650 

GGACGTCATA CACCATTCTT CAATAACTAT CGTCCACAAT TCTACTTCCG 700 

60 TACAACTGAC GTAACTGGAA CAATCGTTTT ACCTGGAGGC ACTGAAATGG 750 

357 
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TTATGCCTGG AGATAACGTA ACGATCGACG TTGAATTGAT CCATOCAGTT 80/0 
GCCATCGAAA ACGGAACAAC TTTCTCTAT 829 



2) INFORMATION FOR SEQ ID NO: 616 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 669 bases 
10 (B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 



(D) TOPOLOGY~LiWar 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616 

GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC CACAAACACG 50 

TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

25 GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 200 

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GACTTGATGG ATACGGTTGA TGAATACATC 300 

CCAACGCCTG AACGTGATAC TGACAAACCA TTCTTGTTAC CAGTCGAAGA 350 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 400 

30 GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 650 

35 AGGTGGCCGT CATACGCCA 669 



2) INFORMATION FOR SEQ ID NO: 617 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 512 66 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617 

55 CGGGGCAATT TTAGTTGTAT CTGCAACTGA TGGCCCAATG CCACAAACAC 50 

GTGAACACAT TTTGTTAGCT CGTCAAGTAG GGGTTAAATA TTTAATCGTC 100 

TTCTTGAACA AAACAGATTT AGTTGATGAT GAAGAATTAT TGGAACTAGT 150 

TGAAATGGAA GTTCGTGAAT TATTAAATGA ATACAATTTC CCTGGCGATG 200 

ATATTCCTGT TATTCGCGGA TCTGCTTTAA AAGCATTAGA AGGCGATCCA 250 

60 GAACAAGAAG AAGTAATTAT GAACTTGATG GATACTGTGG ATGAATATAT 300 

358 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CCCAACTCCA 
ATGTCTTCAC 
CGTGGTAAAG 
AGAAACACAA 
TGGATTATGG 
ACTCGTGATG 
TACACCGCAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCCGT 
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GAACGTGACA 
AATTACTGGT 
TCAACGTTGG 
AAAGCTGTTG 
TGAAGCTGGT 
AAGTAGAACG 
ACCAAATTTA 
TCATACTCCT 
CTGATGTGAC 
CCTGGTGATA 
TGAAAAAGGG 



ATGATAAACC 
CGTGGTACTG 
TGATGAAATT 
TAACCGGTTT 
GATAACGTTG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTTAATA 
TGGTAACATC 
ATGTAACAAT 
ACTACTTTCT 



ATTCTTGTTA CCACflGGAAG 
TTGCTTCAGG TCGTATCGAC 
GAAATTATCG GAATTAAACC 
GGAAATGTTC CGTAAAACTT 
GGGTCTTATT ACGTGGGATT 
TTAGCAAAAC CAGGTTCCAT 
TTATATCTTA ACAAAAGAAG 
ACTATCGTCC TCAATTTTAT 
GCATTACCTG AAGGAACTGA 
TGAAGTTGAA TTGATTCATC 
CAATT 



350/ 

400 

450 

500 

550 

600 

650 

700 

750 

800 

835 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 618' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 

GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 50 

TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 100 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 200 

TATTCCAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 300 

CCAACTCCAG AACGTGACAC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 350 

TGTATTTTCT ATCACTGGTC GTGGGACTGT AGCGTCTGGA CGGATTGATC 400 

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 500 

AGATTATGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 673 



2) INFORMATION FOR SEQ ID NO: 619 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: K00428 

359 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619 



TTTCATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 
5 ATATCCTTTC GCGACCAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 
GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGCTGAAA 
AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 
AGCTCGACAA AGGTAACAGA CAAAAATGTC AGCTTTATTA CCAAGATTAC 
TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTTCTGAG GCTCTCTTCA 

10 GTAATTTCTA GGACCTTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 

TGATCGTTCC AAACCGCATG TAAATATAGG TACGATCGGC CATGTTGATC 

AfG^GAXGACAACTTTAACC^GCAGCCATTA^CGAAAACGTT— AGCCGCAAAA 
GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTCCGGAAGA 
AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 

15 CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 
AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 
AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTACTTTTGG 
CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 
ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 

20 ACTTTTAAAC GAATATGGGT TTGACGGTGA TAATGCTCCA ATTATCATGG 
GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 
GCCATCATGA AACTTTTGGA TGCAGTGGAT GAGTATATTC CTACACCTGA 
AAGAGATTTG AACAAGCCTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 
TCTCCGGTAG AGGTACTGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 

25 AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 
AACAACAGTT ACTGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 
TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGTATCAG GAGAGATCAA 
TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCCCATAC 
AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 

30 ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 
GATGTTACAG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG ATCATTCTAT 
GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 
CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA GGGTGGAAGA 
ACTGTTGGTA CCGGTCTAAT CACACGTATT ATTGAATAGA CTTATTGATG 

35 CAACTGGAGT ATATTTCTAT ATATTCTGTT CATTTCCCCT CTCATAATAT 
ATACTTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 
TTCAGCTTTA CTTTTGGTAG AGTCCTGCTA GCACTAGATT TTACAATTTC 
ATGTGCACAC ACC 

40 

2) INFORMATION FOR SEQ ID NO: 620 

( i ) SEQUENCE CHARACTERI STICS : 
45 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 
ATTGGTGCAT TGCTACGT 18 

55 



2) INFORMATION FOR SEQ ID NO: 621 

60 (i) SEQUENCE CHARACTERISTICS: 

360 
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100 
150 
200 
250 
300 
350 
400 
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500 
550 
600 
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(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
10 (B) STRAIN: ATCC 19434 



(xir~SEQUENCE~DESCRTPTION-:— SEQ -ID-NO :- -6 2-1- 



TGGTGCAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50 

15 GTGAACACAT TTTATTGTCA CGCCAAGTTG GTGTAAAATA CCTGATTGTT 100 

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150 

AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 200 

ACACTCCTGT GATCAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 250 

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 300 

20 CCCAACACCA GAACGTGATA CAGATAAACC ATTACTATTG CCAGTGGAAG 350 

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 400 

CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 450 

TGAAACACAA AAAGCAGTTG TAACAGGTGT AGAAATGTTC CGTAAAACGT 500 

TAGATTACGG GGAAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550 

25 CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 600 

TACTCCACAT ACAAAATTCA AAGCAGAAGT GTACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTATCGTCC ACAGTTCTAC 700 

TTCCGCACAA CTGATGTTAC AGGAACAATC ACATTGCCAG AAGATACAGA 750 

A 751 

30 



2 ) INFORMATION FOR SEQ ID NO: 622. 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
45 (B) STRAIN: ATCC 13264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 

GTCAAATGGG ACGAATCCAG ATTCCAAGAA ATTGTCAAGG AAACCTCCAA 50 

50 CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 100 

TCTCTGGTTG GAACGGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150 

TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 200 

GACTTTGTTG GAAGCCATTG ACGCCATTGA ACAACCATCT AGACCAACTG 250 

ACAAGCCATT GAGATTGCCA TTGCAAGATG TTTACAAGAT TGGTGGTATT 300 

55 GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 350 

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTCCGTTG 4 00 

AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 450 

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500 

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550 

60 CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 600 

361 
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GTTTTGGATT GTCACACTGC TCACATTGCT 
GGAAAAGAAC GACAGAAGAT CTGGTAAGAA 
TCTTGAAGTC CGGTGACGCT GCTTTGGTCA 



TGTAGATTCG ACGAATTGTT 
GTTGGAAGAC CATCCAAAGT 
AGTTCGTTCC ATCTAAGCCA 



ssjb 

700 
750 



20 



25 



30 



35 



40. 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 623 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1269 bases 
(B) TYPE: N ucleic _acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:' Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 

TCTTGAAAGC TTAAGGCCGA GCGAGAGCGA GGTATCACCA TCGACATTGC 50 

TCTTTGGAAG TTCGAGACCC CCAGGTACCA GGTCACCGTC ATTGACGCCC 100 

CCGGTCACCG AGACTTCATC AAGAACATGA TCACCGGTAC CTCCCAGGCT 150 

GACTGTGCCA TCCTCATCAT TGCCACCGGT ATCGGTGAGT TCGAGGCCGG 200 

TATCTCCAAG GACGGTCAGA CCCGAGAGCA CGCCCTCCTC GCCTTCACCC 250 

TCGGTGTCAG GCAGCTCATT GTTGCTTGCA ACAAGATGGA CACCTGCAAG 300 

TGGTCCGAGG ACCGATTCAA CGAAATCGTC AAGGAGACCA ACGGTTTCAT 350 

CAAGAAGGTT GGTTACAACC CCAAGGCTGT CCCCTTCGTC CCCATCTCTG 4 00 

GTTGGCACGG TGACAACATG TTGGAGGAGA CCACCAAGTC AGTGAATCCG 450 

CTTCTACGTG ATGAGATGTT TTTCTGACTT TCCGTGCAGC ATGCCCTGGT 500 

ACAAGGGATG GACCAAGGAG ACCAAGTCTG GTGTTTCCAA GGGTAAGACC 550 

CTCCTCGAGG CCATCGACGC CATCGAGCCC CCTACCCGAC CCACCGACAA 600 

GCCCCTCCGT CTCCCTCTCC AGGACGTCTA CAAGATCGGT GGTATCGGCA 650 

CAGTCCCTGT CGGCCGAGTC GAGACCGGTG TCATCAAGGC CGGTATGTTG 700 

TCTCATCTCT CTTGTCTCGT AACATGCGTC TCGTAACATG CGCTTACTTC 750 

ATTTTCAGGT ATGGTCGTCA AGTTCGCCCC CACCAACGTC ACCACTGAAG 800 

TCAAGTCCGT TGAGATGCAC CACGAGCAGA TCCCCGAGGG TCTTCCCGGA 850 

GACAACGTTG GTTTCAACGT CAAGAACGTT TCCATCAAGG ACATCCGACG 900 

AGGTAACGTC TGTGGTGACT CCAAGAACGA CCCCCCTATG GAGGCTGCTT 950 

CTTTCAACGC CCAGGTTATC GTCCTTAACC ACCCTGGTCA GATCGGTGCC 1000 

GGTTACACCC CCGTTCTCGA CTGTCACACT GCCCGTAAGC CTGACCCAAT 1050 

ACCTCCAACA TACCTTTGAA GCTGACCCTT TCTAGACATT GCCTGCAAGT 1100 

TTGCTGAGTT GATCGAGAAG ATTGACCGAC GAACCGGTAA GGTCATGGAG 1150 

GCCGCCCCCA AGTTCGTCAA GTCTGGTGAC GCCGCCATTG TCAAGCTTGT 1200 

TGCCCAGAAG CCCCTCTGTG TTGAGACCTA CGCCGACTAC CCCCCTCTTG 1250 

GTCGATTCGC CGTCCGAGA 1269 



2) INFORMATION FOR SEQ ID NO: 624 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



362 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 36801 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 

TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

10 CAATCTCTGG TTGGAATGGT GACAACATGA TTGAACCATC CACCAACTGT 150 

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 200 



TAAGACCTTG~TTAGAAGCTA — TTGACGCTAT TGAACCACCA— AeCAGACCAA 2-50- 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 300 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 350 

15 TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG 400 

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC TCTTTCAATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 600 

20 CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCGACACTTT 650 

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 700 

AATTCGTCAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 



25 



2) INFORMATION FOR SEQ ID NO: 625 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 
CGTTGAAGAC ACGACCCAAA GTATCC 26 

40 

2) INFORMATION FOR SEQ ID NO: 626 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626 
55 TACCACCTTT TAAGTAAGGT GCTAAT 26 



60 



2) INFORMATION FOR SEQ ID NO: 627 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
-AT.TGTCTATA_ AAAATGGCGA TAAGTC 26 



15 2) INFORMATION FOR SEQ ID NO: 628 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628 

AAAATGGCGA TAAGTCACAA AAAGTA 26 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 629 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629 
AAGTTCCATC TCAACAAGGT CAATA 25 

2) INFORMATION FOR SEQ ID NO: 630 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630 
CGGAGCTATC CTAGTCGTTT CA 22 

60 
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2) INFORMATION FOR SEQ ID NO: 631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
15 CAGACCAACY GAIAARCCAT TRAGAT 



2) INFORMATION FOR SEQ ID NO: 632 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 

30 

CCCTTTGGTG GRTCSTKCTT GGA 



35 2) INFORMATION FOR SEQ ID NO: 633 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633 

CAGACCAACY GAIAARCCIT TRAGAT 



50 



60 



2) INFORMATION FOR SEQ ID NO: 634 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 

AACACYGTCA GRRCIATTGC YATGGA 26 



2) INFORMATION FOR SEQ ID NO: 635 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
~ (C) _ STRANDEDNESST Single" 

(D) TOPOLOGY: Linear 



40 



15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635 
AAACCRGTIA RRGCRACTCT IGCTCT 26 

20 

2) INFORMATION FOR SEQ ID NO: 636 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636 
35 ACTGGYGTTG AIATGTTCCG YAA 23 



2) INFORMATION FOR SEQ ID NO: 637 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637 

50 

ACGTCAGTIG TACGGAARTA GAA 23 



55 2 ) INFORMATION FOR SEQ ID NO: 638 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638 

CCAATGCCAC AAACICGTGA RCACAT 26 



10 



2) INFORMATION FOR SEQ ID NO: 639 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 bases 
15 (B) TYPE: Nucleic acid - 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



20 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639 
TTTACGGAAC ATTTCWACAC CWGTIACA 28 

2 ) INFORMATION FOR SEQ ID NO: 640 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 0 
TCCATGGTIT WYGGICARAT GAA 23 

40 

2 ) INFORMATION FOR SEQ ID NO: 641 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641 
55 TGATAACCWA CIGCIGAIGG CATACG 26 



2 ) INFORMATION FOR SEQ ID NO: 642 

60 
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WO 01/23604 PCT/CAOO/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 
GGCGTIGGIG ARCGIACICG TGA 23 



15 2) INFORMATION FOR SEQ ID NO: 643 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 

ACTGGIGTIG ARATGTTCCG YAA 23 



2) INFORMATION FOR SEQ ID NO: 644 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 
ACGTCIGTIG TICKGAARTA GAA 23 

2) INFORMATION FOR SEQ ID NO: 645 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645 
ACGTCIGTIG TICKGAARTA RAA 23 

60 
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2) INFORMATION FOR SEQ ID NO: 64 6 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



10 



20 



50 



60 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646 
15 ATCGACAAGC CITTCYTIAT GSC . 23 



2) INFORMATION FOR SEQ ID NO: 647 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647 

30 

ACGTCCGTSG TRCGGAAGTA GAACTG 26 



35 2) INFORMATION FOR SEQ ID NO: 648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648 

ACGTCSGTSG TRCGGAAGTA GAACTG 26 



2) INFORMATION FOR SEQ ID NO: 649 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649 

GTCCTATGCC TCARACWCGI GAGCAC 26 



2) INFORMATION FOR SEQ ID NO: 650 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

CO STRANDEDNESST - SingTe - 

(D) TOPOLOGY: Linear 



40 



15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 
TTACGGAACA TYTCAACACC IGT 23 

20 

2) INFORMATION FOR SEQ ID NO: 651 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651 
35 TGACGACCAC CITCYTCYTT YTTCA 25 



2 ) INFORMATION FOR SEQ ID NO: 652 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652 

50 

CCWAYAGTIY KICCICCYTC YCTIATA 27 



55 2) INFORMATION FOR SEQ ID NO: 653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 

GAYTTCATIA ARAAYATGAT 20 



10 



2 ) INFORMATION FOR SEQ ID NO: 654 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



20 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TACAARATYK GIGGTATYGG 20 

2 ) INFORMATION FOR SEQ ID NO: 655 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
CCRATACCIC MRATYTTGTA 20 

40 

2 ) INFORMATION FOR SEQ ID NO: 656 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
55 AATTAATGGC TGCAGTTGAY GA 22 



60 



2) INFORMATION FOR SEQ ID NO: 657 

371 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657 
TTGTCCACGT TCGATRTCTT CA 



2) INFORMATION FOR SEQ ID NO: 658' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 
GATYTAGTCG ATGATGAAGA ATT 



2) INFORMATION FOR SEQ ID NO: 659 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 
GCTTTTTGIG TTTCWGGTTT RAT 



2) INFORMATION FOR SEQ ID NO: 660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660 
GTAGAATTGA GGACGGTAGT TAG 
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2) INFORMATION FOR SEQ ID NO: 661 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



10 



20 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 
15 GTAGAAYTGT GGWCGATART TRT 23 



2) INFORMATION FOR SEQ ID NO: 662 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: CoryneJbacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 

35 CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT 100 

GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

CGAGATGGAG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 200 

CTCCAATCAT CCACATCTCC GCACTGAAGG CTCTTGAGGG CGACGAGAAG 250 

4 0 TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 300 

AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA 350 

TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT 400 

GGCTCCCTGA AGGTCAACGA GGACGTCGAG ATCATCGGTA TCCGCGAGAA 450 

KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTTCCGT AAGCTTCTCG 500 

45 ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG TGGCGTTAAG 550 

CGCGAAGACG TTGAGCGTGG CCAGGTTGTT GTTAAGCCAG GCGCTTACAC 600 

CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG 650 

GTGGCCGCCA CACCCCATTC TTCGACAACT ACCGCCCACA GTTCTACTTC 700 

CGCACCACCG ACGTTACCGG TGTTGTGAAG CTTCCTGAGG GCACCGAGAT 750 

50 GGTCATGCCT GGCGACAACG TCGACATGTC CGTCACCCTG ATCCAGCCTG 800 

TCGCTATGGA TGAGGGCCTG CGCTTCGCTA TC 832 



55 2) INFORMATION FOR SEQ ID NO: 663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 



10 



AACGGCGAGC AAGACTTGGT GTTGGAGGTT TCTCAGCACT TGGGTGAGAA 50 

GACCGTGCGT— AeeATTGeCA~TGGACGGTAC _ CGAGGGTTTG~GTGAGAGGTA 100" 

CCGCTGTCAC TGACACCGGT GCTCCCATCT CGGTCCCCGT TGGTCAGGGT 150 

ACCTTGGGCC GGATCATCAA CGTTGTCGGT GAGCCCATCG ACGAGCGTGG 2 00 

15 TCCCATCCAG TGCAAGCAGA GAAACCCCAT' TCACGCCGAG CCCCCGTCTT 250 

TCACCGAGCA GTCCGTCGAG GCTGAGGTGT TGGAGACCGG TATCAAGGTT 300 

GTCGACTTGT TGGCTCCCTA CGCCCGTGGT GGTAAGATTG GTCTTTTCGG 350 

TGGTGCCGGT GTCGGTAAGA CCGTGTTCAT CCAGGAGTTG ATTAACAACA 400 

TTGCCAAGGC CCACGGTGGT TTCTCCGTGT TCACTGGTGT CGGTGAGCGT 450 

20 ACTCGTGAGG GTAACGACTT GTACCGTGAA ATGAAGGAGA CCGGTGTCAT 500 

CAACTTGGAG GGCGACTCCA AGGTGGCCTT GGTGTTCGGT CAGATGAACG 550 

AGCCCCCGGG GGCTCGTGCC CGTGTCGCCT TGACCGGTTT GACCATTGCC 600 

GAGTACTTCC GTGACGAGGA AGGCCAGGAC GTGTTGTTGT TCGTTGACAA 650 

CATTTTCAGA TTCACCCAGG CCGGTTCCGA GGTGTCGGCG TTGTTGGGTC 700 

25 GTATCCCCTC CGCCGTCGGT TACCAGCCCA CTTTGGCCAC CGACATGGGT 750 

TTGTTGCAGG AGAGAATTAC CACCACCAAG AAGGGTTCCG TCACCTCTGT 800 

GCAGGCCGTG TACGTCCCTG CCGATGACTT GACTGACCCT GCCCCCGCCA 850 

CCACTTTCGC TCACTTGGAC GCCACCACCG TGTTGTCGCG TGGTATCTCC 900 

GAGTTGGGTA TCTACCCCGC CGTCGACCCC TTGGACTCCA AGTCGAGATT 950 

30 GTTGGACGTC GAGGTTGTTG GCCAGGAGCA CTACGACGTC GCCACCGGTG 1000 

TCCAGGAGTG CTTGCAGGCC TACAAGTCGT TGCAGGACAT CATTGCCATT 1050 

TTGGGTATGG ACGAGTTGTC CGAGCAGGAC AAGTTGACCG TCGAGAGAGC 1100 

CCGTAAGATC CAGCGTTTCT TGTCGCAGCC CTTCGCTGTC GCCGAGGTTT 115 0 

TCACTGGTAT CCCCGGTAGA TTGGTGAGAT TGCAGGACAC CG 1192 

35 



2) INFORMATION FOR SEQ ID NO: 664 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
50 AAYATGATIA CIGGIGCIGC I CARATGGA 29 



2) INFORMATION FOR SEQ ID NO: 665 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

374 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 



10 ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGA"CAAGAG"AA"CCATCGAA~AAGTTCGAAA-AGGAAGCCGC-TGAATTAGGT ISO 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 2 00 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

15 AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

20 AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

25 GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

30 GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 12 00 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

35 ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 13 00 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



40 



50 



2) INFORMATION FOR SEQ ID NO: 666 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1536 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char omyces cerevisiae 

(C) ACCESSION NUMBER: extracted from M12082 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 

ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGCCAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

60 GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 200 
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GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 360 
CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 300 
GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 350 
CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 4 00 

5 AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 450 
CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 500 
GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GCCAGAGGTG 550 
GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 600 
CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTCCGTTTT 650 

10 CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG TACCGTGAAA 700 
TGAAGGAAAC TGGAGTCATT AACTTGGAAG GTGAATCCAA GGTCGCCTTA 750 

GTTTTCGGTC _ AAATGAACGA"ACCTCCAGGA~GeGAGAGGeA— GAGTGGGTTT 800- 

AACTGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 850 
TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 900 

15 GTCTCTGCCC TTTTGGGTCG TATTCCATCT< GCCGTCGGTT ATCAACCAAC 950 

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 1000 

AGGGTTCTGT CACTTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050 

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 1100 

CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 1150 

2 0 TGGATTCTAA ATCAAGGTTA. TTGGATGCCG CCGTTGTCGG TCAAGAACAT 12 00 

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 1250 

ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 1300 

AACTAACTGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 1350 

TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAGTGAGATT 14 00 

25 AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 1450 

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 1500 

CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 1536 



30 



40 



2) INFORMATION FOR SEQ ID NO: 667 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 93 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 

CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGCCGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

50 CGAAGGTAGT CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 2 00 

ACACTTGGCA GAATCTTCAA TGTGCTGGGC GATGCCATTG ACCAGCGCGG 250 

CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 300 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 350 

ATTGATCTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 4 00 

55 TGGTGCTGGC GTGGGCAAGA CGGTTATTAT CATGGAACTT ATTAACAACG 4 50 

TTGCCAAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 500 

ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATGCAAT CCAAAGTTAT 550 

TGACCTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

AGCCTCCGGG TGCCCGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 650 

60 GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 70 0 

376 
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CATTTTCCGC TTTACGCAGG CAAACTCTGA GGTGTCAGCG CTOTTGGGTC T/50 

GTATTCCCGC TGCCGTCGGC TACCAGCCTA CCCTTGCTGA GGATCTTGGG 'SOO 

CAGTTGCAGG AGCGCATTAC GTCCACGACA AAAGGTTCCA TTACCTCTGT 850 

GCAGGCTGTG TACGTGCCAG CCGATGATAT TACCGACCCT GCGCCAGCAA 900 

5 CGACCTTTTC CCACCTCGAT GCCACGACGG TGCTGGACCG TGCCGTTGCC 950 

GAATCCGGCA TTTACCCCGC TGTCAACCCA CTGGAGTGTG CCTCGCGTAT 1000 

CATGGACCCG GATGTAATCA GCGTTGACCA CTACAACGTG GCGCAGGATG 1050 

TGGTGCAGAT GCTTACCAAG TACAAGGAGC TGCAGGATAT CATTGCGGTG 1100 

CTTGGCATTG ATGAGCTCAG TGAGGAGGAT AAACTTATCG TGGACCGTGC 1150 

10 GCGTAAGGTG ACAAAGTTTC TCTCCCAGCC TTTCCAGGTG GCGGAGGTGT 1200 

TTACTGGCAT GACAGGCCAC TACGTGCAGC TGGAGGAGAC CATTGAGTCT 1250 



15 



25 



TTTTCCGGCC TGTTGATGGG~CffCATATGST~CAGGTGCCGG _ AGA— 1293" 



2 ) INFORMATION FOR SEQ ID NO: 668 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1191 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium glutamicum 
(C) ACCESSION NUMBER: X77034 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 



CATCGGTCAC GTTGACCACG GTAAGACCAC 
AGGTTCTGGC TGACACTTAC CCTGAGCTCA 

35 TCCATCGATA AGGCTCCTGA GGAGAAGGAG 
CTCCCACGTT GAGTACCAGA CTGAAAAGCG 
CTCCAGGCCA CGCCGACTAC ATCAAGAACA 
ATGGACGGCG CAATCCTCGT TGTTGCTGCT 
GACCCGTGAG CACGTTCTTC TTGCTCGCCA 

4 0 TCGTTGCTCT TAACAAGTGC GACATGGTTG 
CTCGTCGAGA TGGAAGTTCG TGAACTTCTT 
AGAGGCTCCA ATTGTTCACA TCTCCGCTCT 
AGAAGTGGGG CAAGCAGATC CTTGAGCTCA 
ATCCCTGACC CAGTTCGTGA GACCGACAAG 

4 5 GGACATCTTC ACCATCACCG GTCGTGGCAC 
AGCGCGGTAC CCTGAACGTG AACGATGATG 
GAGAAGTCCA CCTCCACCAC CGTTACCGGT 
TCTTGACTCC GCTGAGGCTG GCGACAACTG 
TCAAGCGCGA AGATGTTGAG CGTGGCCAGG 

50 TACACCCCTC ACACCGAGTT CGAGGGCTCT 
TGAAGGTGGC CGCCACACCC CATTCTTCGA 
ACTTCCGCAC CACCGACGTT ACCGGTGTTG 
GAGATGGTCA TGCCTGGCGA CAACGTCGAC 
GCCTGTCGCT ATGGACGAGG GCCTGCGTTT 

55 GCACCGTTGG CGCTGGTCGT GTCACCAAGA 



60 



2) INFORMATION FOR SEQ ID NO: 669 

377 



ID NO: 668 






CCCCACGTAA 


ACATCGGCAC 


50 


CACCACCGCG 


GCTATCACCA 


100 


ACGAGGCTTT 


CGCCTTCGAC 


150 


CGTGGCATCA 


CGATCAACAT 


200 


CCACTACGCA 


CACGTTGACG 


250 


TGATTACCGG 


CGCTGCTCAG 


300 


ACCGACGGCC 


CAATGCCTCA 


350 


GGTTGGCGTT 


CCTTACATCC 


400 


AGGATGAGGA 


AATCATCGAG 


450 


GCTGAGCAGG 


ACTACGACGA 


500 


GAAGGCTCTT 


GAGGGCGACG 


550 


TGCAGGCTTG 


CGATGACAAC 


600 


CCATTCCTCA 


TGCCTATCGA 


650 


CGTTGTTACC 


GGTCGTGTTG 


700 


TTGACATCAT 


CGGCATCAAG 


750 


ATCGAGATGT 


TCCGTAAGCT 


800 


TGGTCTGCTT 


CTCGGTGGTA 


850 


TTATCGTTAA 


GCCAGGCGCT 


900 


GTCTACGTTC 


TGTCCAAGGA 


950 


CAACTACCGT 


CCTCAGTTCT 


1000 


TGAAGCTTCC 


AGAGGGCACC 


1050 


ATGTCCGTCA 


CCCTGATCCA 


1100 


CGCTATCCGC 


GAAGGCTCCC 


1150 


TCATCAAGTA 


A 


1191 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
10 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from V00267 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 

15 ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 50 

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA 100 

ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150 

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 200 

GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 250 

20 CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 3 00 

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 350 

CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 400 

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 450 

GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500 

25 CGCGATCGAG CACTCCGGTT ACTCTGTGTT TGCGGGCGTA GGTGAACGTA 550 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 600 

GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 650 

TCTGCGCGTT GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG 700 

AAGGTCGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 750 

30 GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 800 

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA 850 

CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ATACGTACCT 900 

GCGGATGACT TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950 

CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 1000 

35 CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 1050 

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100 

TTATCAGGAA CTGAAAGACA TCATCGCCAT CCTGGGTATG GATGAACTGT 1150 

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 1200 

CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 1250 

40 ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 1300 

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 1350 

GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 1383 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 670 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1410 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 

(C) ACCESSION NUMBER: extracted from AF004014 

378 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670 



PCT/CA00/01150 



ATGAAAGCGA TGGAAGGTAA AATCATTCAG GTTTTAGGCC CGGTGGTAGA 50 

TGTGGAGTTT GAATCCTATC TGCCGGCGAT TTTTGAAGCA CTAGACATTA 100 

5 ATTTTGAAGT TAATGGCGTT CAAAAATCTT TAGTTTTAGA GGTGGCAGCC 150 

CATTTGGGTG GTAATCGGGT GCGAGCGATT GCTATGGATA TGACAGAAGG 200 

CTTAGTGCGT AACCAAGCCG TCAAAGCTCG TGGCAAAATG ATTGAAGTGC 250 

CTGTGGGCGA AGAAGTGTTA GGGCGTATTT TTAATGTTGT GGGCGAGAGC 300 

ATTGATAATT TAGAGCCGCT TAAGCCGTCC TTAACTTGGC CCATTCACAG 350 

10 AAAAGCCCCT AGTTTTGAGC AGCAAAGCAC TAAAACAGAA ATGTTTGAAA 400 

CCGGTATTAA AGTCATTGAC TTGCTCGCGC CTTATTCTAA GGGCGGTAAA 450 

-— _GIAGGCT-TGT-^^ —500 

GCTTATCCAC AATGTGGCTT ATAAGCATAA CGGGTATTCG GTGTTTGCAG 550 

GTGTGGGGGA GCGCACCAGA GAAGGGAACG ATCTGTATTT TGAGATGAAA 600 

15 GAAGGGGGCG TTTTAGACAA AGTTGCGTTG* TGCTATGGGC AAATGAATGA 650 

GCCACCAGGT GCAAGGAATC GCATCGCATT CACCGGCTTG ACGATGGCGG 700 

AGTATTTCCG TGATGAAAAG GGCTTAGATG TGTTGATGTT TATTGATAAC 750 

ATCTTTAGAT ACGCTCAAAG CGGTGCGGAA ATGAGCGCGC TATTAGGCCG 800 

TATCCCTTCA GCGGTGGGGT ATCAGCCCAC GCTAGCCGGG GAAATGGGGA 850 

20 AACTTCAAGA GCGTATCGCT. TCCACTAAAA ATGGCTCTAT CACTTCGGTT 900 

CAAGCGGTGT ATGTGCCAGC AGACGACTTG ACTGACCCAG CCCCTGCTTC 950 

GGTGTTTGCG CATTTAGATG CGACTACGGT GTTGAATAGA AAGATCGCTG 1000 

AAAAAGGGAT TTATCCGGCG GTGGATCCTT TGGATTCCAC TTCAAGGATT 1050 

TTAAGCCCTC AAATGATTGG CGAGAAGCAC TATGAAATCG CCACCGGTAT 1100 

25 CCAGCAAGTT TTGCAAAAAT ACAAGGATTT GCAAGATATT ATTGCGATTT 1150 

TGGGATTGGA CGAATTGAGC GAAGAGGATA AAAAAACGGT TGAAAGGGCC 1200 

AGAAAAATTG AGAAGTTTTT ATCCCAGCCG TTTTTTGTGG CTGAAGTGTT 1250 

TACAGGAAGT CCCGGTAAGT ATGTGACTCT CCAAGAGACT TTAGAGGGCT 1300 

TTGGAGGGAT TTTAGAGGGC AAATACGATC ACATTCCTGA AAACGCGTTT 1350 

30 TACATGGTGG GCAGCATTCA AGAGGTTTTA GAAAAAGCTA AAAACATGAA 1400 

AAATTCCTAA 1410 



35 2) INFORMATION FOR SEQ ID NO: 671 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 01 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium acetobutylicum 

(B) STRAIN: DSM 792 

(C) ACCESSION NUMBER: extracted from AF101055 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 

ATGCCAGAAC ATGTAGGTAA AATTGTTCAG GTAATAGGAC CTGTTGTGGA 50 

TATTAAATTT GATGCAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 100 

TAGATATGGG AGATAATAAA AAACTCATTG CTGAAGTTGA ACAACATGTA 150 

55 GGAGATGACA TAGTAAGAAC AATAGCAATG GAAGGTACTG ACGGATTAAA 200 

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTACCAGTTG 250 

GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 300 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 350 

TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 400 

60 TTAAGGTTAT AGATTTACTT GCTCCATATC AAAAGGGTGG AAAAATCGGT 450 

379 
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TTGTTCGGTG GTGCCGGTGT TGGTAAAACA GTTCTTATTC AGC5AACTTAT ioO 
AAATAATATA GCAAAAGAAC ACGGTGGATT ATCAGTATTC ACAGGTGTTG 550 
GAGAAAGAAC AAGAGAAGGT AATGACCTTT ATTATGAAAT GAAAGATTCA 600 

GGAGTTATAA ATAAAACAGC TCTAGTATTT GGTCAGATGA ATGAACCACC 650 

5 TGGCGCTAGA ATGAGAGTTG CTTTAACAGG ACTTACAATG GCTGAATATT 700 

TTAGAGACAA AGGTCAAGAT GTGCTTCTAT TTATAGATAA TATATTCAGA 750 

TTTACACAAG CTGGTTCAGA GGTTTCAGCT TTACTTGGTA GAATACCTAG 800 

TGCCGTTGGT TATCAGCCAA CACTTGCAAA TGAAATGGGT GCTCTTCAAG 850 

AGAGAATAAC ATCAACAAAA CAGGGTTCAA TCACATCCGT TCAGGCTGTA 900 

10 TATGTTCCTG CTGATGACCT TACAGACCCA GCTCCAGCAA CAACATTTAC 950 

GCATCTTGAT GCAACAACAG TTCTTTCAAG AGAAATATCA AACTTAGGAA 1000 

TATATCCTGC" AGTTAGTCCT~CTTGAATCAA — CTTCAAGAAT— ACTTGATGGA- 1-050- 

AGAATTGTTG GAGAAGAGCA TTATGAAGTT GCTAACAAGG TTAAACATAT 1100 

ACTTGAAAGA TATCAAGAAC TTCAAGATAT CATAGCTATA CTTGGTGTTG 1150 

15 ATGAACTTTC AGATGAGGAT AGATTGTTAG TTGGAAGAGC AAGAAGAGTA 1200 

CAGAGATTCT TATCTCAAGC TTTTAGTGTT GCTGAACAAT TTACAGGAAT 1250 

GAAAGGTCAG TTTGTACCTG TAAAAGATAC TATAAGAAGT TTTAAAGAAA 1300 

TATTAGATGG TAAGTGTGAT GATCTTCCAG AAGCTGCATT TTTATTTGCA 1350 

GGAACAATAG AAGATGTAAA AGAAAAAGCT AAAAAAATGA TGGAAAGCTA 1400 

20 A 1401 



2 ) INFORMATION FOR SEQ ID NO: 672 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cytophaga lytica 

(B) STRAIN: DSM 2039 

(C) ACCESSION NUMBER: M22535 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672 



ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATTATTGGCC CAGTTATAGA 50 

TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGCAGATGGA TCAATTTTGG TTTTGGAAGT ACAATCACAC 150 

ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 200 

45 AAGTCGTGGA GCAGAGGTTA ATGCAACAGG AAGCGCTATA CAAATGCCAG 250 

TTGGAGATGA CGTTTACGGA CGTTTATTTA ACGTAATTGG AGACGCTATT 300 

GATGGTCTTG GGAATTTACC AAAATCTGGT AAAGATGGTT TGCCAATACA 350 

CAGAGAGGCA CCAAAATTTG AAGACTTATC TACTTCTACA GAAGTATTAT 400 

TTACAGGTAT TAAAGTAATT GACCTTATTG AGCCTTATGC AAAAGGTGGT 450 

50 AAGATTGGTT TATTTGGAGG TGCCGGAGTA GGTAAAACAG TATTAATTCA 500 

GGAATTAATT AACAACATTG CAAAAGGTCA CGGTGGACTT TCTGTATTTG 550 

CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ACGATTTACT ACGTGAGATG 600 

TTAGAGTCTG GTATTATTAA ATACGGAGAT GACTTTATGC ACTCTATGGA 650 

AGAAGGTGGT TGGGATTTAT CTAAAGTTGA TAAATCTGTA ATGAAAGATT 700 

55 CTAAAGCAAC CTTTGTATTT GGACAAATGA ATGAGCCACC AGGAGCACGT 750 

GCACGTGTTG CATTATCTGG TTTAACTATT GCAGAATATT TCCGTGATGG 800 

AGCAGGTGAA GGTCAAGGTA AAGATGTACT TTTCTTTGTG GATAACATTT 850 

TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT CTGCATTACT TGGTCGTATG 900 

CCATCTGCGG TAGGTTACCA ACCAACATTA GCAACAGAAA TGGGTGCTAT 950 

60 GCAAGAGCGT ATTACATCAA CAAAAAGAGG TTCTATTACA TCTGTACAGG 1000 

380 
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CGGTTTACGT ACCTGCGGAT GATTTAACGG ATCCAGCACC GGCAACTACC UfSQ 

TTTGCTCACT TAGATGCAAC AACGGTATTG TCTCGTAAAA TTGCAGAGTT 1100 

AGGTATTTAC CCAGCGGTAG ATCCATTAGA TTCTACTTCT AGAATCTTAG 1150 

CTCCAGAAAT TTTAGGAAAA GATCACTACT CTTGTGCACA ACGTGTAAAA 1200 

5 GAGTTGTTAC AACGTTATAA AGAATTACAA GATATTATTG CTATCCTTGG 1250 

TATGGAAGAA TTATCTGAGG AAGATAAAAT GGCAGTTGGT AGAGCAAGAC 1300 

GTGTACAACG TTTCTTATCT CAGCCTTTCC ACGTAGCAGA GCAATTTACA 1350 

GGTCTTAAAG GTGTTTTAGT AGATATCAAG GATACTATTA AAGGATTTAA 14 00 

TATGATTATG GATGGTGAAT TAGATCACTT ACCAGAATCT GCATTTAACC 14 50 

10 TTAAAGGTAC TATTGAAGAA GCTATAGAAG CAGGAGAAAA AATGCTTGCT 1500 

GAAGCATAA 1509 



30 



50 



60 



15 2) INFORMATION FOR SEQ ID NO: 673< 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia risticii 

(B) STRAIN: HRC-IL 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 



2) INFORMATION FOR SEQ ID NO: 674 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 840 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

381 



CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 50 

TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTGCAA 100 

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT TTGTTCAGGA 150 

AATCCAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 200 

35 TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 250 

TAAAAATGAT GCCAATTCAT CGAAATCCAC CATCATTTGA AGAGCAATCC 300 

AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 350 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 400 

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGTCAT 450 

40 GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 500 

TGACTTGTAT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 600 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAACCAAGA 650 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GCTGGTAGTG 700 

45 AAGTTTCAGC ATTATTAGGA AGAATGCCAT CTGCTGTTGG TTATCAACCA 750 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 800 

AAGTGGTTCT ATAACATCT 819 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 

AGAGCGAAGT ACCAAGTGTT TACGATGCTC TGAATGTTGT GGATTCCAAA 50 

GAACGTCTGG TTCTGGAAGT TCAACAGCAG CTAGGCGGTG GCGTGATTCG 100 

CGCTATCGTT ATGGGTAGCT CGGATGGTTT ACGTCGTGGA ATGACAGTAC 150 

10 AAAACACTGG CGCTCCAATT TCAGTACCAG TAGGTACTAA AACCCTAGGT 200 

CGTATCATGA ACGTGCTTGG TGATGCGATT GACGAACGTG GCGACATTGG 250 

GGCAGAAGAG- GTGTACTGGA— T-TCAGGGTGG-TGGTGGAAGe— TAGGAAGAAC 3 00- 

AGTCTAGTGC AACTGAACTT TTGGAAACGG GTGTTAAGGT TATCGACCTG 350 

ATCTGTCCGT TTGCGAAAGG CGGTAAAATC GGTCTGTTCG GTGGTGCGGG 400 

15 TGTAGGTAAG ACCGTTAACA TGATGGAACT TATCAACAAC ATCGCGCTAC 450 

AGCACTCAGG TTTGTCAGTA TTTGCTGGGG TAGGTGAGCG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGCAGGAA GCGGGCGTTG TAAACGTTGA 550 

ACAACCAGAA CTGTCGAAAG TAGCGATGGT TTACGGTCAG ATGAACGAGC 600 

CACCAGGCAA CCGTCTGCGT GTAGCACTGA CTGGTCTGAC TATGGCGGAA 650 

20 AAGTTCCGTG ATGAAGGCCG TGACGTACTG CTGTTTATCG ACAACATCTA 700 

CCGTTACACC CTAGCGGGAA CGGAAGTATC TGCTCTGCTT GGCCGTATGC 750 

CTTCAGCGGT AGGTTACCAA CCAACACTGG CTGAAGAGAT GGGTGTTCTG 800 

CAAGAACGTA TCACGTCAAC CAAAAAAGGT TCTATCACCT 84 0 



25 



40 



2) INFORMATION FOR SEQ ID NO: 675 



(i) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 



CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTCCTTA CATCATCGTA 100 

45 TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

AGAGATGGAA GTTCGTGAGC TGCTGTCTGA GTACGATTTC CCAGGTGATG 200 

ACCTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 250 

CAGTGGGAAG CGAAGATTGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 300 

TCCAGAGCCA GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 350 

50 ACGTATTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 400 

CGCGGCATCC TGAAAGTGGG TGACGAAGTA GCGATCGTTG GTATCAAAGA 450 

GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

CGTGAAGAAG TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 600 

55 ACCACACACT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 650 

GTGGCCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 800 

TCGCGATGGA CGAAGGTCTA CGCTTCGC 828 



60. 



382 
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2 ) INFORMATION FOR SEQ ID NO: 676 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 
15 (B) STRAIN: ATCC 50120 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 

CTTCTCGGAT GGCGTGCCGC CGGTGCTGAC GGCGCTGGAC GTGACGGAGG 50 

20 AACTCGGGCG CGACGAGCCG CTGACGCTAG AGATCGTGCA ACACTTGGAC 100 

GCGCACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TACTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGGAA CATCTCTGTG CCTGTGGGCC 200 

GCGAGACGCT TGGCCGCATC TTCAACGTAC TGGGCGACGC GATTGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CGGAGGCGCC 300 

25 GAAGCTGGCG GACCAGGCGG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AAGTGATCGA TCTGATCCTG CCGTACTGCA AGGGCGGCAA GATCGGTCTG 4 00 

TTCGGCGGTG CCGGTGTGGG GAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTCGCC GGCGTTGGGG 500 

AGCGCACCCG CGAGGGGACG GATCTGTACC TGGAGATGAT GCAATCGAAG 550 

30 GTGATCGACC TGAAAGGTGA GTCGAAGTGT GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CAGCGCTGCT 750 

GGGCCGCATC CCCGCCGCTG TGGGCTACCA ACCGACGCTT GCGGAGGATC 800 

35 TCGGCATGTT GCAGGAGCGC ATCACGTCGA CGACGAAGGG GTCGATCACG 850 

TCTGTGCAGG CGGTGTACGT GCCTGCGGAT GATATCACGG ATCCCGCGCC 900 

GGCGACGACA TTCTCGCACC TGGACGCGAC GACGGTGCTG GACCGCGCGG 950 

TGGCGGAGTC TGGCATCTAC CCCGCGGTCA ACCCACTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCTGATGT GATCGACGTG GATCACTACA ACGTTGCGCA 1050 

40 GGACATTGTT CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGTGCGCGCA AGGTGACCCG GTTCCTGTCA CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTTACT GGCATGACAG GCCATTACGT ACAGCTGAGC GACACGGTGG 1250 

AGTCGTTCTC CGGTTTGCTG ATGGGGTCGT ACGACCAGAT TCCGGAGA 1298 

45 



2 ) INFORMATION FOR SEQ ID NO: 677 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia micro tti 
60 (B) STRAIN: Persing-1 

383 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677 



CAAGCTCAAG 
5 GGAAATTTGA 
CATCGTGACT 
TGCTATGCTT 
CCAAAGAAGG 
GTCAAACAGA 

10 GGAGGATCGT 
AGGTGGGTTA 

CACGGTGACA- 

TAAGACCCTC 
TCGAAAAACC 

15 ATTGGTACGG 
AATGATCATT 
TTGAAATGCA 
GGATTTAATG 
GGCTTCGGAT 

20 CACAAGTCAT 
CCTGTGGTTG 
AGACACTAGG 
AGACTATTAA 
CCCATGGTTG 

25 CGTGCGTGAT 



TCTGAGCGTG 
GACCCAGAAA 
TTATCAAAAA 
GTCGTTCCTG 
TCAGACCCGT 
TGATTGTTGC 
TATATGGAAA 
CAAAGTTGAA 
ACATGGTTGA 
GTAGAGGCAC 
TCTTAGATTG 
TACCAGTCGG 
ACTTTTGCCC 
TCACGAGGTT 
TCAAGAATGT 
TCGAAAAATG 
TGTACTCAAT 
ATTGCCATAC 
ATTGACAAGC 
GAATGGTGAC 
TGGAAACTTT 
ATGCGCCAAA 



AGAGAGGTAT 
TACGAGTACA 
TATGATTACT 
CTGAATCTGG 
GAACACGCCT 
TATTAACAAA 
TTTTCAAGGA 
AGCGTGCCGT 
AAAATCTACT 
TTGATCAAAT 
CCCCTGCAGT 
AAGGGTCGAA 
CCACTGGTTT 
GTGGAAGTGG 
GTCTGTTAAG 
ACCCAGCCAA 
CACCCTGGTA 
TGCCCACATT 
GTACTGGCAA 
GCTGCCATGG 
CACCGACTAC 
CCGTTGCCGT 



TACTATTGAC 

CTGTCATAGA 

GGGACTTCAC 

CGGATTCGAG 

TACTAGCCTT 

ATGGATTCTT 

AGTACAGCAG 

TTGTTGCTAT 

"AACATGGCTT" 

GGAGCCTCCA 

CAGTCTATAA 

ACAGGACAAC 

GACCACTGAA 

CTAGCCCCGG 

GATATTAAGA 

GGAAGCTACC 

CCATCAAGGC 

GCTTGCAAAT 

GACTTTGGAA 

TGACTATGAA 

GCCCCGTTGG 

CGG 



ATTACTCTCT 50 

CGCACCTGGT 100 

AAGCCGACGT 150 

GCTGCTTTTT 200 

CACACTTGGC 250 

GTCAGTACAA 3 00 

TACTTGAAGA 350 

TTCAGGATTC 400 

GGTATAAGGG 450 

AAACGTCCGG 500 

AATTGGAGGT 550 

TGAAAGCAGG 600 

TGTAAATCTG 650 

TGATAACGTT 700 

GAGGAAATGT 750 

TCTTTCTCTG 8 00 

CGGTTACTCA 850 

TCGAATCTCT 900 

GAAAATCCTA 950 

ACCAAATAAA 1000 

GCCGGTTCGC 1050 
1083 



2) INFORMATION FOR SEQ ID NO: 678 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Lev- 12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678 

45 TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCT CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT CAGGCTAATT AGTCGATGAC 200 

GTGGGCCCTG ACTAAAACTG TTTCTTCCAG TCTTCGGTCA GATGAACGAG 250 

50 CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT ACTGGTTTGA CCATTGCCGA 300 

GTACTTCCGT GACGAGGAAG GCCAGGATGT GTTGCTTTTC ATTGACAACA 350 

TTTTCCGATT CACCCAGGCC GGTTCCGAGG TGTCTGCCTT GCTCGGTCGT 400 

ATCCCCTCTG CCGTCGGTTA CCAGCCCACT CTTTCCACCG ACATGGGTGG 450 

TATGCAGGAG CGAATGTAGG TTGCATTCTC TGTGATTTTA CGGCAAGCCT 500 

55 TGACTTTTTT TTTCTAGTAC CACCACCAAG AAGGGTTCCA TTACCTCCGT 550 

C 551 



60 2) INFORMATION FOR SEQ ID NO: 679 

384 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 
(B) STRAIN" -ATCC-4-4-1 04 



15 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679 



TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 200 

20 GGGCCCTGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250 

CCCTGGAGCC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGCCGAGT 300 

ACTTCCGTGA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350 

TTCCGATTCA CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 4 00 

CCCCTCTGCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450 

25 TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500 

ACTTACATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550 

TC 552 



2) INFORMATION FOR SEQ ID NO: 680 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1018 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680 

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 50 

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100 

ATTGCTATGG ATGGTACTGA AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150 

50 AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT , 200 

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATCTGTT 250 

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 300 

CTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 350 

ACTGGTGCTC CTATTACCAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 400 

55 CATCAACGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450 

AGTCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 500 

CCTACTCCCG AAATTCTTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550 

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600 

GTAAAACTGT CTTGATTCAA GAACTTATTA ACAACATTGC TAAAGCCCAT 650 

60 GGTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700 

385 
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60 
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CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGAAGGTG 750 

ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TCCTGGTGCT 800 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCTACTGAT ATGGGTGGTA TGCAAGAACG 1000 

TATTACTACT ACCAAGAA 1018 



2) INFORMATION FOR SEQ ID NO: 681 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 

2) INFORMATION FOR SEQ ID. NO: 682 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
GTIACIGGYT CYTCRAARTT ICCICC 26 

40 

2) INFORMATION FOR SEQ ID NO: 683 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
55 GTIACIGGIT CISWIAWRTC ICCICC 26 



2) INFORMATION FOR SEQ ID NO: 684 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida tropicalis 

(C) ACCESSION NUMBER: M64984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 

15 ATGGCTGGTG CTTTAGAAAA CGCAAGAAAA< GAAATTAAAC GTCTTTCATT 50 

AGATGACACC AATGAATCCC AATATGGTCA AATCTATTCT GTTTCCGGTC 100 

CGGTTGTTAT TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150 

AAAGTTGGTC ATGATAATTT AGTTGGGGAA GTTATTAGAA TTAATGGTGA 200 

TAAAGCAACC ATTCAAGTTT ATGAAGAAAC TGCAGGGGTC ACTGTTGGTG 250 

2 0 ATCCAGTTTT AAGAACTGGT AAACCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350 

' AGATGAATCC CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 400 

TATCAAGAAC TGTTCAATAT GATTTCACTC CAGGTCAATT GAAAGTTGGT 450 

GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500 

25 ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGTACTATTA 550 

CTTCTATTGC TGAAGCCGGT TCTTATAATG TTGAAGAACC AGTTTTGGAA 600 

GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650 

AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700 

TGTTGACCGG TCAAAGAGTC TTGGATTCTT TATTCCCATG TGTTCAAGGT 750 

30 GGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800 

TCAATCTTTG TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850 

GTTTCACTAA AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900 

ATTGAATCTA TTGAAGTTGG TGACAAAGTC ATGGGTAAAG ATGGTATGCC 95 0 

AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000 

35 GTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050 

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100 

CAAGATTGCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150 
TCTATGTTTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGCCAAG . 12 00 

ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250 

40 AGCTGCTACT TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 1300 

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 1350 

ACCACTCAAA TGATCAACCC AGTTCATTTT GAATCTGGTA AACTCGGTAA 14 00 

CTGGTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 14 50 

ACTTGTTGGG TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500 

45 ACCATGAACT CCAAAGATGA TGTTAAATTA GCTACAAGAA TTATGAACTA 1550 

CTCTTCAAAA TTGGGCATGA CTTGTTCTTC TACTGAATCC GGTGAACTCA 1600 

ATGTCGCTGA AAACGAAGAA GAATTTTTCA ATAACCTTGG TGCTGAAAAG 1650 

GATGAAGCTG GTGATTTCAC TTTTGATGAA TTTACCGATG CTATGGATGA 1700 

ATTGACTATC AATGTTCATG GTGCAGCTGC AAGCAAGAAG AACAATTTGT 1750 

50 TGTGGAATGC TTTGAAATCT CTTGGTTTCA GAGCCAAGTC TACTGATATT 1800 

GTCAAGAGTA TTCCTCAACA TATTGCTGTT GATGATATTG TTGTCAGAGA 1850 

ATCTTTGATT GCCGGTTTAG TTGATGCTGC TGGTAATGTT GAAACCAAAT 1900 

CCAATGGTTC TATTGAAGCT GTTGTTAGAA CTTCTTTCAG ACATGTCGCT 1950 

AGAGGTCTTG TCAAGATTGC TCATTCTTTG GGTATTGAAT CATCTATTAA 200 0 

55 TATTAAAGAT ACTCACATTG ATGCTGCTGG TGTTAGACAA GAATTTGCTT 2050 

GTATTGTCAA TTTGACTGGT GCTCCACTTG CTGGTGTTCT TTCTAAATGT 2100 

GCACTTGCAA GAAACCAAAC TCCAGTTGTC AAATTTACCA GAGACCCAGT 2150 

TTTGTTCAAC TTTGATTTGA TCAAATCTGC AAAAGAAAAC TATTATGGTA 22 00 

TTACTTTGGC TGAAGAAACT GATCATCAAT TCCTTTTATC CAACATGGCC 2250 

60 TTGGTGCACA ACTGTGGTGA ACGTGGTAAT GAGATGGCTG AAGTTTTGAT 2300 
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GGAATTCCCA GAATTGTTTA CTGAAATTTC TGGTAGAAAA GAACCAATTA 2350 

TGAAACGTAC CACTTTGGTT GCCAATACTT CTAATATGCC AGTCGCTGCC 2400 

AGAGAAGCTT CTATTTATAC TGGTATTACA TTGGCTGAAT ATTTCAGAGA 2450 

TCAAGGTAAG AATGTTTCTA TGATTGCTGA TTCTTCTTCA CGTTGGGCTG 2500 

5 AAGCTTTGAG AGAAATTTCT GGTAGATTGG GTGAAATGCC TGCTGATCAA 2550 

GGTTTCCCAG CTTATTTGGG TGCTAAATTG GCTTCTTTCT ATGAGCGTGC 2600 

CGGTAAAGCC ACTGCTTTGG GTTCACCAGA TAGAGTTGGT TCAGTTTCTA 2650 

TTGTTGCTGC TGTTTCTCCA GCTGGTGGTG ATTTCTCTGA TCCAGTTACT 2700 

ACTTCTACTT TGGGTATTAC TCAAGTTTTC TGGGGGTTGG ATAAGAAATT 2750 

10 GGCCCAAAGA AAACATTTCC CATCTATTAA CACCAGTGTT TCTTATTCTA 2800 

AATACACCAA TGTTTTGAAC AAATACTATG ATTCCAACTA TCCAGAATTC 2850 

CCACAATTGA~~GAGACAAAAT _ TAGAGAAATT~TTATCTAATG~GTGAAGAATT 2900- 

GGAACAAGTT GTTCAATTAG TTGGTAAATC TGCATTGTCT GATTCTGATA 2950 

AGATTACTTT AGATGTTGCT ACCTTGATTA AAGAAGATTT CTTGCAACAA 3000 

15 AATGGTTATT CTTCATATGA TGCATTCTGT CCAATTTGGA AGACTTTTGA 3050 

TATGATGAGA GCATTTATTT CATATTATGA TGAAGCACAA AAAGCAATTG 3100 

CCAATGGTGC TCAATGGTCT AAATTAGCTG AAAGTACTAG TGATGTTAAA 3150 

CATGCTGTTT CTTCAGCTAA ATTCTTTGAA CCATCAAGAG GTCAAAAAGA 32 00 

AGGTGAAAAA GAATTTGGAG ATTTATTAAC CACTATCTCC GAAAGATTTG 3250 

20 CTGAAGCTTC AGAATAA 3267 



2) INFORMATION FOR SEQ ID NO: 685 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

(C) ACCESSION NUMBER: D17462 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 



TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TTATGGCAGA 50 

AAATATGTCA GAAGCAAGTA TTCAAGACAT GTGTTTAGTG GGAGATTTAG 100 

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150 

GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAACCTG TTCGTTCCAC 2 00 

45 TGGGGAAGCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 250 

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 300 

AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 350 

ATGGTGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGCTGGAG 400 

ACATCATTGG GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 4 50 

50 GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 500 

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550 

AGCTGACGAT GATGCAAAAA TGGCCAGTAC GTCGTGGTCG ACCAATCAAA 600 

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650 

CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700 

55 TTGGTGCAGG GAAGACAGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 750 

GTAGATCTAG TGGTTTACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 800 

GGATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 850 

CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 900 

GCTGCTCGAG AAGCTTCTAT TTATACGGGA ATCACGATTG CCGAGTACTT 950 

60 CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 10 00 
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GGGCAGAAGC 
GATGAAGGTT 
ACGTTCAGGA 
TCACTGCCAT 
5 GTGACTCAAA 
TAGTCTTGCT 
ACTCATTATA 
CAGGATTGGT 
AGAACAATTA 

10 ATAACGATCG 
TTACAACAAA 

ACAATTCAAC" 

AAGCCTTATC 
GCGGTCAGAG 

15 AGCCAAAATT 
TTGTTTCAGA 



PCT/CA00/01150 



ACTGCGTGAA 
ATCCCGCTTA 
CGTGTCATTG 
CAGTGCGGTT 
ATACCTTACG 
CAAAAAAGAC 
TTCAACAGAA 
CTGATATGGT 
AATGAAATCG 
CTTGACCCTT 
ATGCTTTTGA 
ATGTTGAAAG 
GTTGGGAGCG 
AACGCATTAG 
AGTAGTATAA 
AGGAGGGATG 



ATGAGCGGAC 
TCTGGGCTCT 
CTCTAGGCTC 
TCTCCTTCTG 
TGTGGTGAAG 
ATTTTCCATC 
GTTGGCAGAT 
AACTGAAGGC 
TGCGCTTGGT 
GAAGTAGCAA 
TGATGTAGAT 
TTATTTTGAC 
TACTTCAATG 
TCGGAGCAAG 
ATGAAGAAAT' 
ACCGATGATT 



GTTTAGAAGA 

CGTCTAGCTG 

TGACCAACGT 

GTGGAGATAT 

GTTTTCTGGG 

GATTAACTGG 

ATATGGATCA 

ATGCGGATCT 

AGGGATCGAT 

AATCGATTCG 

ACGTTTACTT 
-TTTTGGGAAA- 

AAATCATGGA 
TATATTCCAG 
CAAAGAAACG 
AA 



AATGCCTGGT 
AATACTATGA 
GAGGGCAGTA 
CTCTGAACCA 
GATTAGATTC 
ATCCAAAGTT 
AATCTTACAA 
TGCAAGAAGA 
TCGCTTTCTG 
AGAAGACTAT 
CAAGAGAAAA 
GAAGCTCGAA 
AGGTACAGTA 
AAGAAGAGTT 
ATCCAATTGA 



105CT 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
- -1-6 00- 
1650 
1700 
1750 
1782 



20 2) INFORMATION FOR SEQ ID NO: 686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 

(C) ACCESSION NUMBER: Genome project 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686 

CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 

CGGATATGTT AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 

GGTTAAAAGC AGAAGTGATT GAAGTTGCTG ATCAAGAAGT CAAGGTTCAG 150 

40 GTATTTGAAG ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 200 

AGGACATCTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 250 

TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 300 

TTGCAGAGAG GCAAGCATGT TAATGCTATT TCTGATCATA ATTTATGGAA 350 

TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GGAGATCTTC 400 

45 TAGGAACAGT ACCTGAAGGA CGATTTACTC ATAAGATTAT GGTTCCTTTT 450 

TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAACCTA 500 

TAATGCTCAT ACTGTGGTCG CAAAAGCTCG AGATGCTCAG GGTAAAGAAT 550 

GTGCCTTTAC TATGGTGCAA AGATGGCCGA TCAAACAAGC TTTTATTGAA 600 

GGAGAGAAGA TCCCTGCGCA TAAGATTATG GATGTGGGTT TGCGAATCTT 650 

50 AGATACGCAA ATTCCAGTAT TGAAGGGGGG AACTTTCTGT ACCCCAGGAC 700 

CTTTTGGTGC AGGGAAAACA GTCTTACAAC ACCATCTTTC TAAGTACGCT 750 

GCTGTAGATA TTGTGATTTT GTGTGCGTGC GGAGAGCGTG CTGGTGAAGT 800 

TGTTGAGGTA TTACAAGAGT TCCCTCATCT TATCGACCCC CATACCGGAA 850 

AGTCTTTAAT GCACAGAACA TGTATTATTT GTAACACATC ATCCATGCCT 900 

55 GTGGCTGCCC GAGAGTCTTC GATCTATTTA GGAGTGACGA TTGCAGAATA 950 

CTATCGCCAG ATGGGACTAG ATATTCTGCT TTTAGCTGAT TCTACATCCC 1000 

GATGGGCACA AGCCCTTAGA GAGATTTCGG GACGTCTTGA AGAAATCCCT 1050 

GGAGAGGAAG CATTTCCTGC ATACCTGTCT TCTAGAATAG CTGCTTTTTA 1100 

TGAGCGAGGA GGAGCTATCA CCACGAAAGA TGGTTCTGAA GGATCTTTAA 1150 

60 CTATATGTGG TGCGGTGTCT CCTGCAGGAG GAAACTTTGA AGAACCAGTC 1200 

389 
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ACTCAATCTA CATTAGCTGT AGTCGGAGCG TTCTGTGGTC TTTCAAAAGC 1250 

ACGAGCTGAC GCACGTAGGT ATCCTTCAAT AGACCCTTTG ATTTCTTGGT 1300 

CAAAATATTT GAACCAGGTA GGACAAATTT TAGAAGAGAA GGTTTCAGGC 1350 

TGGGGTGGTG CTGTGAAAAA AGCAGCACAG TTTCTAGAGA AAGGTTCAGA 1400 

5 AATCGGCAAG CGTATGGAAG TTGTCAGTGA AGAAGGGGTT TCTATGGAAG 1450 

ACATGGAAAT CTACTTAAAG GCAGAACTTT ATGATTTTTG TkATCTCCAG 1500 

CAGAACGCAT TCGATCCTGT GGACTGTTAT TGTCCTTTTG AGAGACAGAT 1550 

AGAGTTATTT TCATTAATCA GTCGTATTTT TGATGCTAAA TTTGTTTTTG 1600 

ATAGTCCTGA TGATGCAAGA AGCTTTTTCC TTGAGCTGCA GAGCAAGATT 1650 

10 AAGACATTAA ATGGCCTGAA ATTTCTTTCA GAGGAATATC ATGAGAGTAA 1700 

AGAGGTCATA GTTAGACTGT TGGAAAAAAC AATGGTACAA ATGGCGTAAG 1750 

GATATGCAAA CAATCTACAC AAAAATAACT G 1-78-1- 



15 



2) INFORMATION FOR SEQ ID NO: 687 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1758 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halobacterium salinarum 
(C) ACCESSION NUMBER: S56356 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 

ATGAGTCAAG CTGAAGCAAT CACTGACACC GGCGAAATCG AGAGCGTGAG 50 

CGGTCCGGTC GTGACCGCCA CGGGCCTCGA CGCGCAGATG AACGACGTCG 100 

TCTACGTGGG CGACGAGGGT CTGATGGGCG AGGTCATCGA GATCGAAGGC 150 

35 GACGTAACCA CCATCCAGGT CTACGAGGAG ACGTCCGGCA TCGGGCCGGG 2 00 

CCAGCCCGTC GACAACACGG GCGAACCGCT CACCGTGGAC CTGGGCCCGG 250 

GGATGCTGGA CTCCATCTAC GATGGTGTGC AGCGTCCACT GGACGTCCTC 300 

GAAGACGAGA TGGGGGCGTT CCTCGATCGC GGTGTTGACG CACCCGGCAT 350 

CGACCTCGAC ACCGACTGGG AGTTCGAGCC CACCGTCGAG GCGGGCGACG 400 

40 AGGTCGCGGC CGGCGATGTC GTCGGCACCG TCGACGAAAC GGTCAGCATC 450 

GAACACAAGG TTCTGGTGCC CCCCCGCAGT GACGGCGGCG AAGTCGTCGC 500 

CGTCGAATCC GGCACGTTCA CCGTCGACGA CACGGTCGTC GAGTTGGACA 550 

CCGGCGAGGA GATCCAGATG CACCAGGAGT GGCCGGTCCG CCGCCAGCGC 600 

CCCACCGTCG ACAAGCAGAC GCCGACGGAG CCGCTGGTGT CCGGCCAGCG 650 

45 CATCCTCGAC GGCCTGTTCC CGATCGCGAA AGGCGGGACG GCCGCGATCC 700 

CGGGGCCGTT CGGGTCCGGG AAGACGGTCA CCCAGCAGTC CCTCGCGAAG 750 

TTCGCCGACG CGGACATCGT TGTCTACATC GGCTGTGGTG AGCGCGGCAA 800 

CGAGATGACG GAAGTCATCG AGGACTTCCC GGAGCTGCCC GACCCCCAGA 850 

CCGGGAACCC GCTGATGGCC CGCACCACGC TCATCGCGAA CACGTCGAAC 900 

50 ATGCCGGTTG CCGGGCGTGA GTCCTGCATT TACACGGGAA TCACCATCGC 950 

GGAGTACTAC CGCGACATGG GCTACGACGT GGCGCTGATG GCCGACTCCA 1000 

CCTCGCGGTG GGCGGAGGCC ATGCGGGAGA TCTCCTCGCG ACTCGAGGAG 1050 

ATGCCCGGCG AGGAGGGGTA TCCCGCGTAC CTGGCCGCCC GCCTCTCGGA 1100 

GTTCTACGAG CGCGCCGGCT ACTTCGAGAA CTTCAACGGG ACCGAGGGCT 1150 

55 CCATCTCGGT CATCGGTGCG GTGTCGCCGC CCGGCGGGGA CTTCTCCGAG 1200 

CCGGTCACCC AGAACACGCT GCGCATCGTG AAGACGTTCT GGGCGCTTGA 1250 

CTCGGACCTC GCCGAGCGCC GGCACTTCCC GGCGATCAAC TGGGACGAGT 1300 

CCTACAGCCT CTACAAGGAC CAACTCGACC CGTGGTTCAC GGACAACGTC 1350 

GTCGACGACT GGGCCGAGCA GCGCCAGTCG GCGGTCGACA TCCTCGACGA 1400 

60 GGAATCCGAA CTCGAAGAGA TCGTGCAGCT CGTCGGGAAG GACGCGCTGC 1450 

390 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CCGAGGACCA GCAGCTCACG CTGGAAGTCG 
TGGCTCCAGC AGAACGCGCT CCACGACGTG 
GAAGACGTAC GCCATCCTCT CCGGCATCAA 
TTGAGGCGTT GGACGCCGGT GTGCCAGTCG 
5 GCCGCGCCGC GCCTGAATCG TCTCGGCACG 
GGCGGAGGTC GCGGAGATCA AACAGCAGAT 
TCTACTGA 



PCT/CAOO/01150 

CGCGGTACAT CCGCGAGGCG L500 

GATCGCTACT GCCCGCCCGA 1550 

GACGCTTCAC GAGGAGTCCT 1600 

AGGAGATCAC GTCGATCGAC 1650 

ACGCCCGACG ACGAGCACGA 1700 

TACCGAGCAG CTTCGGGAGC 1750 

1758 



10 



2 ) INFORMATION FOR SEQ ID NO: 688 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3118 bases 

(B) TYPE: Nucleic acid . 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(C) ACCESSION NUMBER: L09234 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688 



GAATTCCGGC 
TCATTCCAAA 
GTCCAAATTC 

3 0 AACGAATGGC 

GAACTGGTCG 
AGTATACGAA 
CAGGCAAGCC 
TTCGACGGTA 
35 TATCTACATC 
AGTGGGACTT 
GGTGACCTGT 
GCTGCTGCCG 
GAAACTACAC 

4 0 CGATCAAAGT 

TGTTACAGAA 
TGCTCGACTC 
GGGGCCTTCG 
CTCAAACTCC 

4 5 AAATGTCAGA 
GGTGTGACTG 
AAACATGCCT 
TGTCAGAATA 
TCAACTTCAC 

50 TGAAATGCCT 
CCAGTTTCTA 
AGGGAGGGCT 
CTTCTCAGAT 
GGGGTCTCGA 

55 TGGCTCATCT 
CAAGAATTTC 
TGCAGGAGGA 
TCATTGGCAG 
GGATGATTTC 

60 CATTCTACAA 



AGCTGACTAG TCTTGTGATT GGGGTCCTGG 
TGACGAGCAC ATTGATAAAG ACGTCCGATG 
GGCTTTGTTT TTGCCGTATC TGGACCTGTG 
CGGTTCTGCT ATGTACGAAC TGGTGCGTGT 
GAGAGATCAT CCGGTTGGAG GGTGACATGG 
GACACCTCAG GTGTGACAGT AGGCGATCCC 
GCTGTCCGTG GAACTGGGAC CCGGAATCAT 
TCCAGCGACC GCTGAAGGAT ATCAATGAAC 
CCGAAAGGTG TCAATGTGCC TGCCCTGAGT 
CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA 
ACGGTTTGGT CCACGAAAAT ACTCTGGTGA 
CCCCGTGCCA AGGGAACTGT CACGTACATT 
AGTTGATGAT GTTGTCCTGG AGACAGAATT 
TCACCATGCT GCAAGTGTGG CCTGTACGTC 
AAGTTGCCAG CTAACTACCC CCTCCTTACT 
CCTATTCCCG TGTGTCCAGG GTGGAACAAC 
GATGTGGCAA GACTGTAATA TCACAGTCTT 
GATGTAATTA TCTATGTAGG TTGTGGTGAG 
AGTACTCAGG GATTTCCCGC AGTTGTCGTT 
AATCAATCAT GAAGAGAACA GCCCTGGTCG 
GTGGCTGCTC GAGAAGCATC TATCTACACA 
CTTCAGGGAC ATGGGTTACA ATGTATCCAT 
GATGGGCCGA AGCTCTTCGA GAAATCTCAG 
GCCGACAGCG GTTATCCCGC CTACCTAGGT 
CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG 
CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC 
CCCGTGACGA CGGCCACACT AGGTATCGTC 
CAAGAAACTT GCCCAGCGAA AGCACTTCCC 
CGTACAGTAA ATACATGCGT GCTCTGGATG 
CCAGAGTTTG TCCCACTGCG TACAAAGGTG 
AGAAGACCTG TCTGAAATTG TGCAGTTGGT 
AAACTGACAA GATCACACTT GAGGTTGCCA 
CTGCAACAGA ACAGCTATTC ACCATATGAC 
GACAGTAGGA ATGCTGAAAA ATATGATTGC 

391 



GCTGATAAAA 


50 


AGGACCGGGA 


100 


GTGACAGCTG 


150 


CGGTTATTAT 


200 


CAACAATCCA 


250 


GTGCTGCGCA 


300 


GGGCAGCATC 


350 


TGTCAAATAG 


400 


CGCACTGCAC 


450 


CATTACTGGT 


500 


AACACAAGTT 


550 


GCAGAACCTG 


600 


TGACGGCGAG 


650 


AGCCCAGGCC 


700 


GGCCAGCGTG 


750 


AGCTATTCCT 


800 


TGTCAAAATA 


850 


CGAGGTAATG 


900 


GGAGATTGAT 


950 


CAAACACATC 


1000 


GGTATTACAC 


1050 


GATGGCTGAC 


1100 


GTCGATTGGC 


1150 


GCACGACTTG 


1200 


TAACCCAGAC 


1250 


CCGGTGGAGA 


1300 


CAGGTGTTCT 


1350 


ATCCATCAAC 


1400 


ACTTCTACGA 


1450 


AAGGAGATTT 


1500 


CGGTAAAGCT 


1550 


AACTATTAAA 


1600 


CGTTTCTGCC 


1650 


TTTCTACGAT 


1700 
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ATGTCTCGGC ATGCAGTTGA ATGTACTGCT CAGAGCGAGA ACAA^ATCAC 17S& 

TTGGAATGTT ATTAGAGATT CTATGGGCAA TATTCTGTAT CAGCTTTCCT 1800 

CCATGAAATT CAAGGATCCA GTCAAGGATG GAGAAGCGAA GATCAAGGCA 1850 

GACTTTGAGC AGCTTCATGA AGACATTCAG CAAGCCTTCA GGAACCTGGA 1900 

5 GGATTAAAGT GGTAGCTGCC AGTGGTTCTC TCGGTGCAGT TGTCACATTT 1950 

GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT CGGTGCTAGA CACCTGAGCA 2000 

TTCCTTTGCC ACATAAAGAC TAAAGCAGGT GGAATTTCAG TTGTAAAAAG 2 050 

CTGGTTCCAT TGGTGCTAAG ATTATGTTGT GCCCTTTTCT GCTTCTCACA 2100 

TTCCAACAGA GGAATTTACT TCCAGTTTTC TTCCATTTTC CTCCTCATTT 2150 

10 TAAGTGTCGG TACAGAGGCA ATAATCTGAT AACTCTGTAC CGTCACTTAC 2200 

AAGCAGGGAG AATTTGTAAT TATTACAAAT CCCATTATCT CTGTGCACCA 2250 

CAGCCTTGTA~AATTCATTTG~ TCCCAGGACT-CCCTCTTGTG— TGTACGTGAG 2-300- 

ATTGCCGTCT GTATGTATGT ACACACCGTA CTGCAGTATT TGAAGTCAGT 2350 

CAGAAGGTGA ATTACACCAC TTACTCATTG TGTCACGTAG CAAGTGTGCA 24 00 

15 AACTGCCATC CATTGTCCTA TTTATTCACA TAACTAGTTT TCTTTGCATT 2450 

TCCAGTGTTG CAAATTGTGT TTAGAAAATT ATGCCATCGA GACTGGTCGA 2500 

ACCTCACATT GTAACTCAGT ATTTACACAC ACGTTTACTT GCTACAGAAA 2550 

TGTAGAAAAA ATAATTGTTG TATATTGAAA GTACAAGTGA CAAAGTTGCA 2600 

TTTAAAATGG TGAATGTATT TTATATTTCT TTTGTAGACA CAAGAGTTAA 2650 

20 TGCATTTTGC TTAATGGAGA TGTATGTAAA CCTAAAATAG CAGTTTGTGC 2700 

ACAAATTATG TATATGTGAA ATGGAGATGG TTTCTAATTT GCTGATTGAT 2750 

TGCCAGTATT AATTTAAACA ACTGTAGTTG TGGGATGTAG TGGGAAGATT 2800 

TTTTTTTTCC TATAAAATTG GTGGATGTAT GTGTCGGAGA TTTTGATTGT 2 850 

ATGTGTAAAA TAGTGATCCC AGTAACTGTA AAGCTTTAGA ATACAGTTAC 2 900 

25. TGACTGTATA GTTGTACAGG TGTTGTTACT TTTAAGAATT TATTGACACA 2 950 

AAGGTGAAAG TCTATTATTG TATTGTAATG TTTAAAGCAT TTAAGGTTTA 3000 

AAAATCCTAC TTCTGTGTAT AAATGTTACC ATTCCTCATA TAACATAACT 3050 

GTGTAGAAAT ACAGTCAACT TCATGTTCAT TAGCATTTCA CTGTTGTCAC 3100 

ATAAATTATG CCCGGAAT 3118 

30 



2) INFORMATION FOR SEQ ID NO: 689 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmodium falciparum 

(B) STRAIN: 3D7 

(C) ACCESSION NUMBER: L08200 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689 



ATGACAAAAG 
GGCTGGTTCA 
AGTTAGCTAA 
GAAGGGAATT 
TGTAGGAGAC 
GTCCTGGTAT 
AGAATAGCAA 
GACATCTTTA 
TAAAATTAAA 
GAAAATAAAT 
AGGGAGGCTT 



TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG 
TTAGTTATTG CTGAGAATAT GAGTGGAACT 
AGTAGGATGG AATAAATTGG TTGGAGAAAT 
ATGCATATAT ACAAGTTTAT GAAGATACTT 
CCTGTTATAA AAACAGGAAA TGCTTTATCA 
TTTAGATAAT ATTTATGATG GTATTCAAAG 
ATGTGTGTGG TGATGTATAT ATATATAAAG 
GATCATGATA AACAATGGCA ATTTTATGCT 
TGATATTGTT ACTGGTGGAG ATATCTTTGG 
TATTTAAAGA ACACAAAATT ATGGCTCCAC 
ACATATATTG CTCCAGATGG ATCATATACT 

392 



TTTATAAGGT 50 

CGTATGTACG 100 

TATTAGATTA 150 

CAGGTTTATC 200 

GTCGAATTGG 250 

ACCATTAGAA 300 

GTATTGATAT 350 

GATAAGAAAT 400 

ATTTGTTGAT 450 

CTAATGCTAA 500 

TTAAAAGATA 550 
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AAATATTTGA ATTAGAATAT CAAGGAAAAA AATATACATA TSGTTTATCT 600 
CATTTATGGC CTGTTCGTGA TCCTAGACCT GTTTTAGAAA AGGTAACAGG 650 
GGATACTTTA TTATTAACAG GGCAAAGAGT TTTAGATTCG TTATTTCCAA 700 
CAGTTCAAGG AGGTACTTGT GCTATTCCTG GTGCATTTGG TTGTGGAAAA 750 
5 ACTTGTGTTT CTCAGGCCTT ATCAAAATAT TCTAATAGTG AAGTTATTAT 800 
ATATGTAGGA TGTGGTGAAA GAGGTAATGA AATGGCTGAA ATTTTATCCG 850 
ACTTTCCTGA ATTAACTACT AAAGTAGATA ATGAAGATGT AGGTATTATG 900 
CAAAGAACGT GTTTAGTTGC TAATACTTCT AACATGCCTG TCGCTGCAAG 950 

AGAAGCTAGT ATTTATACAG GTATTACTTT ATGTGAATAT TTCCGTGATA 1000 

10 TGGGTTATAA TGCTACCATG ATGGCTGATA GTACAAGTAG ATGGGCAGAA 1050 

GCCTTAAGAG AAATTTCAGG ACGTTTAGCT GAAATGCCTG CAGATAGTGG 1100 

TTATCCAGCT ~TATTTAGGTG~CTAGATTAGC~TTCCTTTTAT-GAAeGTGCAG- 1-1-50 - - 

GAAAAGTCAA ATGTATTGGT TCTCCATCTC GTATAGGATC CATTACAATT 1200 

GTGGGTGCTG TGTCTCCACC AGGTGGTGAT TTCTCTGACC CTGTAACTAC 1250 

15 AGCAACCATG TCTATTGTTC AAGCATTTTG* GGGGTTAGAT AAAAAACTAG 1300 

CTCAAAGAAA ACATTTCCCT TCTGTTAATT GGTCTACATC CTTTTCAAAG 1350 

TATGTCAGAC AATTAGAACA ATACTTTGAT AATTTTGATC AAGATTTCTT 14 00 

ATCTTTAAGA CAAAAAATTA GTGATATTTT ACAACAAGAA AGTGACTTGA 1450 

ATGATATTGT TCAACTAGTA GGAAAGGATT CATTATCAGA AGACCAAAAA 1500 

20 GTTGTTATGG AAGTAGCCAA AATTATTAGA GAAGATTTTC TTCAACAAAA 1550 

TGCATTTAGC GATTATGATT ATATGTGCCC ATTACAAAAA ACAGTTGGTA 1600 

TGATGAGAAT TATTTGCCAC TTTTATGCTC AATGCTTAAG AACATTACAA 1650 

GAATATGACT CAAGAGAAAG AAAAATTGGT TGGGGATCTA TATATAATAC 1700 

ATTAAGACCA ACTATAAATA AAATTACACA TATGAAATTT GAAAACCCAA 1750 

25 AAAATTCAGA TGAATATTTC AAAAAGTATT TTAAGGCACT TGAAGAAGAA 1800 

ATAACAGTAG GTTTAAGAAA CTTGATGGAA AAATGA 1836 



30 2) INFORMATION FOR SEQ ID NO: 690 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3216 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J05409 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690 

ATGGCTGGTG CAATTGAAAA CGCTCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGCCATGTA CGAATTGGTC 150 

50 AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 200 

CAAGGCCACC ATCCAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 250 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 300 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACTCCAGCTT 400 

55 TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 450 

GATCATATTT CCGGTGGTGA TATTTACGGT TCCGTTTTTG AGAATTCGCT 500 

AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 

CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 600 

GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 

60 TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 700 

393 
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TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT 7&0 

GGTACGACAT GTATTCCAGG TGCTTTTGGT TGTGGTAAGA CCGTTATCTC 800 

TCAATCTTTG TCCAAGTACT CCAATTCTGA CGCCATTATC TATGTCGGGT 850 

GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 

5 ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950 

TCGTGAGGTA ATTAAATTGC CCAGAGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA GTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 

10 ATTTTGAAGT TATTACTTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 1200 

AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGCCTGAG AGAGCCAACG~AATTAGTAGA~ATCCTATAGA -AAGGeTTCAA 1-300 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 1350 

GGTTCCCATG TTCGTAAAGC TACCTACCAG ACTTACGCTC CAATTCTTTA 1400 

15 TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 1450 

CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 1500 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGCTGA AAAGTTGAAT TTGTGCGCCG 1600 

AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

20 TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 1700 

TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TCTACGGACA ATATCGGTAC TCGTGAAACA 1800 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 1850 

TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 1900 

25 TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950 

GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2000 

TATGTCTGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2050 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 2100 

GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

30 TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 2200 

TCGTCCATAA TTGCGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 2250 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 2300 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 2350 

GAGAAGCTTC TATTTACACT GGTATCACTC TTGCAGAATA CTTCAGAGAT 2400 

35 CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA GATGGGCTGA 2450 

AGCTTTGAGA GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2500 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2600 

CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2650 

40 CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 2700 

GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 2750 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 2800 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2850 

GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2900 

45 GATTACTTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2950 

ATGGTTACTC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3000 

ATGATGAGAG CCTTCATCTC GTATCATGAC GAAGCTCAAA AAGCTGTTGC 3050 

TAATGGTGCC AACTGGTCAA AACTAGCTGA CTCTACTGGT GACGTTAAGC 3100 

ATGCCGTTTC TTCATCTAAA TTTTTTGAAC CAAGCAGGGG TGAAAAGGAA 3150 

50 GTCCATGGCG AATTCGAAAA ATTGTTGAGC ACTATGCAAG AAAGATTTGC 3200 

TGAATCTACC GATTAA 3216 



55 2) INFORMATION FOR SEQ ID NO: 691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1860 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

394 



SUBSTITUTE SHEET (RULE 26) 
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PCT/CA00/01150 



(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



5 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S47814 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 



ATGGCGGGAG GAATTGAACT GGCCAAGAAG GCTATCAGGA - GCCTCAAAAA: 
TTACGACGAG CATGAAAACC GATATGGATC TATTTTCAGC GTTTCTGGTC 
CTGTCGTTGT TGCAGCCAAT ATGCTTGGAT GTTCGATGTA CGAACTCGTT 

15 CGCGTTGGTC ATGAAGAACT AGTTGGTGAA GTAATTCGTA TCCATCAAGA 
TAAATGTACT ATTCAAGTAT ACGAAGAGAC GTCCGGTCTC ACTGTTGGTG 
ATCCTGTCCA ACGCACTGGA AAGCCATTAT CTGTTGAATT AGGTCCTGGT 
TTAGCTGAGA CTATTTATGA TGGTATCCAA CGTCCGTTAA AGCAAATTTT 
CGACAAATCT CAAAGTATTT ATATTCCTAG AGGTATTAAT ACAGAATCAC 

20 TTAATCGTGA GCATAAGTGG GATTTCACAC CAAATAAGGA TTTACGCATT 
GGCGATCATG TATCCGGTGG TGATGTTTTC GGTTCTGTAT TTGAAAACTC 
TCTTTTCAAT GATCATAAAA TTATGTTACC CCCTAGAGCC CGTGGTACCG 
TCACATATAT TGCTGAAGCT GGATCATACC ATGTTGATGA AAAACTTCTT 
GAAGTCGAGT TTAATGGCAA GAAACATTCT TTTAGTATGT TGCATACTTG 

25 GCCTGTCCGT GCTGCTCGTC CAGTTGCGGA CAACTTAACT GCTAATCAAC 
CTTTATTGAC TGGTCAACGT GTTTTGGATG CGTTATACCC CTGTGTTCAA 
GGTGGCACTA CTGCTATCCC CGGTGCCTTT GGTTGTGGTA AAACAGTTAT 
TTCACAATCT CTTTCTAAGT ACTCTAATTC TGATTTGATT GTTTACGTCG 
GTTGTGGTGA ACGTGGAAAC GAAATGGCAG AAGTGTTAAT GGATTTCCCA 

30 GAACTAACAA TTGATATTAA TGGTAAACCA GAGCCCATTA TGAAGCGTAC 
TACATTGGTA GCCAACACTT CTAACATGCC TGTCGCTGCT CGTGAAGCTT 
CCATTTATAC CGGTATTACA CTTGCTGAAT ATTATCGTGA TCAAGGTAAG 
AACGTTTCAA TGATGGCTGA TTCTACATCT CGTTGGGCTG AAGCTTTGCG 
TGAAATTTCT GGTCGTTTGG CTGAGATGCC TGCCGATTCT GGTTATCCCG 

35 CTTATTTGGG TGCCAAATTG GCTTCTTTTT ACGAACGTGC TGGTCGTGCT 
CGTTGCTTGG GAAGTCCTGA CCGTGAAGGA ACAGTTTCAA TTGTTGGAGC 
TGTTTCTCCT CCGGGTGGTG ATTTTTCTGA TCCTGTTACT AGTGCAACCT 
TGGGAATTGT TCAAGTCTTC TGGGGTTTGG ACAAGAAATT GGCCCAACGT 
AAACACTTTC CCTCAATCAA CACCTCTCTT TCCTATTCTA AATACATCAA 

40 TGCTTTGCAA CCTTGGTATG AGGAAAGAGT TCCAGGCTTT AATACTCTTC 
GTGATCAAAT CAAACAGATC ATTCAACAAG AAGATTCCAT GTTGGAAATT 
ATTCAGTTGG TTGGTAAGTC GGCTCTTTCT GAAACGGATA AAGTTACTTT 
GGACATAGCC GGTATTATTA AGAATGACTT CTTACAACAA AACGGTTATT 
CTGATTACGA TCGCTGTTGC CCTCTTTACA AGACTTATCA TATGATGCGA 

45 AACATGATTG CTTACTACAC AAAGGCTAAA AGTGCCGTTG AAACTGGTAG 
CGTTCCTTGG TCAAAGATTA AAGAAAGTAC TTCAGATATC TTTTATGAGT 
TAACCTCGAT GAAATTCGAA AACCCTAATG AAGGCGAGAA GGAAATAGTC 
GAACACTATG AAACTCTGCA CAAGAAGATT GAGGACAAGT TTCACACTCT 
GACTGAGTAA 
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2) INFORMATION FOR SEQ 



ID NO: 692 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 
5 (B) STRAIN: IL3000 

(C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 

10 ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 50 

GAAGGCCGTC _TCCGGGCCAG J^GTCATTGC_TGAAAACATG GGCGGTAGCG 100 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT~TCCGGTTAGT~GGGCGAGATC 1-50 

ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 200 

TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG CCTCTTTCGC 250 

15 TTGAGCTTGG ACCTGGAATC ATGTCTGAAA< TATTTGACGG TATCCAGCGG 300 

CCTCTTGACA CCATCTACCG CATGGTGGAA AACGTGTTTA TCCCCAGGGG 350 

CGTTCAGGTG AAGTCACTCA ATGACCAGAA ACAGTGGGAC TTTAAGCCAT 400 

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450 

GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 500 

20 GCGGGGCCGT GTTACTTCCA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550 

ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAATCAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 700 

CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGCGCGTT TGGATGCGGA 750 

25 AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800 

TATCTATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 900 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC TCAAATATGC CTGTCGCTGC 950 

TCGTGAGGCG TCTATTTACA CTGGCATCAC TTTAGCTGAG TATTATCGTG 1000 

30 ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050 
GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG . 1100 

TGGTTACCCT GCGTACCTCA GTGCGCGTCT TGCTTCCTTC TACGAGCGTG 1150 

CGGGGCGCGT GACATGCATC GGTGGGCCAA AACGCGAGGG CTCAGTAACC 1200 

ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACCCAGTGAC 1250 

35 GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 13 00 

TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 1350 

AAATACCTTA ATGCTTTGGA GCCCTTCTTC AACACGCTTG ACCCTGACTA 1400 

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 1500 

4 0 AAAATTATTC TAGAAACGGC TAAGGTTATT CGTGAAGAGT TTCTCCAGCA 1550 

GAATGCCTTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGACCTGCT 1600 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 1700 

GATTCCTCAT ATTTACACGG GTTTAACTGA GATGAAGTTC CGTGATCCTC 1750 

4 5 AGGAGGGTGA GGAGGCCAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 1800 

ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 1833 



50 2) INFORMATION FOR SEQ ID NO: 693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Thermus thermophilus 

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 

ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 
GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 
GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 

10 GTCTACGAGG ACACCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 200 
GGGCCTTCCC TTGGCGGTGG AGCTCGGCCC CGGGATGCTG AACGGCATCT 250 

_AQ3AQ^ 300- 

TACATCACCC GGGGCGTGGT GGTCCACGCC CTGGACCGGG AGAAGAAGTG 350 
GGCCTGGACG CCCATGGTCA AGCCCGGGGA CGAGGTGCGG GGGGGTATGG 400 

15 TCCTGGGCAC GGTGCCCGAG TTCGGCTTCA CCCACAAGAT CCTGGTACCC 450 
CCGGACGTGC GGGGCCGGGT CAAGGAGGTG AAGCCCGCCG GGGAGTACAC 500 
CGTGGAGGAG CCGGTGGTGG TCCTCGAGGA CGGCACCGAG CTCAAGATGT 550 
ACCACACCTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 600 
CCCAACACCC CCTTCCTCAC GGGGATGCGC ATCCTGGACG TCCTCTTCCC 650 

20 CGTGGCCATG GGGGGCACCG CCGCCATCCC TGGGCCCTTC GGCAGCGGCA 700 
AGACCGTGAC CCAGCAGTCC CTGGCCAAGT GGTCCAACGC CGACGTGGTG 750 
GTCTACGTGG GCTGCGGGGA GCGGGGG AAC GAGATGACCG ACGTGCTCGT 800 
GGAGTTCCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 850 
GCACCGTCCT CATCGCCAAC ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 900 

25 GCCAGCATCT ACGTGGGCGT GACCATCGCC GAGTACTTCC GCGACCAGGG 950 

CTTCTCCGTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 1000 

TGCGCGAGAT CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 1050 

CCGCCCTACC TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 1100 

GGTCATCACC CTGGGCGGCG AGGAGGGGGC GGTGACCATC GTGGGGGCCG 1150 

30 TCTCCCCGCC GGGCGGCGAC ATGTCCGAGC CCGTGACCCA GTCCACCTTG 1200 

AGGATCGTGG GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 1250 

CCACTTCCCC GCCATCAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 13 00 

CCCTTGACCC CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 1350 

CGCGACGCCA TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 14 00 

3 5 CGTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGCCGAG CGCCTCGTCA 1450 

TTGAGGTGGG CCGGATCATC CGCGAGGACT TCCTGCAGCA GAACGCCTAC 1500 

CACGAGGTGG ACGCCTACTG CTCCATGAAG AAGGCCTACG GGATCATGAA 1550 

GATGATCCTC GCCTTCTACA AGGAGGCGGA GGCGGCCATC AAGCGGGGGG 160 0 

TTTCCATAGA CGAGATCCTG CAGCTCCCCG TTCTGGAGCG CATCGGCCGC 1650 

40 GCCCGCTACG TGAGCGAGGA GGAGTTCCCC GCCTACTTTG AGGAGGCCAT 1700 

GAAGGAGATC CAGGGGGCCT TCAAGGCTGG CCTAAAGGGG GAGAGATGGA 1750 

CCTTCTGA 1758 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 694 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
CGGCGCIATC YTSGTTGTTG C 21 
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PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 695 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi)~SEQUENCE _ DESCRTPTIONr-SEQ--rD-Ne:-695 

GTTTCACGTG ATGACGTACA 20 

15 

2 ) INFORMATION FOR SEQ ID NO: 696 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
30 ATIGGICAYR TIGAYCAYGG IAARAC 26 



35 



2) INFORMATION FOR SEQ ID NO: 697 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 

45 

CCIACIGTIC KICCRCCYTC RCG 23 



50 2) INFORMATION FOR SEQ ID NO: 698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from J01690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 

5 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC TCTGACCGCT GCAATCACCA 100 

CCGTACTGGC TAAAACCTAC GGCGGTGCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

10 CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA GACTGCCCGG 250 

GGCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300 

: GGCGCGATCC" TGGTAGTTGC _ TGCGACTGAC _ GGCCCGATGC _ CGCAGACTCG 3 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 450 

15 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 500 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 550 

AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 700 

20 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 750 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 900 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 950 

25 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC GGCCGTACCG 1150 

TTGGCGCGGG CGTTGTTGCT AAAGTTCTGG GCTAA 1185 

30 

2 ) INFORMATION FOR SEQ ID NO: 699 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
'(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699 
45 GTIACIGGYT CYTYRARRTT ICCICC 26 



2) INFORMATION FOR SEQ ID NO: 700 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700 

60 
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TIRTIGAYGT CGARTTCCCT CARG 



2) INFORMATION FOR SEQ ID NO: 701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii> MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 701 
GTGTTCACGA TCATCGATGC G 21 



2) INFORMATION FOR SEQ ID NO: 702 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702 
CTCTCGATAT CCGCGAAGCG 20 



2) INFORMATION FOR SEQ ID NO: 703 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 
TATGGAAATT CGAAACATCT 20 



PCT/CA00/01150 

24 



2) INFORMATION FOR SEQ ID NO: 704 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

400 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704 
5 AGTGCTCCAA TTAATGTTGG 20 



10 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 705 

(i) SEQUENCE CHARACTERISTICS: 
(A) — LENGTH:- 20 bases 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705 

20 

GTACAGTTCC AATACCTGAA 20 

25 2) INFORMATION FOR SEQ ID NO: 706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 

TGAAATCTTC ACATCCAACA 20 



2) INFORMATION FOR SEQ ID NO: 707 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 25 

2) INFORMATION FOR SEQ ID NO: 708 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 26 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC IATIAT 
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2) INFORMATION FOR SEQ ID NO: 709 

(i) SEQUENCE CHARACTERISTICS:- 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 

(C) ACCESSION NUMBER: extracted from AE001122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 



ATGAATGAAG 
AGTAATTCGT 
CAAAAGGGAT 
ACAGTTGAAC 
AAATCCTTTG 
GAGTATATTT 
ACCTCCAAAG 
AATTGAGGGA 
ATTCTTATAA 
GAGCAAATTG 
AATGTCTTTT 
GCCTTATTCC 
TTTTTCCCAG 
AGCAGGAAAA 
ATGTAGTGAT 
ACTCTTAAAG 
AATGGACAGG 
CTAGAGAAGC 
CAAATGGGCC 
TCAAGCAATG 
AGGCTTTTCC 
GCAGGTATTG 
TGGCTCTGTA 
CAACTTTAAA 
GATGCTAGGA 
TAAAGGCGTT 
TGAAAGGTAA 
ATAAGTAACG 
GTGCTATTTG 
CAGAGCGTCA 
AACTTTGAGT 
AAGGCAAAAT 



TTTTATTTGT 
ATTAGGGGCA 
ATCTGTTGGA 
TCGGACCAGG 
CCTGAATTGG 
AAGGCCCTTG 
TTGGAGATAT 
ACTGTTCACC 
AATTGTGGAG 
CTGTAATTGA 
CATTGGCCTG 
TAGTGAACCT 
TTGCCAAAGG 
ACGGTTCTTC 
TATTGCAGCT 
AATTTCCCGA 
ACTTGTATTA 
TTCTGTTTAT 
TTGATATTCT 
AGAGAAATGT 
GGCATATCTT 
TAGTTCTTAA 
AGTCCTGCTG 
AGTTGTAGGA 
AATTTCCAGC 
ATTGATCAAA 
TGAAATTAAT 
ATGATTTTTT 
CAGCAAAATT 
AAATTATATG 
TTTCTGATAA 
CTTTTAGACA 



AAAGACTGCT GGTAGGAATT 
ATGAAGTTGA TGCACAGGTT 
GACCTAGTTG AATTTACAGA 
GCTTTTAACT CAAGTATATG 
CTATTCAATG TGGATTTTTT 
AATAAAGATA AAAAGTGGAA 
CGTTATTGCA GGAGATTTTT 
ATCAAATAAT GATTCCATTT 
ATTGTAAGTG ATGGCGACTA 
AGATGATTCT GGTATGAGGC 
TTAAAGTTCC TATTACTAAT 
ATGTTGACTC AAACTAGAAT 
TGGAACTTTT TGCATTCCGG 
AGCAGGTTAC AAGTCGAAAT 
TGTGGTGAGC GAGCAGGAGA 
ATTAATGGAT CCAAAAACCG 
TTTGTAATAC ATCTTCAATG 
ACTGCTATTA CTATTGGTGA 
TCTTTTGGCA GATTCAACTT 
CTGGACGCCT TGAGGAAATT 
GAGTCTGTTA TTGCTTCCTT 
TAATGGGGAT ATTGGATCTG 
GTGGTAATTT TGAAGAGCCA 
GCATTTCACG GGCTTACAAG 
TATTAGTCCT CTTGAATCTT 
AAAAGACTGA ATATGCAAGA 
CAAATGATGA AAGTTGTTGG 
AATTTATTTA AAATCCGAGC 
CATTTGATTC TATTGATGCT 
TTTGATATAG TTTATAACAT 
ACTTCAAGCA AGAGATTTTA 
TGAATCTTTC TTCTTTTAAG 
402 



TAAAAGCAGA 
TTTGAATTGA 
CAAACTTTTA 
ATGGGCTTCA 
TTAGAAAGGG 
TTTTAAAAAA 
TAGGTTTTGT 
TATAAAAGGG 
TTCGATTGAT 
ATAATATTAC 
TATAAGGAAC 
TATAGATACA 
GTCCTTTTGG 
GCTGATGTTG 
AGTGGTAGAA 
GCAAATCTTT 
CCAGTTGCAG 
GTATTACAGG 
CAAGATGGGC 
CCTGGCGAGG 
TTATGAAAGG 
TAACAGTTGG 
GTTACTCAAG 
AGAAAGGTCT 
GGAGTAAATA 
TCTTTTTTGG 
AGAAGAAGGC 
TACTTGATTC 
GCTGTTAGTT 
TCTTAAAACT 
TAAATGAGTT 
GATCATAAGT 
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TTAATAAATT GGAGCATGCT TTGGGTGAAT TGATAAATTT TAAAAAG^TA 1650. 

ATTTAG 1656 

5 

2) INFORMATION FOR SEQ ID NO: 710 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1818 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

—(D) — TOPOLOGY-:— Line ar 

(ii) MOLECULE TYPE: Genomic DNA 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: extracted from AE000520 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 

GTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 50 

CATTGTGTAT GCCGAGGGCC TCTCTGCGTG CAGCGTATAC GATGTTGTCG 100 

25 ACGTAGGGGA AGCATCGCTC ATCGGAGAAA TTATCCGGTT GGATGAGAGC 150 

AAGGCGGTCG TGCAAGTATA CGAGGATGAC ACAGGTATGC GAGTCGGGGA 2 00 

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 250 

TAATCGGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 300 

CAAGAAGACG GCGCCTTCTT GCGTCCTGGT GCGCGTTCAC AACCGCTTGA 350 

30 TGGCTCCGTA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 4 00 

CCCTGTGCGC GGGGATTCCG ATTGCACCTG GGTCAGTGTT AGGGACCGTG 450 

CAGGAGACTC CTTCTGTTGT GCACACTATC ATGGTTCCTC CTGACATCCG 500 

GGGGAGCGTG CTATCTTCGT TCAAGGGCGC AGGTGCTTAC ACAATAGATG 550 

AAGAAATTGG ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 600 

35 TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 650 

GCCACTAGTT ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 700 

AGGGAGGAAC GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 750 

ACGCAGCATG CCGTTGCCAA GTGGTGTGAT GCAGATATTA TCGTGTACAT 800 

CGGCTGCGGA GAGCGGGGCA ACGAGATGAC AGACGTGCTC TCTGAATTTC 850 

40 CCAAACTCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGGACGATT 900 

TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950 

GTATTCAGGG ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 1000 

TGGCCATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 1050 

TTGTCTGGGC GCATGGAAGA AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100 

45 CCTTCCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150 

CCTGTGTGGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 1200 

CTGGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 1250 

CCGTTGCTTT TGGGCCTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 1300 

CTGCCATTGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 1350 

50 GCATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 1400 

CTTGGATTTG CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 1450 

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 1500 

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550 

TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 1600 

55 ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650 

GCGCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 1700 

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750 

GCACTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 1800 

GGGGAGAAAG TCGAATGA 1818 

60 

403 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



10 



2) INFORMATION FOR SEQ ID NO: 711 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
15 (B) STRAIN: MoPn 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 

CAGGGCTATG TCGTAGAAGC TTACGGAAAT TTATTGCGGG TGCATTTTGA 50 

20 TGGGCATGTG CGTCAAGGAG AAGTGGCCTA TGTCAGCGTG GATGATACTT 100 

GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 150 

GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 200 

CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 250 

TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 300 

2 5 TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 350 

TTATACGCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 400 

TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 450 

TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 500 

CACTGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 550 

3 0 TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 600 

GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 650 

AGATACTCAG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 700 

CTTTTGGTGC AGGAAAGACC GTTTTACAGC ACCATTTATC TAAGTATGCA 750 

GCTGTAGATA TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 800 

35 TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 850 

AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 900 

GTAGCAGCTA GAGAGTCCTC CATTTATTTG GGTATTACTA TAGCAGAATA 950 

TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGCTGAC TCGACATCTA 1000 

GATGGGCTCA AGCTTTAAGG GAAATTTCAG GGCGATTAGA AGAAATCCCT 1050 

4 0 GGAGAAGAAG CTTTCCCAGC CTATTTGGCG TCTCGAATAG CAGCTTTTTA 1100 

TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 1150 

CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCCTGTT 1200 

ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTCTAAGGC 1250 

TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 1300 

45 CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 1350 

TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 1400 

AATTGGTAAG CGAATAGAAG TTGTTGGGGA AGAAGGGATT TCTATGGAAG 1450 

ATATAGAAAT CTTTTTGAAA TCAGAGTTGT ATGATTTCTG TTACTTACAG 1500 

CAAAACGCTT TCGATGCAGA GGACTGTTAT TGTCCTTTTG ATCGTCAAAT 1550 

50 AGAGCTTTTT TCTTTAATGA GTCATATTTT TAGCTCTAGA TTCTGTTTTG 1600 

ATTGTCCAGA TAATGCTCGG AGTTTCTTTT TAGAGCTTCA AAGTAAAATT 1650 

AAAACGCTGA ATGGTCAAAA ATTCCTTTCT GAAGACTATC AGAAGGGGCT 1700 

AGAAGTGATC TATAAACTAT TAGAAAGCAA AATGGTGCAG ACGGCGTAGG 1750 

TATGCAAACA ATATATACAA GAATTACGG 1779 

55 



2) INFORMATION FOR SEQ ID NO: 712 

60 (i) SEQUENCE CHARACTERISTICS: 

404 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Enterococcus faecalis 
10 (B) STRAIN: V583 

(xi)— SEQUENCE-DESCRIPTION-:— SEQ— ID-NO-: — 742 



GTGCAAATTG GAAAAATTGT CAAAGTTTCA GGTCCTTTGA TTTTAGCTGA 50 

15 AAACATGTCA GATGCTAGTA TCCAAGACAT TTGTCATGTA GGAGATTTAG 100 

GCGTTATCGG AGAGATTATT GAAATGCGAG GCGACGTCGC TTCGATTCAA 150 

GTATATGAAG AAACAACAGG CATTGGACCA GGAGAACCAG TTATTTCAAC 200 

AGGAGAACCA TTATCTGTTG AATTAGCCCC AGGTTTAATT GCCGAAATGT 250 

TTGATGGTAT TCAACGACCA TTGGATACAT TTCAAGAAGT AACCCACAGT 300 

20 AACTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 350 

ATGGACGTTT GAACCAACTG TGGCAGTTGG TGAAGAAGTG TCGGCAGGTG 400 

ACATCGTCGG TGTGGTTCAA GAAACACCGA TTATTCAACA TAAAATTATG 450 

GTGCCTTTCG GCGTTTCAGG AACGATTGCC GAAATTAAAG CAGGTGACTT 500 

TGCCATTGAT GAAACAGTTT ACTCAGTGGA AACGGCTAAA GGAACGGAAA 550 

25 GTTTTAGCAT GATGCAAAAA TGGCCCGTTC GGCGGGGACG TCCCATTTTA 600 

GAAAAACTAA GTCCCAAAGT ACCGATGGTG ACCGGACAAC GCGTAATTGA 650 

TACCTTTTTC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 700 

TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGCCGAT 750 

GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 800 

30 AGATGTTTTA AATGAATTTC CAGAATTAAT TGACCCAACA ACTGGTGAGT 850 

CTTTGATGAA TCGGACGATT TTAATTGCGA ATACGTCAAA TATGCCGGTA 900 

GCGGCACGGG AAGCCTCGAT TTATACAGGG ATTACCATTG CAGAATATTT 950 

CCGTGATATG GGTTA 965 



35 



50 



2) INFORMATION FOR SEQ ID NO: 713 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1737 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina barker! 

(C) ACCESSION NUMBER: extracted from J04 836 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 



GTGGAAGTAA AAGGTGAAAT TTATCGTGTG TCTGGGCCTG TCGTCACCGC 50 

CATCGGCTTG CAGGCAAAAA TGTATGACCT GGTCAAAGTC GGTAATGAAG 100 

55 GTTTAATGGG TGAAGTCATT CAGATATTAG GGCCCAAGAC CATCATCCAG 150 

GTATATGAAG AGACCGCAGG TATCAAGCCA GGGGAACCCT GTGTATCTAC 200 

AGGGTCGTCT CTGTCCGTAG AACTTGGTCC GGGTCTTCTT TCCAGTATTT 250 

ATGACGGGGT TCAAAGGCCT CTGCACGTCC TGCTTGAAAA AATGGGTAGC 300 

TTCATCCAGA GAGGTGTCAG CGCAGATGGG CTTGATCATA AGAAACTCTG 350 

60 GGATTTCAAA CCCATTGTCA AGAAGGGCGA TTCCGTAAAA GGTGGAGACG 4 00 

405 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TAATTGGTGT TGTACAGGAA ACCGTGAATA TTGAACATAA GATC&TGGTG ~ ' ~4~5to 

CCTCCTGATA TCTCAGGTAC AATTTCCGAC ATAAAGAGCG GAAACTTTAC 56 0 

GGTAGTAGAC ACAATCTGTA CTCTGACTGA TGGGACCGAA TTGCAGATGA 550 

TGCAGAGGTG GCCTGTTCGA AGACCCAGAC CTGTGAAGGC AAAACTTACT 600 

5 CCAACCAGGC CTCTGGTTAC AGGAATGAGA ATCCTTGATG GGCTTTTCCC 650 

TGTGGCAAAA GGCGGAACAG CTGCAATCCC CGGACCTTTC GGATCGGGAA 700 

AGACCGTAAC TCAGCAGTCG CTTGCAAAAT GGAGTGATAC CGAAATTGTG 750 

GTCTACATCG GTTGTGGTGA GCGTGGAAAC GAAATGGCAG ATGTTCTGAG 800 

CGAATTCCCT GAACTCGAAG ATCCGCAGAC CGGGCGCCCA CTTATGGAGC 850 

10 GTACTGTTCT TATCGCTAAC ACTTCAAACA TGCCTGTGGC CGCAAGAGAA 900 

GCATCTGTGT ATACCGGAAT CACCATTGCA GAATACTACC GTGACATGGG 950 

ATTAGATGTA -TCGGTT-ATGG- GAGAGTCGAG— GTGAAGGTGG-GCAGAAGCCA 1000 

TGAGAGAAAT CTCTTCCCGT CTGGAAGAAA TGCCTGGTGA AGAAGGTTAC 1050 

CCAGCATACC TGTCTGCAAG ACTGGCCGAA TTCTACGAGC GTGCCGGGGT 1100 

15 TGCGGAGAGT CTTTGCGGCG AAACAGGTTC CATTACTGTT ATTGGAGCAG 1150 

TATCTCCACC TGGCGGTGAC TTCTCAGAGC CTGTTACACA GAATACCCTG 1200 

CGTATCGTAA AAGTGTTCTG GGCTCTCGAT GCCAAACTAT CTCAGAGGCG 1250 

TCACTTCCCG GCCATCAACT GGCTGAACAG TTACAGTCTG TATAAGGACA 1300 

GTCTTAATGA CTGGTTTGCA GATAATGTGG CTCCTGATTA TGTGCCTTTG 1350 

20 AGGGAAAGAG CAATGGAAAT GCTCCAGACA GAATCTGAAC TGCAGGAAAT 1400 

CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC AGACGACCAG CAGCTTCTGC 1450 

TTGAAATCAC CCGTATGCTT AGGGAAATTT TCCTGCAGCA GAATGCATTC 1500 

CACCCAGTAG ATGCATACAG CCCGTTCGAT CAGCAGTACA AGATCCTTAA 1550 

GGCAATCATG AAATGGGGAG ACGCTGCGAT GGATGCCTTG AAATCAGGTG 1600 

25 TTCCCGTAAC TGAAATTATC AAGCTTGAAT CCAAAAATGT GCTTGCTAAG 1650 

GTCAAGTACG AAGAGAAGTT TGATGAGTCT ATGAATGCTG TCCTGGCACA 1700 

GATGGATAAA GAGTTTGCAT CCCTGAGAGG TAGGTAA 1737 



30 



40 



2) INFORMATION FOR SEQ ID NO: 714 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1785 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jannaschii 

(C) ACCESSION NUMBER: extracted from U67477 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 

ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50 

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100 

AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150 

50 CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 200 

ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 250 

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATTCAGAG GCCTTTAACA 300 

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCCCAAGAG GAGTTGATGT 350 

CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 400 

55 AAGGAGATTA TGTTGAAGAA GGAGACATAA TTGGAACTGT TGATGAAACT 450 

CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500 

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550 

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600 

CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650 

60 ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTCTTTACA TTAGCAAAAG 700 

406 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GAGGAACAGC 
CAGCATCAGT 
ATGTGGAGAA 
ACTTAGAAGA 
5 ATAGCCAACA 
TACAGGAATT 
TATTAACAGC 
TCAGGTAGAT 
AGCTTCAAGA 

10 TAGGGAAAGA 
CCACC AG GAG 
AGTTAAGGTA 
TCCCAGCTAT 
ACAGAGTGGT 

15 TGAAGCAATG 
AGTTAGTTGG 
GTTGCAAGAA 
GGTAGATACC 
TTATGACATT 

20 CCAGCTAAGA 
ATACATCCCA 
AGAAAATTAA 



PCT/CA00/01150 

AGCAATTCCA GGTCCATTCG GTTCAGGAAA AACC5GTTACT "750 

TGGCAAAGTG GTCTGACGCT GATGTCGTTG TTTATATCGG 800 

AGAGGAAACG AGATGACAGA GGTTATTGAA GAGTTCCCAC 850 

TATTAGAACT GGAAACAAAT TAATGGATAG AACTGTATTA 900 

CATCAAACAT GCCTGTCGCT GCAAGGGAAG CATCTGTCTA 950 

ACAATTGCAG AGTACTTCAG AGATATGGGT TATGGAGTTT 1000 

AGATTCAACA TCAAGATGGG CAGAGGCAAT GAGAGAAATT 1050 

TGGAAGAAAT GCCAGGGGAA GAAGGGTATC CAGCATACTT 1100 

TTGGCTCAGT TCTATGAAAG AGCTGGAAGA GTTATAACCT 1150 

TAACAGACAA GGATTCGTTT GTATCGTTGG AGCTGTTTCA 1200 

GGGACTTCTC AGAACCAGTT ACATCAAACA CACTAAGGAT 1250 

TTCTGGGCGT _ TAGATGCAAA — CTTGGCAAGA-AGAAGAGAGT 1-300- 

CAACTGGTTG CAGAGTTATT CATTATACAT TGATGATGTT 1350 

GGAACACAAA TACTGGTCCA GATTGGAGAC AATTAAGAGA 1400 

AGCTTATTAC AAAAAGAGGC AGAGTTGCAA GAGATTGTTC 1450 

GCCTGATGCA TTGCCAGATA GGGAGAGAGT TATTTTAGAA 1500 

TGTTGAGGGA GGATTTCTTA CAGCAAGATG CGTTTGATGA 1550 

TACTGTCCTC CAATGAAACA GTACTTAATG TTAAAGATAA 1600 

CTACCAAGAA GCATTGAAGG CAGTTGAAAG AGGAGTTGAA 1650 

TTTTAGGAGT TTCAGTTAAG CAAGATATTG CAAGAATGAA 1700 

CACGATGAGT TTATAAATGT TAAATCAAAA GAAATAATGG 1750 

GAATGAATTA GGTTCATTAA ACTAA 1785 



25 



2) INFORMATION FOR SEQ ID NO: 715 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1354 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: W83 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 

TGAGCGAGGT GATCAAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 50 

GAAAGTACTC GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 100 

TATGCTTGAG GTAACGCTCG GCCCCGGTAT GCTTTCGAAG AACTACGACG 150 

45 GTCTGCAACA CGACTTGGAC AAGATGGACG GGATCTTCCT CAAACGAGGC 200 

GATTATACTC CCGCTCTCGA TGACGACAAG CTGTGGGACT TCAAGCCTTT 250 

GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 300 

CGGAAAATTT CCAACCGCAC AAGATCATGG TACCTTTCGT TTTCGAAGGC 350 

AATTACAAGG TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 400 

50 TGTGATCGCT GTGGTAACGG ATCAGGACGG GAAAGACCAC AATGTAACCA 450 

TGGTGCAGAA ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 500 

CCGCGTCCTT TCAAACTGCT CGAAACGGGT ATCCGTATCA TCGACACTTT 550 

CAACCCCATC GTAGAGGGTG GTACGGGATT TATCCCCGGT CCTTTCGGTA 600 

CGGGAAAGAC GGTGCTCCAG CATGCTATCT CGAAGCAGGC GGAAGCCGAT 650 

55 ATCGTGATCA TTGCAGCCTG TGGCGAGCGT GCAAACGAGG TTGTGGAGAT 700 

CTTTGCGGAA TTCCCCCACC TGAATGACCC CCACACGGGA CGTAAATTGA 750 

TGGAACGTAC CATTATTATT GCTAATACGT CGAATATGCC TGTGGCTTCG 800 

CGTGAGGCAT CCGTATATAC GGCCATGACG ATAGCCGAGT ACTATCGCTC 850 

CATGGGCCTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 900 

60 AGGCTCTGCG TGAGATGTCT AACCGTCTGG AAGAGCTTCC CGGACCGGAT 950 

407 



SUBSTITUTE SHEET (RULE 26) 



10 



20 



WO 01/23604 PCT/CAOO/01150 

GCTTTCCCGA TGGACTTGTC AGCTATCGTA GCCAACTTCT ACG^TCGTGC LrfOO 

AGGATACGTT TACCTGAACA ACGGTTCGGC CGGTTCGGTA ACGTTCATCG 1050 

GTACGGTATC TCCCGCCGGT GGTAACCTCA AAGAGCCTGT GACGGAAAAC 1100 

ACCAAGAAAG TGGCTCGCTG CTTCTATGCT TTGGAGCAGA ATCGTGCCGA 1150 

CCGCAAACGT TATCCGGCTG TAAACCCCAT CGATAGTTAC TCGAAGTACA 1200 

TCGAATATCC CGAATTCGAG AGCTATATAT CGAACCACAT CAGTTTACTC 1250 

ATTATATTCA ACAGAAGTTG GCAGATATAT GGATCAAATC TTACAGCAGA 1300 

CTGGACTACT AAGGTGAATG AGCTGAAGAT GCGCTTGCAT CAGGGTAAAG 1350 

AAAT 1354 



2) INFORMATION FOR SEQ ID NO: 716 

15 (i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 1788 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
25 (B) STRAIN: Type 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 

TTTGACTCAA GGGAAGATTA TAAAAGTATC GGGACCTCTA GTTATTGCAT 50 

30 CAGGTATGCA GGAGGCTAAT ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 100 

GGGTTAATCG GTGAAATTAT TGAAATGAGA AGAGATCAGG CATCTATCCA 15 0 

AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT GTTGTTACAA 200 

CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 250 

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA 30 0 

35 TGATTTTCTA GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA 350 

AGTGGCATTT TGATTCCACT ATAGCAATTG GTCAAAAAGT GAGTACGGGT 4 00 

GATATTCTTG GAACTGTCAA GGAAACCGAG GTAGTTAATC ATAAAATTAT 450 

GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTCTATTGCA TCTGGCGATT 500 

TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 550 

4 0 TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT 600 

TTCTAAACGT TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA 650 

TTGATGCATT CTTTCCAGTA ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA 700 

CCGTTTGGAG CAGGAAAGAC AGTTGTACAA CACCAAGTAG CTAAATTTGC 750 

CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT GGAAATGAAA 800 

45 TGACGGATGT ACTGAATGAG TTTCCTGAGT TGATTGACCC TAATACCGGA 85 0 

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC 900 

TGTTGCTGCT CGTGAGGCTT CAATTTATAC AGGAATTACC ATGGCTGAGT 950 

ATTTTCGTGA TATGGGCTAC TCTGTCGCCA TTATGGCTGA TTCAACTTCA 1000 

CGTTGGGCAG AAGCGCTACG TGAAATGTCA GGACGTCTAG AAGAAATGCC 1050 

50 TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC GCTGAATATT 1100 

ATGAAAGAGC AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 1150 

ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA 1200 

ACCAGTTACT CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG 1250 

ATGCTCCGTT GGCACAGCGA CGTCATTTTC CTGCAATTAA CTGGCTTACA 1300 

55 TCTTATTCAC TATATAAAGA CAGTGTGGGC ACTTATATAG ATGGTAAAGA 1350 

GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC TACTTACAAC 1400 

GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 1450 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA 1500 

TTATTTGCAA CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG 1550 

60 CAAAACAAGA AGCAATGCTA AGTAATATTC TCACTTTTGC TGATCAGGCA 1600 

408 
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AATCATGCTT TAGAGTTGGG 
CGTGGCAGTT CGAGACCGTA 
GATTAGATGA AATCAAAATT 
TTGATATTAG AAACAGGAGG 



TTCTTACTTT ACAGAGATTA 
TGGCGAGAAG TAAATATGTT 
ATATCAAATG AGATTACACA 
TCTATAAATG AGTGTTAT 



PCT/CAOO/01150 

TGGAAGGTAC L650 
TCAGAAGATA 1700 
TCAAATTCAT 1750 
1788 



2) INFORMATION FOR SEQ ID NO: 717 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 
(-B-) — TYPE: Nucleic acid — 



15 



(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 
20 (B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 

CGATCCTGGT GTGCTCGGCC GCTGACGGCC CGATGCCGCA AACGCGTGAG 50 

25 CACATCCTGC TGGCGCGTCA GGTCGGTGTG CCGTACATCA TCGTGTTCCT 100 

GAACAAGTGC GACATGGTGG ACGACGCGGA GCTGCTCGAG CTGGTCGAAA 150 

TGGAAGTGCG CGAACTGCTG TCGAAGTACG ACTTCCCGGG CGACGACACG 200 

CCGATCATCA AGGGTTCGGC GAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 250 

GCTGGGCGAA GTGGCGATCA TGAACCTGGC CGACGCGCTG GACACGTACA 300 

3 0 TCCCGACGCC GGAGCGTGCG GTCGACGGCG CGTTCCTGAT GCCGGTGGAA 350 

GACGTGTTCT CGATCTCGGG CCGTGGTACG GTGGTGACGG GTCGTGTCGA 400 

GCGCGGCGTG ATCAAGGTTG GCGAGGAAAT CGAAATCGTC GGTATCAAGG 450 

CGACGGCGAA GACGACCTGC ACGGGCGTGG AAATGTTCCG CAAGCTGCTG 500 

GACCAGGGTC AGGCGGGCGA CAACGTCGGT ATCCTGCTGC GCGGCACAAA 550 

35 GCGTGAAGAC GTGGAGCGCG GCCAGGTTCT GGCGAAGCCG GGTTCGATCA 600 

CGCCGCACAC GCACTTCACG GCAGAAGTGT ACGTGCTGAG CAAGGACGAA 650 

GGCGGCCGCC ACACGCCGTT CTTCAACAAC TACCGTCCGC AGTTCTACTT 700 

CCGTACGACG GACGTGACGG GCTCGATCGA GCTGCCGAAG GACAAGGAAA 750 

TGGTGATGCC GGGCGACAAC GTGTCGATCA CGGTGAAGCT GATCGCGCCG 800 

40 ATCGCGATGG AAGAAGGTCT GCG 823 



2) INFORMATION FOR SEQ ID NO: 718 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
CO STRANDEDNESS: Double 

50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

60 GCGATCCTGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 50 

409 
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GCACATCCTG CTGGCGCGTC AGGTCGGTGT GCCGTACATC ATSGTGTTCC /LOO 

TGAACAAGTG CGACATGGTG GACGACGCGG AGCTGCTCGA GCTGGTCGAA 150 

ATGGAAGTGC GCGAACTGCT GTCGAAGTAC GACTTCCCGG GCGACGACAC 200 

GCCGATCATC AAGGGTTCGG CGAAGCTGGC GCTGGAAGGC GACAAGGGCG 250 

5 AGCTGGGCGA AGTGGCGATC ATGAACCTGG CCGACGCGCT GGACACGTAC 300 

ATCCCGACGC CGGAGCGTGC GGTCGATGGC GCGTTCCTGA TGCCGGTGGA 350 

AGACGTGTTC TCGATCTCGG GCCGTGGTAC GGTGGTGACG GGTCGTGTCG 400 

AGCGCGGCGT GATCAAGGTT GGCGAGGAAA TCGAAATCGT CGGTATCAAG 450 

GCGACGGCGA AGACGACCTG CACGGGCGTG GAAATGTTCC GCAAGCTGCT 500 

10 GGATCAGGGT CAGGCGGGCG ACAACGTCGG TATCCTGCTG CGCGGCACGA 550 

AGCGTGAAGA CGTGGAGCGC GGCCAGGTTC TGGCGAAGCC GGGTTCGATC 600 



ACGCCGCAGA - CGCACTTCAC ~GGCTGAAGTG"TACGTGGTGA-GGAAGGACGA 650- 

AGGCGGCCGC CACACGCCGT TCTTCAACAA CTACCGTCCG CAGTTCTACT 700 
TCCGTACGAC GGACGTGACG GGCTCGATCG AGCTGCCGAA GGACAAGGAA 750 
15 ATGGTGATGC CGGGCGACAA CGTGTCGATC ACGGTGAAGC TGATCGCGCC 800 
GATCGCGATG GAAGAAGGTC TGCG 824 



20 2) INFORMATION FOR SEQ ID NO: 719 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited as 
Clostridium butyricum) 

(B) STRAIN: ATCC 8260 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719 

TGTATCAGCA GCAGATGGTC CAATGCCACA AACAAGAGAA CATATACTAC 50 

TAGGATCAAG AGTTGGTATC CAATATATCG TAGTATTCTT AAATAAAGCA 100 

GATATGGTAG ACGATCCAGA ATTATTAGAA TTAGTAGAAA TGGAAGTAAG 150 

40 AGAATTATTA AGCGAATATG ACTTCCCAGG AGACGATATT CCAGTAATAA 200 

CAGGATCAGC ATTAAAAGCA TTAGAAAATC CAACAGATGA AGAAGCAATT 250 

AAGCCAATCA TGGATTTAAT GGAAGCAGTA GATAGCTATA TCCCAACTCC 300 

AGAAAGAGCA ACAGATAAGC CATTCTTAAT GCCAATCGAA GATGTATTCA 350 

CAATTACAGG AAGAGGAACA GTTGCAACAG GAAGAGTTGA AGCTGGAGTA 400 

45 CTTCATGTAG GAGATGAAGT AGAAATCGTT GGATTAACAG AAGAAAAGAA 450 

GAAAGTTGTA GTAACTGGAA TCGAAATGTT CAGAAAGTTA TTGGATGAAG 500 

CACAAGCTGG AGATAACATC GGAGCATTAT TAAGAGGAGT TCAAAGAACT 550 

GATATTGAAA GAGGTCAAGT TTTATCAAAA CCAAATTCAG TACACCCTCA 600 

CACTAAATTT GTAGGTCAAG TATACGTACT TAAAAAAGAA GAAGGTGGAA 650 

50 GACATACTCC ATTCTTTGAT GGATACAGAC CACAATTCTA TTTCAGAACA 700 

ACAGACGTTA CAGGRTCAAT CAAGTTACCA GATGGAATGG AAATGGTAAT 750 

GCCTGGAGAT CACATTGATA TGAATGTTGA ATTAATCACT CCAATCGCAA 800 



55 

2) INFORMATION FOR SEQ ID NO: 720 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
60 (B) TYPE: Nucleic acid 

410 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720 



30 



40 



GGTGCTATCC TGGTTGTTGC TGCATCTGATGGTCCTATG'C _ CTCA"GACTCG 50 

TGAGCACATC CTGCTTGCTC GTCAGGTAGG TGTTCCTTAC ATCGTTGTAT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAACTGAT CGACCTTGTT 150 

15 GAAATGGAAG TACGTGAGCT GTTAAGCGAG TACGGATTCG ACGGAGATAA 200 

CGCTCCGGTT ATCCGTGGTT CTGCACTGAA GGCTCTGGAA GGTGACGACA 250 

AATACGTTGG CGCTATCAAA GAACTGATGG ATGCAGTTGA TGAATTCATC 300 

CCAGATCCAA CTCGTGAAAC TGACAAACCA TTCCTGATGT CTGTAGAAGA 350 

CGTTATGACA ATCACAGGAC GTGGTACAGT TGCTACAGGA CGTGTTGAGC 400 

20 GTGGGGTAGT AAAACTGGGA GAAGAAGTTG AAATCGTTGG TATCAAGGAT 450 

ACTCAGAAAA CTGTTGTTAC CGGACTGGAA ATGTTCCGTA AGCAGCTGGA 500 

CTTCGCAGAA TCCGGAGACA ACATCGGTGC TCTGCTGCGT GGTATCAACC 550 

GTGACCAGAT TCAGCGTGGA CAGGTTCTTG CTAAACCAGG ATCCGTACAT 600 

CCACACACAA AGTTCAAGGC TCAGGTTTAT GTATTAACAA AAGAAGAAGG 650 

25 TGGACGTCAC ACTCCATTCG TTTCTAACTA CCGTCCTCAG TTCTACTTCC 70 0 

GTACAACTGA CGTAACTGGT GTTATTACAT TACCGGAAGG AACTGAAATG 750 

GTTATGCCTG GTGACAACGT TGAAATGAAC GTTGAGCTGA TTGCTCCAA 799 



2) INFORMATION FOR SEQ ID NO: 721 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 789 bases 

35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721 

TTGTATCAGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATCCTA 50 

TTAGCATCAA GAGTAGGAGT TAACCACATA GTAGTATTCT TAAACAAAGC 100 

AGACCAAGTA GATGATCCAG AATTACTAGA ATTAGTAGAA ATGGAAGTAA 150 

50 GAGAATTATT AAGCGAATAC GGATTTGACG GAGACGAATG TCCAGTAGTA 200 

GTAGGATCAG CATTAAAAGC AATCGAAGAA GGGGATGACC AATGCATCCT 250 

AGACTTAATG AAAGCTGTAG ATGAATATAT CCCAACTCCA GAAAGAGCAA 300 

CAGATCAACC ATTCTTAATG CCTGTAGAAG ATGTATTTAC AATTACAGGA 350 

AGAGGAACAG TTGCAACAGG AAGAGTTGAA AGAGGAGTAC TACACGTAGG 400 

55 AGATGAAGTA CAAATCGTAG GAATGAAAGA AGAAATCGGA AAGACAACAA 450 

TCACAGGAGT AGAAATGTTC AGAAAGATGT TAGATGAAGC AATGGCTGGA 500 

GATAACATCG GAGCATTATT AAGAGGAGTA CAAAGAGACG AAATCGAAAG 550 

AGGTCAAGTA CTAGCAAAAC CAGGTTCAGT AACACCTCAC AAAAAATTCG 600 

TAGGTCAAGT TTACGTATTA AAGAAAGAAG AAGGTGGAAG ACACACTCCA 650 

60. TTCTTTAACG GATACAGACC ACAATTCTAC TTCAGAACAA CAGACGTAAC 700 

411 
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AGGATCAATC GCTTTACCAG AAGGAGTAGA AATGGTAATG CCAGGAGACC 75JD 
ATATAGACAT GAACGTAGAA TTAATCACAC CAGTAGCAA 789 



5 

2) INFORMATION FOR SEQ ID NO: 722 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 798 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
CD) — TOPOLOGY:— Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 

GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 50 

GCATCAAGAG TTGGTGTTGA CTATATCGTA GTATTCTTAA ACAAGGCAGA 1.00 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150 

25 AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATCAAG 200 

GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 250 

TCCAATCTTA GAATTAATGG AAGCAGTAGA TAGCTACATT CCAACACCAG 300 

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 350 

ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 400 

30 TCATGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GAAAGCAGAA 450 

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCA 500 

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 550 

TATCGAAAGA GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGCCACACA 600 

GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 650 

35 CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 700 

AGACGTTACT GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 750 

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCAA 798 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 723 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 

GTTCAGCAGC AGATGGTCCA ATGCCTCAAA CAAGAGAACA CATACTACTA 50 

GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG GAAGTAAGAG 150 

60 AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATAAAG 200 

412 
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GGTTCAGCTT TACAAGCATT 
TCCAATCCTT GAGTTAATGG 
AAAGAGCAAC AGATAAGCCA 
ATCACTGGTA GAGGAACAGT 
TCACGTAGGA GACGAAGTAG 
AAGTAGTAGT AACAGGAATA 
CAAGCTGGAG ACAACGTAGG 
CATCGAAAGA GGTCAAGTTT 
AGAAATTTGT AGGTCAAGTA 
CATACTCCAT TCTTCGATGG 
AGAYGTTACT GGTTCAATCA 
eAGGAGAGGA- GATTGATA-TG 



AGAAAACCCA 
AAGCTGTAGA 
TTCTTAATGC 
TGCTACAGGA 
AAATCGTTGG 
GAAATGTTCA 
AGTTCTTTTA 
TAGCAAAAGT 
TACGTACTTA 
ATACAGACCA 
AGTTACCAGA 
-AACGT-TGAA-T- 



ACAGATGAAA ^AGCAATCGC 
TAGCTACATT CCAACTCCAG 
CAGTAGAAGA TGTATTCACA 
AGAGTTGAAA GAGGAGTTCT 
ATTATCAGAA GACAGCAGAA 
GAAAGTTACT AGACGAAGCG 
AGAGGAGTTC AAAGAACTGA 
TGGATCAGTT AAGCCACACA 
AAAAAGAAGA AGGTGGAAGA 
CAATTCTACT TCAGAACAAC 
TGGAATGGAA ATGGTTATGC 
-T-AATCACICA-AGIAGCTAT- . 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
-799. 
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35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 724 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 

TAGTAAGTGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATACTA 50 

TTAGCATCCA GAGTTGGAGT TGAGCACATA GTAGTATTCT TAAATAAAGC 100 

AGACCAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 150 

GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 200 

GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 250 

ACAATGCATA ATGGACTTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 300 

CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AGATATCTTC 350 

ACAATCACAG GAAGAGGAAC AGTTGCAACA GGAAGAGTAG AAAGAGGAAT 4 00 

TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 450 

AGAAATCAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 500 

GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTCAAAGAGA 550 

TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAACCAC 600 

ATAAGAGTTT TACAGGTCAA GTATATGTAT TAAAGAAAGA AGAAGGAGGA 650 

AGACACACTC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 700 

AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 750 

TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 800 

A 801 



2) INFORMATION FOR SEQ ID NO: 725 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725 



TATGCCTCAA ACTCGTGAAC ACATCTTGTT ATCTCGTAAC GTTGGTGTTC 50 

CTTACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGATGAAGAA 100 

10 TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATACGA 150 

CTTCCCAGGC GACGACACTC CAGTTATCGC TGGTTCAGCT TTGAAAGCTT 200 

TAGAAGGCGA -TGCTTCATAC- GAAGAAAAAA- TGTTAGAA-T-T— AATGGGTGGT 2 SO 

GTTGATGAAT ATATCCCAAC ACCAGTTCGT GATACTGACA AACCATTCAT 300 

GATGCCAGTC GAAGATGTAT TCTCAATCAC TGGTCGTGGA ACTGTTGCAA 350 

15 CTGGTCGTGT TGAACGTGGA CAAGTTCGCG' TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTG CTGAAGCAAC TGCTAAAACA ACTGTTACAG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGATT ACGCTGAAGC AGGCGATAAC ATTGGTGCAT 500 

TGTTACGTGG TGTTGCACGT GAAGACATCC AACGTGGACA AGTATTGGCT 550 

AAACCAGCTT CAATCACTCC ACATACAAAA TTCTCTGCAG AAGTTTACGT 600 

20 TTTAACTAAA GAAGAAGGCG GACGTCATAC TCC 633 



2) INFORMATION FOR SEQ ID NO: 726 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 5 (A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726 

4 0 CACGTGAACA CATCTTGTTA TCTCGTAACG TAGGTGTTCC TTACATCGTT 50 

GTATTCTTAA ACAAAATGGA TATGGTTGAT GACGAAGAAT TATTAGAATT 100 

AGTAGAAATG GAAGTTCGTG ACTTATTATC AGAATACGAT TTCCCAGGCG 150 

ATGACACTCC AGTTGTTGCA GGTTCTGCTT TGAAAGCTTT AGAAGGCGAC 200 

GCTTCTTACG AAGAAAAAAT CATGGAATTA ATGGCTGCAG TTGACGAGTA 250 

45 CATCCCAACT CCAACTCGTG ACACTGACAA ACCATTCATG ATGCCAGTTG 300 

AGGATGTATT CTCAATCACT GGACGTGGTA CTGTTGCTAC AGGTCGTGTT 350 

GAACGTGGAC AAGTTCGCGT TGGTGACGTT GTAGATATCG TTGGTATCGC 400 

TGACGAAACT GCTCAAACAA CTGTAACAGG TGTTGAAATG TTCCGTAAAT 450 

TATTAGACTA CGCTGAAGCA GGCGATAACA TCGGTGCTTT ATTACGTGGT 500 

50 GTTGCTCGTG AAGACATCCA ACGTGGACAA GTTTTAGCTA AACCAGCTTC 550 

AATCACTCCA CATACAAAAT TCTCTGCTGA AGTATACGTA TTAAGCAAAG 600 

AAGAAGGTGG ACGTCATACT CCA 623 



55 

2) INFORMATION FOR SEQ ID NO: 727 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 646 bases 
60 (B) TYPE: Nucleic acid 

414 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 



CGGTCCTATG~CCTCAAACTC~GTGAACACAT~CTTGTTATCA— CGTAACGTTG 50- 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

15 ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 300 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 4 00 

20 GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 4 50 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA TTCTTC 64 6 



25 



40 



60 



2) INFORMATION FOR SEQ ID NO: 728 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma pirum 

(B) STRAIN: ATCC 25960D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 8 



GAGCAATTTT AGTTGTTTCT GCAACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACATATCT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

45 CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

CTGAAGAAGT AAAAGACTTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 200 

ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 250 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 300 

CTGATCCAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 350 

50 GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 4 00 

TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TGTTGTAACA GGTATTGAAA TGTTCCGTAA AGAATTAGAT 500 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

TGGTCAAGTA CAACGTGGTC AAGTTCTTGC TAAACCAGGT TCTATTACTC 600 

55 CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 650 

GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 700 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 750 

TTATGCCTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 800 

GCTTGTAAAA AGGAAGTAAG TTT 823 
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2) INFORMATION FOR SEQ ID NO: 729 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma salivarium 
15 (B) STRAIN: ATCC 23064 • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729 

GGAGCAATCT TAGTTGTTGC TGCAACTGAT GGTGCGATGC CTCAAACTCG 50 

20 TGAACACGTT TTACTTGCAA AACAAGTTGG TGTTCCTAAA ATCGTTGTTT 100 

TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 150 

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 200 

TGGTGACAAT ACTCCTATTA TTGCTGGTTC AGCATTAAAA GCATTACAAG 250 

GTGATCCAGA ATATGAAAAG AATATTCTAG AACTTATGGA TGCAGTAGAC 300 

25 ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 350 

TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 400 

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450 

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 500 

AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 550 

30 GAATTGATAG AGATCAAGTA GAACGTGGAC AAGTTTTGGC CAAACCAAAA 600 

AGTATTATTC CTCACACAGA ATTTGAAGCT GCTGTGTATG TTCTAAAAGC 650 

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 700 

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 750 

CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800 

35 TGCTCCTATT GCAGTTGAAG AAGGAA 826 



40 



2) INFORMATION FOR SEQ ID NO: 730 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730 

55 TGGTATGTTC CGCAGCYGAY GGYCCTATGC CTCAAACTCG CGAACACATC 50 

CTGYTGGCTC GCCAAGTAGG YGTACCTTAC ATCATCGTRT TCATGAACAA 100 

ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT GAAATGGAAA 150 

TCCGYGACCT GYTGTCMAGC TACGACTTCC CMGGCGACGA CTGCCCAATC 200 

GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGYGAYGCTG GTTACGAAGA 250 

60 GAAAATCTTC GAAYTGGCTG CTGCTTTGGA CAGCTACATC CCAACTCCTG 300 

416 
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GTGAGAACGT 
AAGGTCTGCG 



PCT/CA00/01150 



GGACAAACCK TTCYTGYTGC CTATCGAAGA 
GYGGTACMGT AGTAACYGGY CGTGTAGAGC 
GACGAGATYG AAATCGTAGG TCTGAAAGAA 
CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 
ACGTAGGCGT ATTGYTGCGT GGTACCAARC 
CAAGTATTGG CYAAACCAGG YACCATYACT 
AGAAGTATAC GTACTGAGCA AAGAAGAGGG 
TCGCYAACTA CCGYCCMCAA TTCTACTTCC 
GCAGTTACTT TGGAAGAAGG TGTAGAAATG 
AACCATTACY GTAGAACTGA TTGCGCCTAT 



CGTRTTCTCT 
GCGGTRTCAT 
ACCCAAAAAA 
CGAAGGTCAA 
GTGAAGACGT 
CCSCACACCA 
TGGTCGYCAC 
GTACYACTGA 
GTAATGCCAG 
CGCTAGGAAG 



350 
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2) INFORMATION FOR SEQ ID NO: 731 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 10 0 

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 200 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300 

GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350 

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550 

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700 
ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG . 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800 

CGACGGTCTG CGT 813 



2) INFORMATION FOR SEQ ID NO: 732 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella 
serotype Gallinarum 
5 (B) STRAIN: ATCC 9184 

(xi) SEQUENCE DESCRIPTION: SEQ 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG 

10 CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA 
AATGCGACAT GGTTGATGAC GAAGAGCTGC 

GTTCGYGAAC"TGCTGTCTCA~GTACGACTTC 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA 
CGAAAATCAT CGAACTGGCT GGCTTCCTGG 

15 GAGCGTGCGA TTGACAAGCC GTTCCTGCTG' 
CATCTCCGGT CGTGGTACCG TTGTTACCGG 
TCAAAGTGGG CGAAGAAGTT GAAATCGTTG 
TCTACCTGTA CTGGCGTTGA AATGTTCCGC 
TGCCGGTGAG AACGTAGGTG TTCTGCTGCG 

20 TCGAACGTGG TCAGGTACTG GCTAAGCCGG 
AAGTTCGAAT CTGAAGTGTA CATTCTGTCC 
CACTCCGTTC TTCAAAGGCT ACCGTCCGCA 
ACGTGACTGG CACCATCGAA CTGCCGGAAG 
GGCGACAACA TCAAAATGGT TGTTACCCTG 

25 CGACGGTCTG CG 



choleraesuis subsp. choleraesuis 



ID NO: 732 



CCGCAGACCC 


GTGAGCACAT 


50 


CATCATCGTG 


TTCCTGAACA 


100 


TGGAACTGGT 


TGAAATGGAA 


150 


-eeGGGCGACG- 


-ACACTCCGAT 


200 


AGGCGACGCA 


GAGTGGGAAG 


250 


ATTCTTACAT 


CCCGGAACCA 


300 


CCGATCGAAG 


ACGTATTCTC 


350 


TCGTGTAGAA 


CGCGGTATCA 


400 


GTATCAAAGA 


GACTCAGAAG 


450 


AAACTGCTGG 


ACGAAGGCCG 


500 


TGGTATCAAA 


CGTGAAGAAA 


550 


GCACCATCAA 


GCCGCACACC 


600 


AAAGATGAAG 


GCGGCCGTCA 


650 


GTTCTACTTC 


CGTACTACTG 


700 


GCGTAGAGAT 


GGTAATGCCG 


750 


ATCCACCCGA 


TCGCAATGGA 


800 






812 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 733 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. 
serotype Paratyphi B 

(B) STRAIN: ATCC 8759 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733 



choleraesuis 



60 



TGGTTGTTGC TGCGACTGAC 
CTGCTGGGTC GTCAGGTAGG 
ATGCGACATG GTTGATGACG 
TTCGYGAACT GCTGTCTCAG 
GTTCGTGGTT CTGCTCTGAA 
GAAAATCATC GAACTGGCTG 
AGCGTGCGAT TGACAAGCCG 
ATCTCTGGTC GTGGTACCGT 
CAAAGTGGGC GAAGAAGTTG 
CTACCTGTAC TGGCGTTGAA 
GCYGGTGAGA ACGTAGGTGT 
CGAACGTGGT CAGGTACTGG 
AGTTCGAATC TGAAGTGTAC 
ACTCCGTTCT TCAAAGGCTA 
CGTGACTGGC ACCATCGAAC 



GGYCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
GCTTCCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATTCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
41 



CGCAGACCCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGACGCAG 
TTCTTACATT 
CGATCGAAGA 
CGTGTAGAAC 
TATCAAAGAG 
AACTGCTGGA 
GGTATCAAAC 
CACCATCAAG 
AAGATGAAGG 
TTCTACTTCC 
CGTAGAGATG 
8 



TGAGCACATC 


50 


TCCTGAACAA 


100 


GAAATGGAAG 


150 


CACTCCGATC 


200 


AGTGGGAAGC 


250 


CCGGAACCAG 


300 


CGTATTCTCC 


350 


GCGGTATCAT 


400 


ACTCAGAAGT 


450 


CGAAGGCCGT 


500 


GTGAAGAAAT 


550 


CCGCACACCA 


600 


CGGCCGTCAT 


650 


GTACTACTGA 


700 


GTAATGCCGG 


750 
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GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCAATGGAC 80tf 
GACGGTCTGC GTTT 814 



2) INFORMATION FOR SEQ ID NO: 734 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) — TOPOLOGY": -Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

25 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

30 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600 

35 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCA 828 

40 



2) INFORMATION FOR SEQ ID NO: 735 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimesii 
55 (B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 
60 TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTTC ATCATCGTAT 100 
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TCATGAACAA 
GAAATGGAAG 
CCTGCCAGTT 
AGTGGGAAGC 
5 CCAGAACCAG 
CGTATTCTCC 
GCGGTATCGT 
ACCGTTAAGT 
CGAAGGCCGT 
10 GTGAAGACAT 
CCACACACCA 



PCT/CA00/01150 



TGGTCGTCAC" 
GTACAACTGA 
GTAATGCCAG 
15 CGCGATGGAC 



ATGCGACATG 
TTCGTGAACT 
GTTCGTGGTT 
TAAAATCATC 
AGCGTGCTAT 
ATCTCCGGYC 
TAAAGTTGGC 
CTACCTGTAC 
GCTGGTGAGA 
CGAACGTGGT 
AATTCGACTC 
ACTCCATTCT 
CGTGACCGGT 
GCGATAACGT 
GACGGTCTGC 



GTTGATGATG 

TCTGTCTGCT 

CAGCGCTGAA 

GAACTGGCTG 

CGACAAGCCG 

GTGGTACYGT 

GAAGAAGTTG 

TGGCGTTGAA 

ACGTAGGTGT 

CAGGTACTGG 

AGAAGTTTAC 

TCAAAGGCTA- 

ACCATCGAAC 

GAACATGGTT 

GTTTC 



AAGAGCTGCT 

TATGACTTCC 

AGCACTGGAA 

GCTACCTGGA 

TTCCTGCTGC 

AGTTACCGGT 

AAATCGTTGG 

ATGTTCCGCA 

TCTGCTGCGT 

CTAAACCAGG 

ATCCTGAGCA 

CCGTCCACAG- 

TGCCAGAAGG 

GTAACCCTGA 



GGAACTGGTA 

CTGGTGATGA 

GGCGAAGCTG 

TTCTTACATC 

CAATCGAAGA 

CGTGTAGAGC 

TATCAAAGAC 

AACTGCTGGA 

GGTATCAAGC 

TTCAATCAAG 

AAGAAGAAGG 

•TTGTAC-T-TGG- 

CGTAGAGATG 

TTCACCCAAT 



1*50 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
-700 
750 
800 
825 



20 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 736 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736 



35 TATTAGTTTG 
ATACTATTAT 
CAAATGTGAC 
AAGTAAGAGA 
ATAGTAAGAG 

40 GGGAGATAAG 
CTCCAGAGAG 
TTCTCAATCA 
AGTACTAAAA 
CAAGAAAAGT 

45 CAAGCACAAG 
AAACGAGATA 
CACACACAAA 
GGAAGACATA 
AACAACAGAC 

50 TAATGCCTGG 



TTCAGCAACA 
CAAGACAAGT 
ATGGTAGATG 
TTTATTAACA 
GTTCAGCATT 
ATAGTAGAAT 
AGATACAGAT 
CAGGAAGAGG 
GTACAAGACG 
AGTAGTAACA 
CAGGGGATAA 
GAAAGAGGAC 
GTTTACAGCA 
CACCATTCTT 
GTAACAGGAG 
AGATAACGTA 



GATGGACCAA 
TGGAGTACCA 
ATGAAGAGTT 
GAATATGATT 
AATGGCATTA 
TATTCGAGCA 
AAACCATTCT 
AACAGTTGCA 
AAGTAGAKTT 
GGAGTAGAGA 
TATAGGAGCA 
AAGTACTAGC 
GAAGTATATG 
TGATGGATAT 
CTTGTAAGTT 
ACAATGGAAG 



TGCCACAAAC 
TATATAGTAG 
ATTAGAGTTA 
TCCCAGGAGA 
GAAGATCCAA 
AATAGATGAG 
TAATGCCAGT 
ACAGGAAGAG 
AGTAGGATTA 
TGTTCAGAAA 
TTATTAAGAG 
AAAGACTGGA 
TACTTAAAAA 
AGACCACAAT 
ACCAGAAGGA 
TAGACTTAAT 



AAGAGAGCAT 
TATTCTTAAA 
GTAGAGATGG 
TGACACTCCA 
AGAGYGAGTG 
TATATACCAG 
AGAGGACGTA 
TGGAAAGAGG 
ACAGAAGCAC 
ATTATTAGAC 
GAGTACAAAG 
TCAGTAAAGG 
AGAAGARGGT 
TCTATTTCAG 
ATAGAGATGG 
AAACTCAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
798 



2) INFORMATION FOR SEQ ID NO: 737 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

420 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737 

10 GTCAACATGA TGGAGCTCAT CAACAACATC GCGAAGGAGC ACGGCGGTTA 50 
CTCCGTGTTC GCGGGCGTGG GCGAGCGTAC CCGTGAAGGG AACGACTTCT 100 



ACCACGAAAT "G AAGGACTCG~AACGTTCTCG-AeAAGGTCGC-GC-TGGTGTAC 150 

GGCCAGATGA ACGAGCCGCC GGGCAACCGT CTGCGCGTGG CGCTGACGGG 200 

CCTCACGATG GCCGAGCACT TCCGTGACGA AGGCCTCGAC GTGCTGTTCT 250 

15 TCGTCGACAA CATCTACCGT TTCACGCTGG < CCGGTACCGA AGTGTCGGCG 300 

CTGCTCGGCC GTATGCCGTC GGCAGTGGGC TATCAGCCGA CGCTGGCTGA 350 

AGAAATGGGC AAGCTGCAAG AGCGCATCAC GTCGACGAAG AAGGGCTCGA 400 

TCACGTCGGT T 411 



20 



35 



2) INFORMATION FOR SEQ ID NO: 738 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium bifermentans 

(B) STRAIN: ATCC 638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738 



TACAAGAGCT TATTAACAAT ATAGCTACTC AACACGGTGG TATATCAGTA 50 

TTCGCAGGTG TTGGAGAGAG AACAAGAGAA GGTAACGACT TATTCCATGA 100 

4 0 GATGAGCGAT ACAGGAGTTA TAAATAAAAC AGCTCTAGTA TTCGGACAAA 150 

TGAATGAGCC ACCTGGAGCA AGAATGAGAG TTGCTTTAAC TGGTCTTACA 200 

ATGGCTGAAT ACTTCAGAGA TCAACAAGGG CAAGACGTTT TATTATTCGT 250 

AGATAATATA TTCCGTTTCA CTCAAGCAGG ATCTGAGGTT TCTGCACTTC 300 

TTGGACGTAC TCCATCAGCA GTTGGATACC AACCAACATT AGCAACAGAG 350 

45 ATGGGTAGAT TACAAGAGAG AATAACATCT ACAAATAAAG GGTC 394 



50 



2) INFORMATION FOR SEQ ID NO: 739 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Clostridium beijerincki (deposited as 
Clostridium butyricum) 

(B) STRAIN: ATCC 8260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739 



TTAATAAACA ACATAGCTAA ACAACATGGT 
AGTTGGTGAA AGATCAAGAG AAGGTAATGA 
AGTCAGGAGT TATTGATAAG ACAGCATTAG 
CCACCGGGTG CCAGAATGAG AGTTGCATTA 
GTATTTTAGA GATAAAGGTC AAGATGTGTT 
T"CAGATATAC~TCAAGCAGGT~TCAGAGGTTT~ 
CCTTCAGCGG TTGGATATCA GCCAACACTT 
TCAGGAAAGA ATTACATCAA CAGTTAATGG 



GGTTTATCAG TATTTACTGG 50 

CTTATATCAT GAAATGAGAG 100 

TATTTGGACA AATGAATGAG 150 

ACAGGTCTTA CTATGGCAGA 200 

ACTATTCATA GATAACATAT 250 

CAGCATTACT-TGGAAGAACA 300 

GCAACTGAAA TGGGTGCACT 350 

TTCTATTACG TCAG 394 



2) INFORMATION FOR SEQ ID NO: 740 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 
30 (B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 

TTATAAACAA TATTGCTAAG CAACATGGTG GTATTTCTGT ATTTTCAGGA 50 

35 GTAGGAGAAA GAACAAGAGA AGGTAACGAC CTTTATGGCG AAATGAGTGA 100 

GTCTGGAGTT ATAAATAAAA CAGCTCTAGT ATTTGGTCAA ATGAATGAAC 150 

CACCTGGAGC GAGAATGAGA GTTGCTTTAA CTGGACTTAC AATGGCAGAA 200 

CATTTTAGAG ATGAGCAAGG ACAAGACGTT TTACTTTTCG TTGATAATAT 250 

ATTCCGTTTC ACACAAGCTG GTTCAGAAGT TTCAGCACTT CTAGGACGTA 300 

40 TGCCATCAGC TGTTGGTTAT CAGCCAACAT TAGCTACTGA AATGGGTGCA 350 

CTTCAAGAGA GAATAACATC AACTAAGAAA GGTTCAATAA CAT 393 



45 2) INFORMATION FOR SEQ ID NO: 741 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium ramosum 

(B) STRAIN: ATCC 25582 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 

422 
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TTGATTCAAG AATTCATTAA TAACATTGCT ACAGAACATG GTGGTTTATC 50 

AGTTTTTGCT GGAGTTGGTG AACGTAGCCG TGAAGGTAAT GATTTATATT 100 

ATGAAATGAA GGAAAGTGGT GTTTTATCTA AAACAACACT AGTATTTGGA 150 

CAGATGAATG AACCCCCAGG AGCTCGTTTA AGAGTTGCTT TAACGGGTCT 200 

TACTATGGCA GAAGAATTCC GTGATGAACA AGGTCAGGAT GTCTTATTAT 250 

TCATCGATAA TATTTTCCGT TTTACTCAAG CTGGATCTGA AGTATCTGCC 300 

TTACTTGGAC GGGTACCATC ACAAGCTGGG TATCAGCCAA CTTTAGCAAC 350 

CGAAATGGGT GCTTTACAAG AACGGATTAC ATCAACTAAA AAAGGATC 398 



2) INFORMATION FOR SEQ~ID NO": ~7"4'2 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12964 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 



TAGCTAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

30 AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAATCT CCAGGAGCTA 150 

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 200 

CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTTACTCA 250 

AGCTGGATCA GAAGTATCGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 300 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 350 

35 ACTTCAACTA AAAATGGATC AATAACTTCA 380 



2) INFORMATION FOR SEQ ID NO: 743 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743 

55 TTAATAAATA ATATAGCAAA AGAGCATGGT GGTCTTTCTG TATTTACAGG 50 

AGTTGGAGAA AGGTCAAGAG AAGGTAACGA CTTATATTAT GAAATGAAAG 100 

AGTCAGGGGT TATAGATAAG ACAGCTTTAG TATTTGGACA AATGAATGAA 150 

TCACCAGGAG CAAGAATGAG AGTTTCATTA ACTGGATTAA CTATGGCTGA 200 

ATATTTTAGA GATCAAGGTC AAGACGTTCT TTTATTTATA GATAATATAT 250 

60 TTAGATTTAC TCAAGCGGGA TCAGAAGTTT CTGCGTTATT AGGAAGAATT 300 

423 
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CCTTCAGCAG TTGGATATCA ACCAACTCTT GCAACTGAAA TGGGAGCACT 3 St) 

TCAAGAAAGA ATAACATCAA CAAAGAATGG ATCAATCAC 389 



2) INFORMATION FOR SEQ ID NO: 744 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 
10 (B) TYPE: Nucleic acid 

CO STRANDEDNESS.:- Double- 

(D) TOPOLOGY: Linear 



15 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

(B) STRAIN: ATCC 15668 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 

TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 100 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 150 

25 CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 200 

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 250 

CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGACGAGCT 300 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 350 

TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTCTGTTCGG TGGCGCCGGT 4 00 

30 GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 450 

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 500 

GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGCGCTGAC CGGCCTGACC ATGGCCGAAG 650 

35 CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 700 

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 750 

TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 800 

AAGAGCGCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 



2) INFORMATION FOR SEQ ID NO: 745 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscl eroma tis 
55 (B) STRAIN: ATCC 13884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 
60 TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100 

424 
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TCGCCATGGG TTCTTCTGAT GGTCTGCGCC GCGGTCTGGA TGT)*AAAGAC 450 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200 

CATGAACGTA CTGGGTCAAC CGGTTGACAT GAAAGGCGAC ATCGGCGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CGTCCTATGA AGAGCTGTCC 300 

5 AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCC AAGGGCGGTA AAGTTGGTCT GTTCGGCGGT GCGGGTGTAG 4 00 

GTAAAACTGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

10 TGGTGTACGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT ,650 



ACTGCTGTTC~GTCGATAACA~TCTATCGTTA~CACCCTGGCC"GGTACTGAAG 700 

TATCCGCGCT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACC 750 
CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 
15 CGGTTCTATC ACCTCCGTA • 819 



2) INFORMATION FOR SEQ ID NO: 746 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Neisseria canis 

(B) STRAIN: ATCC 14687 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746 

35 GCGATTCTAT TCCGCGCGTA TATGATGCTC TTAAACTAGT GGATAGAGAA 50 

CTGACGCTTG AAGTACAACA ACAGTTGGGT GATGGTGTCG TTCGTACTAT 100 

TGCGATGGGT AGTTCCGACG GCCTCAAACG AGGTTTGGCG GTAGTTAACA 150 

CCGGTGCTCC AATTACAGTG CCTGTGGGCA AAGCAACATT AGGCCGTATT 2 00 

ATGGACGTAT TAGGTAATCC GGTTGATGAA GCTGGTCCGA TTGGCTCCGA 250 

40 GCAAACCCGA GCAATCCACC AACCTGCTCC TAAGTTCGAC GAGCTTTCTA 3 00 

GCGCCACAGA GATTTTGGAA ACAGGTATTA AAGTAATTGA TTTGCTTTGC 350 

CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG TTTGGTGGTG CGGGAGTGGG 4 00 

CAAAACCGTA AATATGATGG AGTTGATTAA CAACATCGCG AAAGCACACA 450 

GCGGTTTGTC TGTATTTGCC GGTGTGGGTG AACGGACGCG TGAAGGTAAT 500 

45 GACTTTTATC ATGAGATGAA AGATTCCAAT GTATTAGATA AAGTAGCCAT 550 
GGTTTACGGT CAGATGAACG AGCCTCCCGG TAACCGTTTG CGCGTTGCGC ■ 600 

TAACTGGCTT GTCTATGGCC GAATTCTTCC GTGACGAGAA AGATGAAAAC 650 

GGTAAAGGCC GTGATGTATT GTTCTTTGTA GACAATATTT ACCGCTATAC 7 00 

CCTAGCCGGT ACAGAAGTAT CTGCATTGCT TGGCCGTATG CCTTCGGCAG 750 

50 TAGGTTATCA GCCGACGTTG GCAGAGGAAA TGGGCCGCTT GCAAGAGCGT 800 

ATTACYTCAM CCCAAACAGG CTCT 824 



55 2) INFORMATION FOR SEQ ID NO: 747 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 

425 
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WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747 
CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 



TCTGACTCTG GAGGTTCAAC _ AGCTTCTGGG~CGACGGCGTT _ GTCCGTA"CTA 100" 

TTGCAATGGG TAGTTCAGAC GGCCTTAAAC GCGGTATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 200 

15 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

ACAAAAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 300 

TCAGCTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAAGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

20 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAGCGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGC CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

25 CTTTGGCCGG TACTGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACCG GTTCCATTAC T 831 



30 



40 



2) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 862 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 14688 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748 

CCGTGGCCAA GTACCACAAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50 

AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGTACC 100 

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150 

50 CTCTGGTGCA CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 200 

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 250 

CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 300 

AAACTCAACT GAGCTGCTAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 350 

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 400 

55 GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 450 

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 500 

ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550 

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650 

60 TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGCGTGA TGTTCTATTG 700 

426 
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TTCGTTGATA ACATCTATCG 
ACTTCTAGGT CGTATGCCAT 
AAGAAATGGG TGCGTTGCAA 
ATCACTTCGG GG 



TTACACACTG GCTGGTACGG 
CAGCAGTAGG TTATCAACCG 
GAGCGTATTA CCTCAACGCA 



PCT/CAOO/01150 

AAGTGTCAGC 750' 
ACTCTGGCTG 800 
ATCGGGTTCG 850 
862 
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2 ) INFORMATION FOR SEQ ID NO: 749 

10 (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 bases 
^g j TYPET Nucl~ei~c~aeid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 
20 (B) STRAIN: ATCC 25295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749 

GGAACTCCCA CGTGACGCTA TCCCGCATGT TTTTGATGCA TTAAAATTAG 50 

25 TTGAAAATGA CCTAACCTTA GAAGTTCAAC AACTTTTGGG GGATGGTGTA 100 

GTGCGTACCA TTGCGATGGG TAGTTCAGAT GGATTAAAGC GTGGTATGGC 150 

TGTGAATAAT ACCGGAGCTC CGATTACTGT TCCTGTTGGC CGTGAAACTT 200 

TGGGTCGTAT CATGGATGTA TTGGGTAATC CGGTTGATGA GGCAGGTCCG 250 

GTAAATGCAT CCAATACACG TGCGATCCAT CAAGAGGCTC CTAAGTTTGA 300 

30 TGAGCTTTCT TCAACAACGG AATTATTAGA AACTGGCATT AAGGTTATCG 350 

ACTTGTTATG TCCGTTTGCC AAAGGTGGTA AAGTAGGTCT GTTTGGTGGT 400 

GCGGGTGTAG GTAAAACCGT AAATATGATG GAGTTAATTA ACAACATTGC 450 

CAAGGCACAT AGCGGTTTGT CTGTGTTTGC AGGCGTGGGT GAACGTACTC 500 

GTGAAGGTAA TGACTTCTAT CACGAGATGA AAGATTCCAA CGTATTGGAC 550 

35 AAAGTGGCAA TGGTTTACGG TCAGATGAAC GAACCTCCAG GCAACCGTCT 600 

GCGCGTTGCT TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA 650 

AAGACGAAAA CGGTAAAGGT CGCGACGTAT TGTTCTTCGT GGACAACATT 700 

TACCGTTACA CTTTGGCCGG TACGGAAGTA TCCGCATTGC TGGGTCGTAT 750 

GCCTTCAGCA GTAGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT 800 

40 TGCAAGAGCG TATTACCTCT ACCCAGACAG GCTCTATTAC TTCC 844 



2) INFORMATION FOR SEQ ID NO: 750 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 

60 CGCGACGCTA TTCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

427 
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TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACCGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGCCGTAT 200 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

5 ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA TGAGTTGTCT 300 

TCAACTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

AGCGGCCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

10 CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG TCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT • 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGATGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGTCGTAT GCCTTCTGCA 750 

15 GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAGGAGCG 800 

TATTACTTCT ACGCAAACCG GTTCCATTAC TTCC 834 



20 2) INFORMATION FOR SEQ ID NO: 751 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear , 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 31426 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 

35 

CGCGACATGA 
TCTGACTTTG 
TCGCTATGGG 
ACTGGTTCGC 
40 TGTCGATGTA 
ACAAGAGTCG 
TCCACAACCG 
TCCGTTTGCC 
GTAAAACCGT 
45 AGCGGCTTGT 
CGACTTCTAC 
TGGTGTATGG 
TTGACCGGTT 
CGGTAAAGGT 
50 CTCTGGCCGG 
GTGGGTTACC 
TATTACCTCT 



TTCCGCGCGT 
GAAGTCCAAC 
CAGCTCGGAC 
CCATTACTGT 
TTGGGAACTC 
TGCCATCCAC 
AATTGCTCGA 
AAAGGCGGTA 
GAACATGATG 
CCGTGTTCTC 
CACGAGATGA 
CCAAATGAAC 
TGACTATGGC 
CGTGACGTAT 
TACCGAAGTA 
AACCGACATT 
ACCCAAACCG 



TTACGACGCT 
AGCTTTTGGG 
GGTTTGAAAC 
GCCGGTAGGT 
CTGTTGACGA 
CAAGCCGCTC 
AACGGGCATT 
AAGTAGGTCT 
GAATTGATCA 
AGGCGTGGGT 
AAGATTCCAA 
GAACCTCCAG 
TGAATACTTC 
TGTTCTTCGT 
TCCGCACTGT 
GGCTGAAGAA 
GTTCCATTAC 



TTGAAATTAG 
TGATGGCGTA 
GCGGCATGAC 
AAAGGTACGT 
GGCAGGTCCA 
CTAAGTTTGA 
AAAGTGATTG 
GTTCGGCGGT 
ACAACATCGC 
GAGCGTACCC 
CGTATTGGAT 
GCAACCGTCT 
CGTGACGAAA 
TGACAACATC 
TGGGCCGTAT 
ATGGGTCGTT 
TTCC 



ACGAAAACGG 50 

GTCCGTACCA 100 

TGTGAGCAAT 150 

TGGGACGCAT 200 

ATTGATACCG 250 

CGAACTGTCT 300 

ACTTGCTGTG 350 

GCCGGTGTGG 400 

CAAAGCGCAC 450 

GCGAAGGTAA 500 

AAAGTAGCCA 550 

GCGCGTTGCT 600 

AAGACGAAAA 650 

TACCGTTACA 700 

GCCTTCTGCA 750 

TGCAAGAGCG 800 
834 



55 

2) INFORMATION FOR SEQ ID NO: 752 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 bases 
60 ' (B) TYPE: Nucleic acid 

428 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 27628 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 



ACATGATTCC 'GCGCCTTTAC -GACGCTTTGA— AATTAGAGGA— AAAGGGTG-TG 50- 

ACTTTGGAAG TCCAACAGCT TTTGGGTGAT GGCGTAGTCC GTACCATCGC 100 

TATGGGCAGC TCGGACGGTT TGAAACGCGG CATGACTGTG AGCAATACTG 150 

15 GTTCGCCCAT TACTGTGCCG GTAGGTAAAG, GTACGTTGGG ACGCATTGTC 200 

GATGTATTGG GAACGCCTGT TGACGAGGCA GGTCCAATTG ATACCGACAA 250 

GAGTCGTGCC ATCCACCAAG CCGCTCCTAA GTTTGACGAA CTGTCTTCCA 300 

CAACCGAATT GCTCGAAACG GGCATTAAAG TGATTGACTT GCTGTGTCCG 350 

TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC GGCGGTGCCG GTGTGGGTAA 400 

2 0 AACCGTGAAC ATGATGGAAT TGATCAACAA CATCGCCAAA GCGCACAGCG 450 

GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC GTACCCGCGA AGGTAACGAC 500 

TTCTACCACG AGATGAAAGA TTCCAACGTA TTGGATAAAG TAGCCATGGT 550 

GTATGGCCAA ATGAACGAAC CTCCAGGCAA CCGTCTGCGC GTTGCTTTGA 600 

CCGGTTTGAC TATGGCTGAA TACTTCCGTG ACGAAAAAGA CGAAAACGGT 650 

2 5 AAAGGTCGTG ACGTATTGTT CTTCGTTGAC AACATCTACC GTTACACTCT 700 

GGCCGGTACC GAAGTATCCG CACTGTTGGG CCGTATGCCT TCTGCAGTGG 750 

GTTACCAACC GACATTGGCT GAAGAAATGG GTCGTTTGCA AGAGCGTATT 800 

ACCTCTACCC AAACCGGTTC CATTA 825 



30 



45 



2) INFORMATION FOR SEQ ID NO: 753 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AGAACGGTCT 50 

GACCCTGGAA GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

50 CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 150 

GGTGCGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 200 

CGACGTATTG GGTACACCTG TTGACGAAGC AGGTCCGATC GATACCGACA 250 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 300 

ACTACCGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 350 

55 GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 4 00 

AAACCGTGAA CATGATGGAA TTGATCAACA ACATCGCCAA AGCGCACAGC 450 

GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 500 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

60 ACCGGTTTGA CGATGGCCGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650 

429 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CAAAGGCCGC GACGTATTGT 
TGGCCGGTAC CGAAGTATCC 
GGTTACCAAC CGACATTGGC 
TACCTCTACC CAAACCGGTT 



TCTTCGTGGA CAACATCTAC 
GCACTGTTGG GCCGTATGCC 
TGAAGAAATG GGTCGTTTGC 
CCATTACTTC C 



CGTTACACCC 70 0 

TTCCGCAGTG 750 
AAGAGCGTAT 800 
831 



2) INFORMATION FOR SEQ ID NO: 754 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 bases 

(B) TYPE : — Nucleic _ acrd _ 



15 



45 



(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear, 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria meningitidis 
20 (B) STRAIN: 2241C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 

CCACGCGACA TGATTCCGCG CGTTTACGAC GCTTTGAAAT TAGACGAAAA 50 

25 CGGTCTGACT TTGGAAGTCC AACAGCTTTT GGGCGACGGC GTAGTCCGTA 100 

CCATTGCGAT GGGCAGCTCG GACGGTTTGA AACGCGGCAT GACTGTGAGC 150 

AATACCGGTG CGCCCATTAC TGTGCCGGTA GGTAAAGGTA CGTTGGGACG 200 

CATTGTCGAT GTATTGGGAA CGCCTGTTGA CGAGGCAGGT CCAATCGATA 250 

CCGACAAGAG CCGTGCCATC CACCAAGCCG CTCCTAAGTT TGACGAACTG 300 

30 TCTTCCACAA CCGAATTGCT CGAAACGGGC ATTAAAGTGA TTGACTTGCT 350 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGC GGTGCCGGTG 400 

TGGGTAAAAC CGTGAACATG ATGGAATTGA TCAACAACAT CGCCAAAGCG 450 

CACAGCGGCT TGTCCGTGTT CGCAGGCGTG GGTGAGCGTA CCCGCGAAGG 50 0 

TAACGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GATAAAGTGG 550 

35 CAATGGTTTA CGGTCAGATG AACGAACCTC CGGGCAACCG TTTGCGCGTC 600 

GCATTGACCG GTTTGACCAT GGCGGAATAC TTCCGTGACG AAAAAGACGA 650 

AAACGGCAAA GGCCGCGACG TATTGTTCTT CGTGGACAAC ATCTACCGTT 700 

ACACTCTAGC TGGTACCGAA GTATCCGCAT TGTTGGGCCG TATGCCGTCT 750 

GCAGTGGGCT ACCAACCGAC ATTGGCAGAA GAAATGGGTC GTTTGCAGGA 800 

4 0 GCGTATTACC TCTACCCAAA CCGGTTCTAT TACTTC 836 



2) INFORMATION FOR SEQ ID NO: 755 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 

60 CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50 

430 
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CGGCCTGACT TTGGAAGTGC AACAGCTTCT GGGTGACGGC GTAGTTCGTA £00 

CTATTGCAAT GGGTAGTTCG GATGGTTTGA AACGCGGCAT GACTGTAAGC 150 

AATACAGGTG CGCCGATTAC AGTACCGGTA GGTAAAGGTA CTTTGGGACG 200 

TATTGTCGAT GTATTGGGTA CGCCTGTTGA CGAAGCAGGT CCGATTGATA 250 

5 CCGACAAACA CCGTGCCATC CATCAGACAG CTCCGAAATT TGATGAACTG 300 

TCTGCGACTA CTGAGCTGCT GGAAACCGGC ATTAAAGTGA TTGACTTGCT 350 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGT GGTGCCGGTG 400 

TAGGCAAAAC CGTCAACATG ATGGAATTAA TTAACAACAT CGCCAAAGCA 450 

CATAGCGGTT TGTCCGTGTT TGCAGGTGTG GGTGAGCGTA CCCGTGAAGG 500 

10 TAATGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GACAAAGTGG 550 

CGATGGTTTA CGGTCAGATG AACGAACCTC CGGGTAACCG TCTGCGTGTA 600 

GCCTTGAeCG-GTTTGAGGAT- -GGGCGAA-TAG- -TTGCGTGATG-AAAAAGACGA 6 5.0 

AAGCGGCAAA GGCCGTGACG TATTGTTCTT CGTGGACAAT ATCTACCGTT 700 

ACACCCTGGC CGGTACCGAA GTATCCGCAT TGTTGGGTCG TATGCCTTCA 750 

15 GCAGTAGGTT ACCAACCGAC ATTGGCTGAA. GAAATGGGTC GTTTGCAAGA 800 

GCGTATTACC TCTACCCAAA CAGGCTCCAT TACCTCC 837 



20 2) INFORMATION FOR SEQ ID NO: 756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria subflava 

(B) STRAIN: ATCC 14221 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 200 

4 0 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 3 00 

TCAACTACCG AATTGCTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

4 5 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

50 CTCTGGCCGG TACCGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACTG GTTCCATTAC TTCC 834 



55 

2) INFORMATION FOR SEQ ID NO: 757 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
60 (B) TYPE: Nucleic acid 

431 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria weaver i 

(B) STRAIN: ATCC 51223 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 



CCCGTGATGC~TATTCCCATG "TATACGATGG-CCTGAAATTG- GTAGA-T-AAGG SO 

ATCTGACCCT GGAAGTGCAA CAACTTTTAG GTGATGGTGT GGTTCGTACC 100 

ATTGCAATGG GTAGTTCAGA CGGCCTAAAA CGTGGTATGG CTGTTAACAA 150 

15 TACCGGCGCT CCGATTACTG TTCCGGTGGG GAAAGCCACC TTGGGACGTA 200 

TTATGGATGT GTTGGGTAAT CCGGTTGATG AAGCAGGTCC TGTTGTATCA 250 

GAAGAAACTC GCGCTATTCA TCAAGCTGCC CCTAAATTTG ACGAACTGTC 300 

TTCAGCAACT GAGTTGTTGG AAACAGGCAT TAAAGTAATT GACTTGCTGT 350 

GCCCGTTTGC CAAAGGTGGT AAAGTAGGTT TGTTTGGTGG TGCCGGCGTG 400 

20 GGTAAAACCG TAAATATGAT GGAGTTGATC AACAACATCG CGAAGGCACA 450 

TAGTGGTTTG TCTGTATTCG CCGGTGTAGG TGAGCGTACC CGTGAAGGTA 500 

ACGACTTCTA CCATGAAATG AAAGACTCTA ACGTATTGGA TAAAGTAGCC 550 

ATGGTTTATG GCCAGATGAA TGAACCTCCG GGTAACCGTT TGCGCGTTGC 600 

TTTGACTGGT TTGACTATGG CCGAATATTT CCGTGACGAG AAAGATGAAA 650 

25 ACGGCAAAGG TCGTGACGTC TTGTTCTTTG TGGATAATAT CTATCGCTAT 700 

ACTCTGGCCG GTACTGAAGT GTCTGCACTG TTAGGTCGTA TGCCGTCTGC 750 

AGTAGGTTAT CAGCCTACAT TGGCAGAAGA AATGGGTCGC TTGCAGGAGC 800 

GTATTACTTC TACTCAAACA GGTTCGATTA CTT 833 



30 
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2) INFORMATION FOR SEQ ID NO: 758 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria animalis 

(B) STRAIN: ATCC 19573 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 



CGCGATGCCA TTCCTCACGT TTACGATGCC CTGAAATTGG ACGACACCGG 50 

TCTGACTTTG GAAGTACAAC AACTTCTGGG CGACGGTGTG GTACGTACCA 100 

50 TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GGGGTTTGTC TGTGAGCAAT 150 

ACCGGTTCTC CGATTGCCGT TCCTGTCGGT AAAGCGACTT TGGGTCGTAT 200 

TATGGACGTA TTGGGCAATC CGGTTGATGA AGCCGGTCCG GTTGCTACCG 250 

AAGAGAAACG TGCTATTCAC CAAGCCGCAC CGAAGTTTGA TGAATTGTCA 300 

TCAGCTACCG AGTTGTTGGA AACCGGTATT AAAGTAATCG ACTTGCTGTG 350 

55 TCCGTTTGCA AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 400 

GCAAAACCGT AAACATGATG GAATTGATCA ACAACATCGC CAAAGCACAC 450 

AGCGGTCTGT CTGTGTTTGC CGGTGTAGGT GAACGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTGTTGGAT AAAGTAGCCA 550 

TGGTGTACGG TCAGATGAAT GAGCCGCCGG GCAACCGCTT GCGCGTGGCT 600 

60 TTGACCGGCC TGACTATGGC CGAATACTTC GTGACGAAAA AGACGAAAAC 650 

432 
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GGCAAAGGTC GTGACGTATT GTTCTTCGTG 
ACTGGCCGGT ACCGAAGTAT CAGCATTGTT 
TAGGTTATCA GCCGACATTG GCAGAGGAAA 
ATTACCTCTA CCCAAACCGG TTCGATTACC 



GACAACATTT ACCGCTACAC 
GGGCCGTATG CCGTCTGCAG 
TGGGTCGCTT GCAAGAGCGT 
TCT 



700/ 
750 
800 
833 
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2) INFORMATION FOR SEQ ID NO: 759 

(i) SEQUENCE CHARACTERISTICS: 
(A) LE NGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 

(B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 

AGCGTCCCTA AAGTATACGA CGCTCTTGAG GTTATGAATG GTAAAGAAAA 50 

ACTGGTGCTG GAAGTTCAGC AACAGTTAGG CGGTGGTATC GTTCGTTGTA 100 

TCGCAATGGG TACATCAGAC GGTTTAAGCC GTGGCTTAAA AGTTGAAGAT 150 

TTAGGCCACC CAATTGAAGT ACCAGTAGGT AAAGCAACAC TGGGACGTAT 200 

CATGAACGTT CTGGGTACAC CTATTGATAT GAAAGGTGAT ATTGCAACTG 250 

AAGAACGTTG GTCTATTCAC CGTGAAGCAC CAACCTACGA AGAGTTATCA 300 

AACTCACAAG AACTGCTTGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350 

TCCGTTTGCT AAAGGTGGTA AAGTAGGTCT GTTCGGTGGT GCGGGTGTGG 400 

GTAAAACAGT TAACATGATG GAATTGATCC GTAATATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC TGGTGTTGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAT CATGAAATGA CAGATTCTAA CGTTCTTGAC AAAGTATCGT 550 

TAGTTTATGG TCAGATGAAT GAGCCACCAG GAAACCGTCT GCGTGTAGCA 600 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGATGAAG GCCGTGACGT 650 

ACTGTTATTC GTCGATAACA TTTATCGTTA CACCTTAGCC GGTACAGAAG 700 

TATCAGCACT GTTAGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCAACA 750 

TTGGCTGAAG AGATGGGTGT TCTGCAAGAA CGTATCACTT CAACCAAAAC 800 

AGGTTCAATC ACCTCTGTA 819 



2) INFORMATION FOR SEQ ID NO: 760 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

GCCGTACCAC GCGTGTACGA TGGCCTTGAG GTGCAGAATG GTAATGAGAA' ^0 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

5 CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

10 TCCGGTTACT CAGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTC TAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG^CAGATGAAC - GAGCCGCCGG~GAAACCGTCT _ GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

15 TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC ACCTCCGTA 819 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 761 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761 

ACCAAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 100 

ATGGGCTCTT CCGATGGTCT GAGCCGTGGG TTGAAAGTAA CCAACCTAGA 150 

4 0 ACACCCGATC GAAGTACCGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 2 00 

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 250 

CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 300 

ACAAGATCTG TTAGAAACCG GTATCAAGGT TATGGACCTG ATTTGTCCGT 350 

TTGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 400 

45 ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 450 

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 550 

TACGGCCAGA TGAATGAGCC ACCAGGTAAC CGTCTACGCG TTGCACTGAC 600 

CGGCCTGACC ATGGCGGAGA AATTCCGTGA TGAAGGTCGT GACGTACTGC 650 

50 TGTTTATCGA TAATATCTAT CGTTATACCC TAGCTGGTAC GGAAGTATCC 700 

GCATTGCTGG GTCGTATGCC ATCAGCGGTA GGTTATCAGC CAACACTGGC 750. 

TGAAGAGATG GGTGTGTTGC AGGAACGTAT TACTTCCACT AAGACGGGTT 800 

CAATCACCTC TG 812 



2) INFORMATION FOR SEQ ID NO: 762 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 4 08 bases 

434 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 \ (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 



10 



CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 50 

CCATTTCTTC AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 100 

15 CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 150 

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 200 

TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 250 

ATCTGGCCGT ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 300 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 350 

20 ACACGGAGTA ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 400 

ATGTTGAC 408 



25 2) INFORMATION FOR SEQ ID NO: 763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium sordellii 

(B) STRAIN: ATCC 9714 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763 



GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 50 

AGGTGTTGGA GAGAGAACAA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 100 

GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 150 

GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 200 

45 TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 250 

ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 300 

CGTACTCCAT CAGCAGTTGG ATACCAACCA ACATTAGCTA CAGAGATGGG 350 

TAGATTACAA GAGAGAATAA CATCTACAAA TAAAGGGTCT ATAACATCAG 400 



2) INFORMATION FOR SEQ ID NO: 764 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 405 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

435 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 194 02 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764 

TTAATTCAAG AATTAATCAA CAATATAGCG AAGGAACACG GTGGATTATC 50 

TGTATTTACA GGAGTTGGAG AAAGAACAAG AGAAGGTAAT GACCTTTACT 100 

10 ATGAAATGAA AGAATCTGGT GTTATAAATA AAACAGCACT AGTATTTGGT 150 

CAAATGAATG AGCCACCTGG AGCAAGAATG AGAGTTGCTC TTACAGGACT 200 

TACTATGGCA - GAATATTTCA - GGGATCAAGG-AGAAAAGGTA— C-T-T-T-TAT-TGA 250- 

TAGACAACAT ATTTAGATTC ACTCAAGCAG GTTCAGAGGT GTCAGCTTTA 300 

CTTGGAAGAA TACCTAGTGC CGTTGGATAC CAACCAACAC TAGCAACAGA 350 

15 AATGGGTGCG CTTCAAGAAA GAATAACATC TACAAAGCAT GGTTCTATAA 400 

CATCA 405 



20 2) INFORMATION FOR SEQ ID NO: 765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765 

AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC 
GGAGTTGGAG AAAGATCTAG AGAAGGTAAT GATCTTTATC 
AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT 
AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT 
GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA 
ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA 
TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA 
CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA 



AGTGTTTACT 50 

ATGAAATGAG 100 

CAAATGAATG 150 

AACTATGGCT 200 

TAGATAATAT 250 

CTTGGAAGAA 300 

AATGGGTGCA 350 

CAT 393 



45 



2) INFORMATION FOR SEQ ID NO: 766 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 399 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 19401 



436 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ 



PCT/CA00/01150 



ID NO: 766 



AGGAACTTAT AAATAATATT 
ACAGGTGTTG GTGAAAGAAC 
GAAAGATTCT GGGGTTATAG 
ATGAACCACC AGGAGCGAGA 
GCAGAATATT TTAGAGATCA 
TATATTTAGA TTTACGCAGG 
GAATTCCAAG TGCAGTTGGA 
GCATTACAAG AAAGAATAAC 



GCAAAACAAT 
AAGAGAAGGT 
AGAAAACTGC 
ATGAGAGTAG 
AGGGCAAGAT 
CTGGTTCTGA 
TATCAACCAA 
ATCCACAAAA 



ATGGAGGTCT ATCTGTATTT 
AATGACCTAT ATAATGAAAT 
ACTAGTATTT GGACAGATGA 
CATTGACAGG ACTTACTATG 
GTACTTTTAT TTATAGATAA 
AGTTTCTGCA TTGTTAGGAA 
CCCTTGCAAC TGAAATGGGT 
AATGGATCAA TTACTTCAG 



50 
100 
150 
200 
250 
300 
350 
399 
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2) INFORMATION FOR SEQ ID NO: 767 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767 

CATGATGGAA CTGATCCGTA ACATCGCCAT CGAGCACAGC GGTTATTCCG 50 

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100 

GAGATGAAGG ACTCCAACGT TCTGGACAAA GTGGCACTGG TCTACGGTCA 150 

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG ACTGGCCTGA 200 

CCATGGCCGA GAAGTTCCGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 250 

GACAACATCT ACCGTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 300 

GGGCCGTATG CCTTCGGCAG TAGGTTACCA GCCGACCCTG GCTGAAGAGA 350 

TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 390 



2) INFORMATION FOR SEQ ID NO: 768 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 

AGGTCTTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50 

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100 

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150 

GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 200 



437 
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GACTGGTATC AAGGTTGTCG ATTTGCTTGC TCCCTATGCT CGTGGtGGTA 2S# 

AGATTGGTCT TTTCGGTGGT GCCGGTGTCG GCAAGACTGT CTTGATTCAA 300 

GAATTGATCA ACAACATTGC CAAGGCTCAC GGTGGTTACT CTATCTTCTG 350 

TGGTGTCGGT GAACGTACTC GTGAAGGCAA CGATTTGTAC CACGAAATGA 400 

5 TTTCCACTGG TGTCATCAAG CTTGAAGGTG AATCCAAGTG TGCTCTTGTC 450 

TTTGGTCAAA TGAACGAACC CCCCGGAGCT CGTGCCCGTG TTGCCTTGAC 500 

TGGTTTGACC ATTGCCGAAT ACTTCCGTGA TGAGGAAGGT CAAGATGTGT 550 

TGCTCTTCAT TGACAACATT TTCCGTTTCA CTCAAGCCGG TTCTGAAGTG 600 

TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT GTCGGTTACC AACCCACTCT 650 

10 YTCCACTGAT ATGGGTGGTA TGCAAGAGCG TATTACTACC ACCAAGAACG 700 

GTTCCATTAC CTCCGTGCAA GCTGTCTACG TCCCTGCTGA CGATTTGACT 750 



GATCCTGCTC CTGCTACTAC "TTTTGCTCAC _ TTGGACGCCA~~CCACTGTGTT 800 

GTCTCGTTCC- ATTGCTGAGT TGGGTATCTA CCCTGCTGTC GATCCCCTTG 850 
ACTCCAAGTC CCGTATCTTG GATCCCCGTA TCGTTGGTGA TGAGCACTAC 900 
15 TCTGTTGCCA CTGGTGTCCA ACAAATCCTT CAAAACTACA AGTCGCTTCA 950 
AGA 953 



20 2) INFORMATION FOR SEQ ID NO: 769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769 



CGCGGTTCCA AGGCCACCGA CACCGGTGCC CCCATCAAGA TTCCCGTTGG 50 

TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 100 

AGCGTGGTCC CATCAAGGCC ACCAAGTACG CTCCCATCCA CGCCGACCCC 150 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTACCGGTAT 200 

40 CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250 

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGCTGATT 300 

GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 350 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 4 00 

GAGCGTACCC GTGAGGGTAA CGATCTGTAC CACGAGATGC AGGAGACTTC 450 

45 CGTCATTCAG CTTGACGGTG ACTCCAAGGT CGCCCTCGTC TTCGGTCAGA 500 

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 600 

GGAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 650 

TCGACAGTAA GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 700 

50 GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 750 

TCTTGGTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 800 

ACATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 850 

ACCTCCGTCC AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 900 

CCCCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

55 GTATCTCCGA GTTGGGTATC TACCCCGCCG TCGACCCTCT TGACTCCAAG 1000 

TCCCGTATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

CACCCGCGTT CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 1100 

TTGCCATTCT CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 1150 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 1200 

60 CCAGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 1250 

438 
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TCCGATCATT CAAGGCTATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 1300 
GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 1343 



2) INFORMATION FOR SEQ ID NO: 770 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 480 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 



15 



(D) TOPOLOGY sTiheaY 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770 

TTCAGGAGTT GATTGTATGT TCACCTGCAA CATAAGACTT CCCATTCTCC 50 

ACTCTTTTCT AACTCTTCAC AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGATCTG 150 

25 TACCACGAAA TGCAGGAGAC TGGTGTCATT CAGCTCGAGG GTGAATCTAA 200 

GGTCGCCCTT GTCTTCGGTC AGATGAACGA GCCCCCAGGT GCCCGTGCCC 250 

GTGTCGCCCT TACCGGTCTG ACCATCGCCG AGTACTTCCG TGACGAGGAA 300 

GGTCAGGATG TGCTGCTCTT CATTGACAAC ATTTTCCGTT TCACCCAGGC 350 

CGGTTCTGAG GTGTCTGCCC TTCTTGGTCG TATCCCCTCC GCTGTCGGTT 400 

30 ACCAGCCCAC TCTGGCCGTC GACATGGGTG GTATGCAGGA GCGTATTACC 450 

ACCACCACCA AGGGTTCCAT TACCTCCGTC 480 



35 2) INFORMATION FOR SEQ ID NO: 771 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 3 8592 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 

CTCGAACAAG AYAACTTGCC YGCCATTT' 
YCACTCTGGT GGACGTCTCG TYCTCGAA< 



ACACTGTCCG TACTATTGCT ATGGATGGTA 
TCCCANNGGA TACAGTCARA CAGMAATGTC 

55 GATGATTGAC CAATATGTTA GGTCTTGTCC 
CACTGGTGCT CCCATTACCA TCCCCGTTGG 
TCATCAACGT TATTGGTGAA CCCATTGATG 
AAGACTCACC GTCCTATTCA CGCTGAAGCT 
CCCCACTCCC GAAATCCTCG AGACTGGTAT 

60 CTCCTTATGC TCGTGGTGGT AAGATTGGTC 



ID NO: 771 






AACGCCCTTG 


AAGTCAAGGA 


50 


CTCTCAACAT 


TTGGGTGAGA 


100 


CTGAAGGTAA 


GTTATGTYCA 


150 


TAGTGGTTAT 


AGCAGYAGCA 


200 


GGTGGTCAAA 


AGGTTGTTGA 


250 


TAAGGAAGTC 


CTTGGTCGTA 


300 


AACGTGGTCC 


CATTGACGCC 


350 


CCCGAATTCG 


TTGATCAATC 


400 


CAAGGTYGTC 


GATTTGTTGG 


450 


TCTTCGGTGG 


TGCTGGTGTC 


500 
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GGTAAGACTG TCTTGATTCA AGAATTGATT AACAACATYG CCAAGGCTCA 550 

CGGTGGTTAC TCTATCTTCT GTGGTGTCGG TGAACGTACT CGTGAGGGTA 600 

ACGATTTGTA CCATGAAATG ATTGAAACCG GTGTCATTCA ATTGGAAGGC 650 

GAGTCCAAGT GTGCTCTCGT CTTTGGTCAA ATGAACGAAC CCCCAGGTGC 700 

5 TCGTGCCCGT GTCGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCCGTG 750 

ATGATGAGGG TCAAGATGTC TTGCTTTTCA TTGATAACAT TTTCAGATTC 800 

ACTCAAGCTG GTTCTGAGGT ATCTGCCCTT TTGGGTCGTA TTCCTTCCGC 850 

TGTCGGTTAC CAACCCACTC TTTCCACYGA TATGGGTGGT ATGCAAGAGC 900 

GTATTACTAC CACCAAGAAC GGTTCCATTA CCTCCGTCCA AGCTGTCTAC 950 

10 GTCCCTGCTG ATGATTTGAC CGATCCTGCT CCTGCCACCA CTTTTGCTCA 1000 

CTTGGAT GCC ACCACTGTCT TGTCTCGTTC CATCGCTGAA TTGGGTATCT 1050 

AYCCCGCTGT CGAT CCTCTT~GATTCCAAGT"~CTCGTATCCT~CGATCCCCGT MO 0 

ATTGTCGGTG ATGAGCACTA CAAGGTTGCC ACTGAAGTTC AACAAATCCT 1150 

CCAAAACTAC AAGTCTCTCC AAGA • 1174 

15 



2) INFORMATION FOR SEQ ID NO: 772 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 467 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

3 0 (B) STRAIN: ATCC 242 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772 

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

35 ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100 

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 200 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 250 

TAAGTTCTAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCTAACTTGC 300 

4 0 TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGCTGGTAT 350 

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 400 

CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 450 

AAGAAGGGAT CCATTAC 467 



2) INFORMATION FOR SEQ ID NO: 773 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 578 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55- (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773 

TTGCCAAGGC CCACGGTGGT TACTCTGTCT TCACTGGTGT CGGTGAGCGT 50 

ACCCGTGAGG GTAACGATCT GTACCACGAA ATGCAGGAAA CCTCCGTCAT 100 

5 TCAGCTTGAT GGCGAGTCCA AGGTCGCGCT TGTCTTCGGT CAGATGAACG 150 

AGCCCCCTGG TGCCCGTGCT CGTGTCGCTC TTACTGGTCT TACCGTTGCC 200 

GAGTACTTCC GTGATGAGGA GGGTCAGGAT GGTAAGTTAT ATCGTTTTTA 250 

TTATCTTCTT TGCCACCACC CCTCTACGAA TCCATGCCTC CGTTGGTGAA 300 

GGCATCGTTT GTAGGGCGGG TCGGAGTTTG CGGCAATTTC TGCCGTCGGC 350 

10 TTGAAGCCGC GGATGCCCGA TGTTTGACGC GTATCGATGC TAACAACAAT 400 

GACAACAGTG CTTCTCTTCA TTGACAACAT TTTCCGATTC ACCCAGGCCG 450 



GTTCCGAGGT GTCTGCCCTT~CTCGGTCGTA~TTCCCTCTGC~CGTCGGTTAC 500- - 

CAGCCCACTC TTGCCGTAGA CATGGGTGCC ATGCAGGAAC GTATTACCAC 550 
CACCAAGAAG GGTTCGATTA CCTCCGTC 578 



15 



2) INFORMATION FOR SEQ ID NO: 774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
30 (B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTCCAAGATC ACTCTGGTGG 50 

35 ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100 

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

CCGAGTATGA TATTAACTTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 200 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTTCTCGG 250 

TCGTATCATT AACGTCATTG GTGAACCCAT CGATGAACGT GGTCCTATCA 300 

40 ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 350 

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 400 

GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450 

GTGTCGGTAA GACTGTGTTG ATTCAAGAAT TGATTAACAA CATCGCCAAG 500 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GTCGGTGAAC GTACTCGTGA 550 

45 AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600 

ATGGTGACTC CAAGTGTGCT CTTGTCTTTG GTCAAATGAA CGAACCCCCA 650 

GGAGCTCGTG CCCGTGTTGC CTTGACTGGT TTGACCATTG CTGAATACTT 700 

CCGTGATGAT GAAGGTCAAG ATGTGTTGCT TTTCATTGAT AACATTTTCC 750 

GTTTCACCCA AGCTGGTTCW GAAGTATCTG CCCTTTTGGG TCGTATTCCC 800 

50 TCCGCTGTCG GTTACCAACC CACTCTTTCT ACTGATATGG GTGGTATGCA 850 

AGAACGTATT ACAACCACCA AGAACGGTTC CATTACATCC GTCCAAGCTG 900 

TCTACGTCCC TGCTGATGAT TTGACCGATC CTGCTCCCGC CACCACTTTT 950 

GCTCACTTGG ATGCCACCAC TGTCTTGTCT CGTTCCATTG CCGAATTGGG 1000 

TATTTACCCT GCCGTCGATC CTCTTGAYTC CAAGTCTCGT ATCTTGGATC 1050 

55 CTCGTATCGT TGGTGACGAA CACTACAAGG TCGCTACCGA AGTTCAACAA 1100 

ATCCTTCAAA ACTACAAGTC TCT 1123 



60 2) INFORMATION FOR SEQ ID NO: 775 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scopulariopsis koningii 

( B) !TfRAIN~ATCC"~3"87~4"5 



15 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 775 



ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 50 

GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 100 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 200 

20 CCTTACCGGT CTGACCGTTG CCGAGTACTT CCGTGACGAG GAGGGCCAGG 2 50 

ATGGTGAGTA ACCGACGAAG TCTGAGATCT TGTCGGGCAT TATTCTAACG 300 

ACAACTAGTG CTTCTCTTCA TCGACAACAT TTTCCGCTTC ACCCAGGCCG 350 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 400 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATTACCAC 450 

25 GACTCAGAAG GGCTCGATTA CCTCGGT 477 



2) INFORMATION FOR SEQ ID NO: 776 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776 

4 5 TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50 

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 100 

TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC TCACGGTGGT 150 

TACTCTGTCT TCACTGGTGT CGGAGAGCGT ACCCGTGAAG GAAACGATCT 200 

CTACCATGAA ATGCAGGAGA CCCGTGTCAT TCAGCTTGAT GGCGAGTCCA 250 

50 AGGTCGCCCT GGTCTTCGGC CAGATGAACG AGCCCCCAGG TGCCCGTGCC 300 

CGTGTTGCTC TTACTGGTTT GACCATTGCT GAGTACTTCC GTGATGAGGA 350 

AGGTCAAGAC GGTGAGTTTY TTATGGATAA AARATTTTTT TTTTTTTTTT 400 

TTTTTTMAAR AAATTCATGT TCTAACAAAG TATATCCTAG TGCTTCTCTT 450 

CATCGACAAC ATTTTCCGTT TCACTCAGGC TGGTTCCGAA GTGTCTGCCC 500 

55 TGCTTGGTCG TATCCCATCT GCCGTCGGTT ACCAACCCAC TCTTGCCGTC 550 

GACATGGGTG GTATGCAGGA ACGTATTACC ACCACCAAGA AGGGATCCAT 600 

TACCTCCGTC 610 



60 
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10 



30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANI SM7" "Trichophyton - tonsurans 

(B) STRAIN: ATCC 56185 

15 (xi) SEQUENCE DESCRIPTION: SEQ- ID NO: 777 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

GTGAAAATTG GACAATTGAA CAATTTAGCC CTTGGAGAAA AGAAATTTCG 100 

AGTATTAATT ATTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150 

20 TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 200 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 300 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350 

TCAAGACGGT GAGTTTCTTA TGGATGAAAG ATTTTTTTTT TTCAAGAAAT 400 

25 TCATGTTCTA ACAAAGTGTA TTCTAGTGCT TCTCTTCATC GACAACATTT 450 

TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT CTGCCCTGCT TGGTCGTATC 500 

CCATCTGCCG TCGGTTACCA ACCCACTCTT GCCGTCGACA TGGGTGGTAT 550 

GCAGGAACGT ATTACCACCA CCAAGAAGGG ATCCATTACC TCC 593 



2) INFORMATION FOR SEQ ID NO: 778 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1141 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 



CCGTGGTCAA GAAGTTATTG ACACTGGTGC CCCAATTACC ATTCCTGTTG 50 

GTCGTGGTAC TCTTGGTAGA ATTATCAACG TCATTGGTGA ACCAATTGAC 100 

50 GAACGTGGCC CTATCAAGGC TTCTAAGTAT GCTCCTATCC ATACTGAACC 150 

ACCAACCTTT GCTGAACAAT CTACTTCTGC TGAAGTTCTT GAAACCGGTA 200 

TCAAGGTTGT CGATCTTCTT GCTCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTTTTCGGTG GTGCTGGTGT CGGTAAGACT GTCTTCATTC AAGAACTTAT 300 

TAACAACATT GCCAAGGCTC ACGGTGGTTT CTCTGTCTTC ACTGGTGTCG 350 

55 GTGAAAGAAC CCGTGAAGGT AACGATCTTT ACCGTGAAAT GAAGGAAACT 400 

GGTGTCATCA ACCTCGAAGG TGACTCCAAG GTCGCTCTCG TTTTCGGTCA 450 

AATGAACGAA CCTCCAGGTG CCCGTGCCCG TGTCGCTTTG ACTGGTCTTA 500 

CCATTGCCGA ATACTTCCGT GATGAAGAAG GACAAGATGT CTTGCTTTTC 550 

GTTGACAACA TTTTCAGATT CACCCAAGCC GGTTCTGAAG TCTCTGCTCT 600 

60 TTTGGGTCGT ATTCCATCTG CCGTCGGTTA CCAACCTACC CTTGCTACCG 650 
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ATATGGGTGC CCTCCAAGAA 
ACATCTGTCC AAGCCGTCTA 
CCCAGCCACC ACTTTCGCTC 
CCATTTCCGA ATTGGGTATC 
5 TCTCGTCTTT TGGATCCTGA 
CACTCAAGTT CAACAAACCC 
TTGCCATTTT GGGTATGGAT 
GAACGTGCCC GTAAGATCCA 
CGAAGTTTTC ACTGGTATCG 
10 TCAGATCTTT CAAGGAAATC 



CGTATTACCA CCACCCAAAA 
TGTCCCAGCA GACGATTTGA 
ACTTGGACGC CACCACTGTC 
TACCCAGCTG TCGATCCTCT 
AGTTATTGGA CACGAACACT 
TCCAAGCTTA CAAGTCTCTC 
GAATTGTCTG AAGCTGATAA 
AAGATTCCTT TCCCAACCAT 
AAGGTCGTCT CGTTCCATTG 
CTTGAAGGTA AGTACGATCA 



PCT/CA00/01150 
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2) INFORMATION FOR SEQ ID NO: 779 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 22864 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 79 

30 GCTGAAGGCC GAACGTGAGC GTGGTATCAC CATCGATATC GCGCTCTGGA 50 

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCC GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 250 

35 AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 300 

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT ACAAGGGCTG 450 

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

40 CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 600 

CGGTCGTGTT GAGACTGGTA CCATCAAGGC CGGTATGGTC GTCACCTTTG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 750 

45 CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTCCAAGA 800 

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 850 

AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA AGTTCTCTGA GCTCCTCGAG AAGATCGATC 950 

GTCGTACCGG CAAGTCCATG GAAAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

50 GATGCTGCCA TCGTGAAGAT GGTTCCCAGC AAGCCTATGT GCGTTGAGGC 1050 

TTTCACCGAC TACCCTCCTC TTGGTCGTTT CGCCGTCCGT GAC 1093 



55 2) INFORMATION FOR SEQ ID NO: 780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CAOO/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780 
TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 50 



CTTCCTTCAT~CAAGAAGATT-GGTTACAATC-CTA?^ATCCGT -TCCTTTCGTY- 10 0- 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 150 

GCCTTGGTAC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 2 00 

15 GTAAGACTCT CTTAGAAGCC ATTGACAGCA. TTGAACCTCC TACCCGTCCT 250 

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 3 00 

TATTGGTACT GTCCCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 350 

GTATGGTTGT TACYTTCGCT CCCGCTAACG TCACCACTGA AGTTAAGTCC 400 

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 450 

20 TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 500 

TCTGTTCCGA CTCCAAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 600 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGCTGAAT 650 

TATTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

25 AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 750 

GC 752 



30 2) INFORMATION FOR SEQ ID NO: 781 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 



CAAGTGGTCT GAGGACCGTT ACCAGGAAAT CATCAAGGAG ACCTCCAACT 50 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 150 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 200 

50 TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 250 

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 3 00 

GGTCCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 4 00 

CACCACGAGC AGCTTACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 450 

55 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 500 

ACTCCAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 550 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 650 

AGATCGACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

60 AAGTCCGGTG ACGCCGCCAT CGTCAAGA 728 
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2) INFORMATION FOR SEQ ID NO: 782 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



— (i"ipMOLECULE~TYPEl^Genotnic-DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 

20 TGAAGTCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACTC CCAAGTAAGG CTCAACAGAC ACAACAAGCA AATGCATACT 100 

CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 150 

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200 

GACTGCGCCA TTCTCATCAT TGCCGCCGGT ACTGGTGAGT TCGAGGCCGG 250 

25 TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 300 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAG 350 

TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 400 

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 450 

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500 

30 AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGCCATCG ACCCCCCCAC TCGTCCCACC GACAAGCCCC . 600 

TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT 650 

CCCGTCGGTC GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTCAC 700 

CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750 

35 ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGCAACGTCG CCGGTGACTC 850 

CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 900 

TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950 

TGCCACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 1000 

40 CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TTCATCAAGT 1050 

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 1100 

GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 1145 



45 
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2 ) INFORMATION FOR SEQ ID NO: 783 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1151 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783 

446 
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GCTCAAGGCT GAGCGTGAGC GTGGTATCAC CATTGACATC GCCCTCTGGA St 

AGTTCGAGAC CCCCAAGTAC ATGGTCACCG TCATCGGTAT GCTTTATCTG 100 

TTTCCCATTT ATAGTTGCGA CAAGTAACTA ATAAAAAGTA GATGCCCCCG 150 

5 GACACCGTGA CTTCATCAAG AACATGATTA CTGGTACCTC CCAGGCCGAC 200 

TGCGCTATTC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCT TTCACCCTCG 300 

GTGTCAAGCA GCTCATCGTT GCCATCAACA AGATGGACAC CACCAACTGG 350 

TCTGAGTCCC GTTTCGGTGA AATCATCAAG GAAGTCACCA ACTTCATCAA 400 

10 GAAGGTCGGC TACGACCCCA AGGGTGTCCC ATTCGTCCCA ATCTCTGGCT 450 

TCAACGGTGA CAACATGATT GAGCCCTCCA CCAACTGCCC ATGGTACAAG 500 

GGATGGAACTr AGGAGACeAA-GGeeGGTGGe-AAATGGTG-TG-GTAAGAGGG-T 550 

CCTTGAGGCC ATCGATGCCA TTGACATGCC CACTCGTCCC ACCGACAAGC 600 

CTCTCCGTCT CCCACTCCAG GATGTCTACA AGATCTCTGG TATCGGAACA 650 

15 GTACCAGTCG GTCGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTTGT 700 

CACTTTCGCC CCCGCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 750 

ACCACCAGCA GCTCGTTCAG GGTGTTCCCG GTGACAACGT TGGCTTCAAC 800 

GTCAAGAACG TCTCTGTCAA GGAAGTCCGC CGTGGTAACG TTGCCGGTGA 850 

TTCCAAGAAC GACCCACCAT CTGGCTGCGC CTCTTTCAAG GCCCAGGTCA 900 

2 0 TCGTCCTCAA CCACCCCGGC CAGATCGGTG CTGGTTACGC CCCAGTCCTC 950 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TTCTTGAGAA 1000 

GATTGACCGC CGTACTGGTA AATCCGTCGA AACCAGCCCT AAGTTCGTCA 1050 

AGTCTGGTGA TGCCGCTATT GCCACCATGG TTCCATCCAA GCCCATGTGC 1100 

GTTGAGGCTT TCACTGACTA CCCACCACTT GGTCGTTTCG CCGTCCGTGA 1150 

25 C 1151 



2) INFORMATION FOR SEQ ID NO: 784 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 

45 CCAAGTGGTC TCAAGATCGT TACAACGAAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGATCGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGCCCT 150 

GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 200 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 250 

50 CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 3 00 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 350 

GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 4 00 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 4 50 

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

55 TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 550 

AGTTATTATC TTGAACCATC CCGGTCAAAT CTCTGCTGGT TACGCACCAG 600 

TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 700 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

60 CTATAATGAT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 800 
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TTGTGTTGCT AACGTCGCAT 
AAGAAGATGG AAGATGCTCC 
AAAATAGAAT ACTAACAATT 
CtGCTATCGT CAAGATGGTT 



TTGCTTCTTT TGCTTCTTTT 
CAAAGTAAGT ATTACGATTG 
ATTGTTTATA GTTCGTCAAG 
CCCTCCAAG 



PCT/CAOO/01150 

GCATGTAGGT 8^6 
ATGGACAATT 900 
TCTGGTGACT 950 
979 



2 ) INFORMATION FOR SEQ ID NO: 785 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 



15 



( B ) TYPE! - Nucl ei"c~a"c i"d" 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialaphora verrucosa 
20 (B) STRAIN: ATCC 38561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

25 TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100 

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 150 

TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGCCGGTA 200 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 250 

GGTGTGAAGC AGCTTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 30 0 

30 GTCCGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 350 

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 400 

TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 500 

TCGAGGCCAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 55 0 

35 CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 60 0 

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 650 

CCTTCGCTCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAAATGCAC 700 

CACGAACAGC TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

40 CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850 

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 900 

TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TACCGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 1000 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

45 TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 1099 



50 



2 ) INFORMATION FOR SEQ ID NO: 786 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Saksenaea vasiformis 
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(B) STRAIN : ATCC 60625 



{xi) SEQUENCE DESCRIPTION : SEQ 



ID NO: 786 



ACCACCAAGT GGTCTGAGGC 
CGGYTTCATC AAGAAGGTCG 
CCATCTCCGG TTTCAACGGT 
CCCTGGTACA AGGGTTGGGA 
GACCMTCCTC GAGGCCATTG 
ACAAGCCCCT YCGTCTTCCC 
GGCACGGTTC CCGTCGGTCG 



CCGTTACCAG 
GCTTCAACCC 
GACAACATGA 
GAAGGAGACC 
ACGCCATCGA 
CTMCAGGATG 
TGTYGAGACC 
CTGGTGTCAC 
GCCGAGGGTS 
CGTCAAGGAG 
CCCCCAAGGG 
CCCGGTCAGG 
TCACATTGCC 
CCGGCAAGTC 
GCCATCGTCA 



GAGATCATCA 
CAAGCACGTT 
TCGACGTCTC 
AAGGCCAAGG 
CCCCCCYAGY 
TYTACAAGAT 
GGTRCCATCA 
"CACTGAGGTC 
TCCCCGGTGA 
ATCCGTCGTG 
CTGCGACAGC 
TCGGCGCCGG 
TGCAAGTTCT 
CATCGAGTCC 
AGATGGTTCC 



AGGAGACCTC 
CCCTTCGTGC 
CACCAACTGC 
CCACCGGCAA 
CGTCCCACCG 
TGGCGGTATT 
AGGGTGGCAT 
AAGTCCGTCG 
CAACGTCGGC 
GCAACGTTGC 
TTCAACGCCC 
CTACGCGCCS 
CTGAGCTCCT 
GGCCCCAAGT 
CTCCAAGCCC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 



GGTCGTCACC TTCCCCCCCG 
AGATGCACCA CGAGCAGCTC 
TTCAACGTCA AGAACGTCTC 
CGGTGACTCC AAGAACGACC 
AGGTCATCGT CCTCAACCAC 
GTCCTSGACT GCCACACTGC 
CGAGAAGATC GACCGCCGTT 
TCATCAAGTC TGGTGACGCC 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 787 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Syncephalastrum racemosum 

(B) STRAIN: ATCC 32330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CACGTCACCG TCATTGATGC CCCCGGCCAT 100 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CTGACTGCGG 150 

TATCCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CACGCTCTGC TTGCCTTCAC CCTCGGTGTC 250 

CGTCAGCTGA TCGTCGCCAT CAACAAGATG GACTCGACCA AGTACTCTGA 300 

GGCCCGTTAC AACGAAATCG TCAAGGAGGT CTCCACCTTC ATCAAGAAGA 350 

TCGGTTTCAA CCCCAAGTCC GTTCCCTTCG TCCCCATCTC TGGCTGGAAC 400 

GGTGACAACA TGTTGGAGGA GTCCTCCAAC ATGCCCTGGT TCAAGGGCTG 450 

GAAGAAGGAG ACCAAGGCTG GCGAGAAGTC CGGCAAGACC CTCCTTGAGG 500 

CCATTGACAA CATTGACCCC CCGGTCCGTC CCTCGGACAA GCCCCTCCGT 550 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATCGGCA CAGTCCCCGT 600 

CGGTCGTGTC GAGACTGGTG TCATCAAGGC TGGTATGGTC GTGACCTTCG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

CAGCTCGTCG AGGGTGTCCC CGGTGACAAC GTCGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTC AAGGATATCC GCCGTGGTAA CGTCTGCTCT GACTCCAAGA 800 

ACGACCCCGC CAAGGAGTCT GCCTCGTTCA CCGCCCAGGT CATCGTCCTG 850 

AACCACCCCG GTCAGATCGG TGCCGGTTAC GCCCCGGTTC TTGACTGCCA 900 

CACCGCTCAC ATTGCCTGCA AGTTCGCTGA GCTCCTCGAG AAGATCGACC 950 

GTCGTTCCGG YAAGAAGCTC GAAGAGTCCC CCAAGTTCGT CAAGTCGGGT 1000 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1050 

CTACACTGAG TACCCCCCTC TTGGCCGTTT CGCC 1084 
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2) INFORMATION FOR SEQ ID NO: 788 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(IT) - M0LECULE _ TYPET — Genomi"c _ DNA" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyti 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC 
AGTTCGAGAC CCCCAAGTAC AATGTCACCG 
ACCTTTCCCC TCCATCGTCT TGCTGTGCCA 
CCCCGGTCAC CGTGACTTCA TCAAGAACAT 
CTGACTGTGC TATTCTCATC ATTGCTGCCG 
GGTATCTCCA AGGATGGCCA GACCCGTGAG 
CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT 
ACTGGTCCGA GGACCGTTTC AAGGAAATCA 
ATCAAGAAGG TTGGCTACGA CCCCAAGGGT 
TGGTTTCAAC GGTGACAACA TGATTGAGGC 
ACAAGGGATG GAACAAGGAG ACCAAGGCCG 
ACCCTYCTCG AGGCCATCGA TGCCATCGAC 
CAAGCCCCTY CGTCTCCCAC TCCAGGATGT 
GAACTGTACC AGTCGGTCGT GTTGAGACCG 
GTCGTCACCT TCGCCCCTGC CAACGTCACC 
AATGCACCAC CAGCAGCTTC AGCAGGGTGT 
TCAACGTCAA GAACGTTTCC GTCAAGGAAG 
GGTGACTCCA AGAACGACCC ACCATCCGGC 
GGTCATCGTC CTCAACCACC CCGGCCAGAT 
TCCTCGACTG CCACACTGCT CACATTGCTT 
GAGAAGATTG ACCGCCGTAC CGGTAAATCC 
CGTCAAGTCT GGTGATGCCG CTATCGCCAA 
TGTGCGTTGA GGCTTTCACT GACTACCCCC 
CGTGA 



tonsurans 



ID NO: 788 

CATCGATATC GCCCTCTGGA 50 

TCATTGGTAT GTTTTTCTTT 100 

TAACTAACGA GAGTAGACGC 150 

GATCACTGGT ACCTCCCAGG 200 

GTACTGGTGA GTTCGAGGCT 250 

CACGCTCTGC TCGCCTTCAC 300 

CAACAAGATG GACACCACCA 350 

TCAAGGAAGT CACCAACTTC 400 

GTTCCATTCG TTCCAATCTC 450 

CTCCAGCAAC TGCCCATGGT 500 

GTGGTGCCAA GACTGGCAAG 550 

ATGCCAACCC GTCCTACCGA 600 

CTACAAGATC TCTGGTATCG 650 

GTATCATCAA GCCTGGTATG 700 

ACTGAAGTCA AGTCCGTCGA 750 

CCCCGGTGAC AACGTCGGCT 800 

TCCGCCGTGG TAACGTTGCC 850 

TGTGCCTCCT TCAACGCCCA 900 

CGGTGCTGGT TACGCCCCAG 950 

GCAAGTTCGC TGAGCTCCTC 1000 

GTCGAAGCCA ACCCCAAGTT 1050 

GATGGTTCCC TCCAAGCCCA 1100 

CACTTGGTCG TTTCGCCGTC 1150 

1155 



2) INFORMATION FOR SEQ ID NO: 789 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1138 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

450 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789 

TCAAGGCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACCC CCAAGTACAA TGTCACCGTC ATTGGTATGT TTCTCTTTAC 100 

5 CTTTCCCCTC CATCGTCTTG CTGTGCCATA ACTAACGAGA GTAGACGCCC 150 

CCGGTCACCG TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200 

GACTGTGCTA TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG 250 

TATCTCCAAG GATGGCCAGA CCCGTGAGCA CGCTCTGCTC GCCTTCACCC 300 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC 350 

10 TGGTCCGAGG ACCGTTTCAA GGAAATCATC AAGGAAGTCA CCAACTTCAT 400 

CAAGAAGGTT GGCTACGACC CCAAGGGTGT TCCATTCGTT CCAATCTCTG 450 

GTTTCAACGG _ TGACAACATG — ATTGAGGeCT-eCAGGAAG-TG-€GGATGGTAG 500- 

AAGGGATGGA ACAAGGAGAC CAAGGCCGGT GGTGCCAAGA CTGGCAAGAC 550 

CCTCCTCGAG GCCATCGATG CCATCGACAT GCCAACCCGT CCTACCGACA 600 

15 AGCCCCTCCG TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA 650 

ACTGTACCAG TCGGTCGTGT TGAGACCGGT ATCATCAAGC CCGGTATGGT 700 

CGTCACCTTC GCCCCTGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA 750 

TGCACCACCA GCAGCTTCAG CAGGGTGTCC CCGGTGACAA CGTCGGCTTC 800 

AACGTCAAGA ACGTTTCCGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG 850 

20 TGACTCCAAG AACGACCCAC CATCCGGCTG TGCCTCCTTC AACGCCCAGG 900 

TCATCGTCCT CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC 950 

CTCGACTGCC ACACTGCTCA CATTGCTTGC AAGTTCGCTG AGCTCCTCGA 1000 

GAAGATTGAC CGCCGTACCG GTAAATCCGT CGAAGCCAAC CCCAAGTTCG 1050 

TCAAGTCTGG TGATGCCGCT ATCGCCAAGA TGGTTCCATC CAAGCCTATG 1100 

25 TGCGTTGAGG CTTTCACTGA CTACCCCCCA CTTGGTCG 1138 



2) INFORMATION FOR SEQ ID NO: 790 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Bipolaris hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 

45 CACCACCAAG TGGTCTGAGG AGCGTTACCA GGAAATCATC AAGGAGACCT 50 

CCAACTTCAT CAAGAAGGTC GGCTACAACC CCAAGCACGT TCCCTTCGTC 100 

CCCATCTCTG GTTTCAACGG AGACAACATG ATTGAGGCTT CCACCAACTG 150 

CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACTGGTA 200 

AGACCCTTCT CGAGGCCATC GACGCCATCG ACCCCCCTGT CCGTCCTACC 250 

50 GACAAGCCCC TCCGCCTTCC CCTCCAGGAT GTGTACAAGA TTGGTGGTAT 300 

TGGCACGGTT CCCGTCGGTC GTGTCGAGAC CGGTATCATC AAGCCCGGTA 350 

TGGTCGTCAC CTTCGCCCCC GCTGGTGTCA CCACTGAAGT CAAGTCCGTC 400 

GAGATGCACC ACGAGCAGCT TGCCGAGGGT GTCCCCGGTG ACAACGTCGG 450 

CTTCAACGTC AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGTAACGTTG 500 

55 CCGGTGACTC CAAGAACGAC CCCCCCAAGG GTTGCGAGTC CTTCAACGCC 550 

CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCCG GTTACGCACC 600 

AGTCCTTGAC TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGCTCC 650 

TCGAGAAGAT TGACCGCCGT ACCGGAAAGT CTGTTGAGAA CTCCCCCAAG 700 

TTCATCAAGT CCGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAG 748 



60 
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2) INFORMATION FOR SEQ ID NO: 791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
15 (B) STRAIN: ATCC 14110 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 

CGCTATTGTC GTTGTTGCTG CCTCCGACGG TCAGATGTAG GTGGAACATC 50 

20 TTGGGAAATA CGTCGTAAAA CACGGCGCTT ACGTTTTCGC GAATAGGCCC 100 

CAGACTCGTG AGCATTTGCT GCTCGCCCGC CAGGTTGGTG TCCAGAAGAT 150 

CGTTGTCTTC GTCAACAAAA TCGATGCTAT TGATGATCCG GAGATGCTGG 200 

AACTGGTCGA ACTCGAGATG CGTGAGCTGC TGAACAGCTA CGGTTTCGAG 250 

GGTGAAGAGA CTCCGATCAT TTTCGGTTCC GCTCTCTGTG CTCTCGAAGG 3 00 

25 ACGCCGTGAC GACATCGGTA AAGACAGAAT TGAGCAGCTT ATGAACGCTG 350 

TCGACACCTG GATCCCCACT CCTCAGCGTG ACCTCGACAA ACCTTTCTTG 400 

ATGTCTGTCG AGGAAGTGTT CTCTATCGCC GGCCGTGGTA CCGTGGCTTC 450 

TGGTCGTGTC GAGCGTGGTA TCTTGAAGAA GGACTCTGAG GTTGAGATTG 500 

TTGGAGGCTC CTTCGAACCC AAGAAGACCA AAGTCACCGA CATTGAAACC 550 

3 0 TTCAAGAAGA GCTGTGATGA ATCGCGTGCT GGTGACAACT CTGGTCTCCT 600 

CCTGCGTGGT ATCCGACGTG AAGACGTCAA GCGTGGTATG GTCATTGCTG 650 

TTCCCGGCAG CACCAAAGCT CACGACAAGT TCCTCGTCTC CATGTACGTC 700 

CTGACCGAGG CGGAGGGTGG TCGTCGTACT GGCTTCGGTG CCAACTACCG 750 

TCCCCAAGTC TTCATCCGTA CTGCAGGTAA GTTCCCGCAC ACCGTGTCCA 800 

35 GATCTTCCGA GAGATTAGCG ATATATGCTA ATGATTCATC AGACGAGGCT 850 

GCTGACCTCA GCTTCCCTGA CGGCGACCAA TCTCGCAGAG TTATGCCTGG 900 

TGACAACGTC GAGATGATCC TGAAGACCCA CCACCCTGTT GCTGCTGAGG 950 

CTGGTCAA 958 



40 



55 



2) INFORMATION FOR SEQ ID NO: 792 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 



CGTTGTCGCA GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACG 50 

GATGAAAGGA TTTGACGTTT CTAACATCAT TCTAGGCCTC AGACCAGAGA 100 

60 ACATTTGCTC CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG 150 
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TTAACAAGGT 
CTTGAAATGC 
CCCCATCATT 
AGCTTGGTGT 
5 ATCCCCACCC 
GGAAGTGTTC 
AGCGTGGTAT 
ACCACCCCTA 
CTGCGATGAA 
10 TCAAGCGTGA 
ACCAAGGCTC 



TGAGGGTGGT 
TCATCCGTAC 
ATCATTGCTA 
15 TGGCCTGGAG 
GATGATTTGC 



CGATGCCGTT 
GTGAACTCCT 
TTTGGCTCTG 
CGAGAAGATT 
CCGAGCGCGC 
TCTATCTCTG 
CCTCAAGAAG 
TCAAGACCAA 
TCTCGAGCTG 
GGACTTGAAG 
ACACCGACTT 
CGTTCCAACG 
TGCTGGTATG 
ACTTGTATTT 
AAGACCAAGA 
AAGACCCTCC 



GAGGACCCAG 

CAGCCACTAC 

CTCTCTGTGC 

GACGAGCTAT 

CACTGATAAG 

GTCGTGGTAC 

GATTCCGACG 

GGTCACAGAT 

GTGACAACTC 

CGTGGAATGG 

CATGGTCTCC 

"GCTTCACCCA" 

TAACCCAAGT 

CCTTCCGTAG 

CAAGAAGGCT 

ACCCCATTGC 



AGATGTTGGA 
GGTTTCGAGG 
CCTCGAGTCC 
TGAACGCCGT 
CCTTTCCTCA 
CGTCGTCTCC 
TCGAAATTGT 
ATCGAAACCT 
TGGTCTCCTT 
TTGTTGCTGC 
CTCTACGTCC 
"CAAGTACCGC 
TTCCGCTATT 
ACGAAGCCGC 
ATGCCTGGTG 
TGCCGA 



PCT/CAOO/01150 

GCTTGTCGAA "20(/ 

GTGAGGAGAC 250 

CGTCGACCTG 300 

CGACACCTGG 350 

TGTCCATTGA 400 

GGTCGTGTTG 450 

TGGTGGCTCT 500 

TCAAGAAGTC 550 

CTCCGAGGTA 600 

CCCCGGATCC 650 

TGACTGAGGC 700 

"CCCCAAATGT 750~ 

TACTAAGTAG 800 

ATCTTTCAGC 850 

ACAACGTCGA 900 
936 



20 .2) INFORMATION FOR SEQ ID NO: 793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793 

TTATTGTTGC TGCTGGTACT 20 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 794 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794 

GACGACAAGT CGGTGAACTT 20 



55 2) INFORMATION FOR SEQ ID NO: 795 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 

10 ACTTGCACGC GATGTGGCAG 20 



2) INFORMATION FOR SEQ ID NO: 796 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 

25 

GGTCCAATGC CWCAAACWAG A 21 



30 2) INFORMATION FOR SEQ ID NO: 797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797 

CATTAAGAAT GGYTTATCTG TSKCTCT 27 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 798 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798 

454 
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PCT/CA00/01150 

22 



5 2) INFORMATION FOR SEQ ID NO: 799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 

20 ACCTGTGAAT ACAAGCAATC T 21 



25 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 800 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800 

GATGAAATCT TCAACGAAGT TGAT 24 

40 

2) INFORMATION FOR SEQ ID NO: 801 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801 

ACAACACCGA GAAGATCCCA 20 
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10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(vi) ORIGINAL SOURCE: 

(A) — ORGANISM:— eryptospor4d-i-um-parvum- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802 
TTGCCATTTC TGGTTTCGTT 20 



20 2) INFORMATION FOR SEQ ID NO: 803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803 

35 ACTTCAGTGG TAACACCAGC 20 



40 



2) INFORMATION FOR SEQ ID NO: 804 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804 

CCTGGGACGG CCTCTGGCAT 20 

55 



2) INFORMATION FOR SEQ ID NO: 805 

60 (i) SEQUENCE CHARACTERISTICS: 

456 
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(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 



15 



CTCTTGTCCA TCTTAGCAGT 20 



2 ) INFORMATION FOR SEQ ID NO: 806 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806 

AGCATCACCA GACTTGATAA G 21 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 807 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807 

AAAGTGGCTT CAAAGGTTGC 20 



55 2) INFORMATION FOR SEQ ID NO: 808 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808 
GCITTAIWRG CATTAGAARA YCCA 



2) INFORMATION FOR SEQ ID NO: 809 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid - 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 
TCTTCCTGTW GCAACTGTTC CTCT 



2) INFORMATION FOR SEQ ID NO:. 810 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 
AGAGMWACAG ATAARSCATT CTTA 



2) INFORMATION FOR SEQ ID NO: 811 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
TRAARTAGAA TTGTGGTCTR TATCC 



2) INFORMATION FOR SEQ ID NO: 812 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
GTIACIGGIT CYTYRARRTT ICCICC 



2) INFORMATION FOR SEQ ID NO: 813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
AATCYGTYGA AATGCAYCAC GA 



2) INFORMATION FOR SEQ ID NO: 814 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
GCIGGCACGT ACACIGCCTG 



2) INFORMATION FOR SEQ ID NO: 815 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 
TGGTGCATYT CKACRGACTT 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 



GCTACGACGA GATCAAGGGC 20 



2) INFORMATION FOR SEQ ID NO: 817 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 
TGGAAGAAGG CCGAGGAGTT 20 

30 



2) INFORMATION FOR SEQ ID NO: 818 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
45 AGCCGGGCTG GATCTTCTTC 20 



2) INFORMATION FOR SEQ ID NO: 819 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 

60 
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5 2) INFORMATION FOR SEQ ID NO: 820 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 

20 GAAGGAGGTG TCTGCTTACA C 21 



2) INFORMATION FOR SEQ ID NO: 821 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821 

GGCGCAAACG TCACCACATC A 21 

40 



2) INFORMATION FOR SEQ ID NO: 822 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



55 



60 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 

CGGCGGATGT CCTTAACAGA A 21 
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2) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xiT SEQUENCE-DESCRIPTION : -SEQ- ID-NO •:- 823- 



15 



GAGCGGTATG AYGAGATTGT 20 



2) INFORMATION FOR SEQ ID NO: 824 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824 
30. GGCTTCTGCG GCACCATGCG 2 0 



35 



2 ) INFORMATION FOR SEQ ID NO: 825 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 

45 

ATGAGCARCG SAACCATCGT TCAGTG 26 



50 2) INFORMATION FOR SEQ ID NO: 826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826 

462 
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TCGATCGTGC CGACCATGTA GAACGC 26 



2) INFORMATION FOR SEQ ID NO: 827 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 6 bases 

10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) — TOPOLOGY i Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 

CACCAACTTG CTAAATGGGG AGATGCCCAG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTCCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 150 

25 GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 200 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 250 

TTATGGCGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 300 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 350 

ATCAAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 400 

30 GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 446 



2) INFORMATION FOR SEQ ID NO: 828 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 

50 CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 200 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

55 CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 300 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 350 

AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 4 00 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 445 



60 
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2) INFORMATION FOR SEQ ID NO: 829 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 44 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 



15 



( vil^RIGINAinSOURCE": 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829 



ATGCTATAGC TAAATGGGGA GACAGCGAAA TAGTTGTTTA CGTTGGATGT 50 

GGAGAACGTG GTAACGAAAT GACAGACGTT CTTAACGAAT TCCCAGAACT 100 

20 TATTGACCCA AAAACTGGGG AAAGTTTAAT GAAGAGAACA GTACTTATAG 150 

CTAATACTTC AAACATGCCA GTTGCTGCTA GAGAAGCTTG CATATACACA 200 

GGTATTACAA TAGCTGAATA CTTCAGAGAT ATGGGATACT CAGTATCTAT 250 

AATGGCTGAT TCAACTTCAA GATGGGCAGA AGCATTAAGA GAAATGTCAG 300 

GTAGACTTGA AGAAATGCCA GGTGATGAAG GATATCCAGC GTACTTAGGA 350 

25 TCAAGACTTG CTGATTATTA CGAAAGAGCA GGTAAGGTTG TTTGTCTAGG 400 

TAAAGATGGT AGAGAAGGTG CTGTAACAGC AATTGGAGCT GTATC 445 



30 2) INFORMATION FOR SEQ ID NO: 830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 



45 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830 



TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC GTTGGATGCG 50 

GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 100 

ATTGACCCTA AGACTGGCGA AAGCTTAATG AAGAGAACAG TTCTTATAGC 150 

TAATACTTCA AACATGCCAG TTGCAGCGAG AGAAGCTTCA ATATATACAG 2 00 

50 GTATCACAAT AGCTGAATAT TTCAGAGATA TGGGATATGC AGTATCAATA 250 

ATGGCTGACT CAACTTCAAG ATGGGCTGAG GCATTAAGAG AAATGTCTGG 300 

TAGACTTGAA GAAATGCCTG GTGATGAAGG ATATCCAGCT TACCTTGGAT 350 

CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAGGTTGA ATGTTTAGGT 400 

AATGATGGAA GAATTGGTTC TATAACAGCA ATCGGTGCGG TATC 444 



2) INFORMATION FOR SEQ ID NO: 831 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 4 56 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831 



TAGTTCAGCA CCAAGTTGCT AAATGGGGAG ATACTGAGAT AGTTGTTTAC 50 

GTTGGATGTG GAGAACGTGG TAACGAGATG ACAGACGTTC TTAACGAATT 100 

15 CCCAGAACTT AAAGACCCTA AAACTGGGGA AAGCTTAATG AAGAGAACAG 150 

TTCTTATTGC TAATACATCT AACATGCCAG TTGCTGCCAG AGAAGCATCA 200 

ATATATACTG GTATAACAAT AGCAGAGTAT TTCAGAGATA TGGGATACTC 250 

AGTATCAATC ATGGCTGACT CAACTTCACG TTGGGCAGAG GCTTTAAGAG 300 

AAATGTCAGG AAGACTTGAA GAAATGCCAG GAGACGAAGG TTACCCAGCA 350 

20 TACTTAGGAT CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAAGTTGT 400 

AGCTTTAGGT AAAGATGGAA GAGAAGGAGC TGTTACAGCT ATCGGAGCAG 450 

TATCCC 456 



25 2 ) INFORMATION FOR SEQ ID NO: 832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 194 06 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 



CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATCCTA AAACCGGGGA ATCTTTAATG AAAAGAACTG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGCTAG AGAAGCATCT ATATATACTG 200 

45 GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGCACTA 250 

ATGGCAGATT CGACTTCTAG ATGGGCAGAG GCTCTAAGAG AAATGTCTGG 300 

3AGACTAGAG GAGATGCCAG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 350 

CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 400 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 

50 



2) INFORMATION FOR SEQ ID NO: 833 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: Genome project 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 



TGAACCAAGG AAAAATAATA ACCGTTTCGG GACCTCTTGT TGTGGCTTCT 50 

10 GGGATGCAAG AAGCTAATAT TCAAGATATT TGTCGTGTGG GACATCTTGG 100 

CTTAGTCGGA GAAATTATTG AAATGCGTCG CGATCAAGCG TCTATTCAGG 150 
TTTATGAGGA" AACATCAGGG^ATCGGTCCAG-GAGAACCAGT-AGTGAeT-AG-T- — — -200 

GGTTGTCCTT TGTCGGTCGA GTTAGGCCCG GGCCTGATTT CAGAAATGTT 250 

TGACGGTATT CAGCGACCGC TTGATCGTTT TCAAAAAGCA ACGGACAGCG 300 

15 ACTTTTTAAT CCGTGGTGTG GCTATCCCAA' GTCTTGATCG AAAGGCTAAG 350 

TGGGCATTTA TTCCCAAGCT AAGTGTTGGT CAAGAAGTAG TTGCAGGTGA 4 00 

TATTTTAGGA ACTGTGCAAG AAACAGCTGT CATTGAGCAC CGTATCATGG 450 

TTCCTTATAA AGTTTCAGGG ACCTTGGTGG CTATTCATGC AGGGGACTTC 500 

ACAGTAACAG ATACAGTTTA TGAAATTAAG CAGGAAGACG GTTCCATTTA 550 

20 CCAAGGTAGC CTCATGCAGA CTTGGCCAGT TCGTCAAAGT CGCCCTGTTG 600 

CTCAAAAGCT TATCCCAGTC GAACCTTTGG TTACAGGTCA ACGGGTTATT 650 

GACACCTTTT TCCCTGTTAC AAAAGGTGGT GCCGCTGCCG TTCCTGGACC 700 

ATTTGGGGCA GGAAAAACAG TTGTGCAGCA TCAAATAGCT AAATTTGCCA 750 

ACGTTGATAT TGTTATTTAT GTCGGTTGTG GGGAACGCGG CAACGAGATG 800 

25 ACCGACGTTT TGAATGAGTT TCCAGAGTTA ATTGACCCAA ATACAGGCCA 850 

GTCCATTATG GAGCGCACGG TGTTAATTGC AAACACCTCT AATATGCCAG 900 

TAGCAGCGCG TGAAGCGTCG ATTTACACAG GTATTACCAT TGCCGAATAT 950 

TTCCGTGATA TGGGCTATTC TGTGGCTATC ATGGCAGACT CGACATCACG 1000 

TTGGGCAGAA GCTCTGCGCG AGATGTCAGG ACGCCTACAA GAAATGCCTG 1050 

30 GTGATGAAGG CTACCCGGCT TACTTAGGGA GTCGTATTGC CGAATATTAT 1100 

GAACGGGCTG GTCGTGTTCG GACCTTGGGA AGTCAAGAAC GTGAGGGAAC 1150 

CATTACAGCC ATCGGCGCGG TTTCTCCTCC TGGAGGGGAT ATTTCAGAGC . 1200 

CTGTCACTCA AAACACCCTT CGGATTGTCA AAGTTTTCTG GGGGCTCGAC 1250 

GCGCCTCTTG CGCAACGGCG TCACTTCCCA GCGATTAACT GGCTGACGTC 13 00 

35 TTATTCATTG TATCAAGATG ATGTAGGAAG CTATATTGAC CGTAAACAGC 1350 

AATCTAATTG GTCCAACAAG GTAACTCGTG CCATGGCTAT TTTGCAGCGT 1400 

GAAGCCAGTC TAGAAGAAAT TGTACGCTTG GTGGGGCTTG ATTCACTGTC 1450 

TGAACAAGAT CGTTTGACCA TGGCTGTTGC CCGGCAAATT CGGGAGGATT 1500 

ATCTCCAGCA AAATGCCTTT GATTCGGTGG ATACCTTTAC TTCCTTTCCG 1550 

40 AAACAAGAGG CCATGCTAAC CAATATTTTG ACCTTTAATG AGGAAGCCAG 1600 

CAAAGCCCTT TCTTTGGGAG CTTATTTTAA TGAGATTATG GAAGGCACTG 1650 

CTCAGGTACG CGATCGCATC GCACGCAGCA AATTTATCCC AGAAGAAAAC 1700 

TTAGAGCAGA TTAAAGGGCT TACTCAGAAG GTTACCAAAG AGATTCACCA 1750 

CGTTTTAGCA AAGGGAGGAA TTTAGATGAG CGTTCT 1786 

45 



2) INFORMATION FOR SEQ ID NO: 834 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 
60 (B) STRAIN: Suarez-3 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 

TATCTCACGT . AAGTTTTTGC GCGGCGGTTA TATATCACTC CAGGCCCTGG 50 

5 CTAAGTACGC TAATACTGAC GTTACTGTCT ATGTGGGATG TGGAGAGCGT 100 

GGAAACGAGA TTGCGGAGGT GCTTAAGGAG TTCCCTGAGC TGAAGACCAA 150 

GGTTGATGGC AAGGAAGTGA GCATTATGAA ACGCACTTGC TTGGTGGCCA 200 

ATACTTCAAA CATGCCAGTG GCCGCCAGGG AGGCTAGTAT CTACACTGGC 250 

ATTACCCTAT GTGAATACTT CAGGGATATG GGATACAACG CCTGTGTGAT 300 

10 GGCGGATTCC ACCAGTCGTT GGGCTGAGGC TTTGCGTGAG ATATCAGGTC 350 

GTTTAGCTGA GATGCCTGCT GATTCAGGTT ATCCCGCCTA CCTTGCTTCT 400 

"AGGCTTTCGG - CGTTCTATGA - GCGTGCTGGT-ACAGeTGAGT— GTATTGGAAG 450- 

ACCACTTCGT GAAGGTTCAG TTACCATTGT TGGTGCTGTA TCTCCACCA 4 99 



15 



2) INFORMATION FOR SEQ ID NO: 835 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 464 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835 

TTTCTCAAGC TTTGAGTAAA TATAGTAACT CTGATGTTAT TATTTACATT 50 

GGTTGTGGAG AAAGAGGAAA TGAAATGGCA GAAGTTCTTA CAGAATTCCC 100 

TGAGCTTTAT ACTATGGTTG ATGGAAAGAA GGAGTCAATT ATGCAAAGAA 150 

35 CTTGTTTAGT AGCTAATACA TCAAATATGC CTGTCGCTGC TAGAGAAGCT 2 00 

TCCATCTACA CTGGTATTAC ACTTTCTGAA TACTTTAGAG ATATGGGATG 250 

TAATGTTTCT ATGATGGCAG ATTCAACTTC TCGTTGGGCT GAAGCTCTTA 3 00 

GAGAAATTTC TGGTAGATTA GCTGAAATGC CTGCAGATTC GGGTTACCCA 350 

GCATATTTAG GCGCCAGACT TGCTTCATTC TATGAAAGAT CAGGAAGAGT 4 00 

40 TAAATGTATG GGTTCCCCAG ATAGAGAAGG TACAGTAACA ATTGTTGGTG 450 

CAGTTTCTCC ACCT 4 64 



45 2) INFORMATION FOR SEQ ID NO: 836 
3) 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 

467 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GCCCTCTCCA 
CGAGCGCGGT 
CGACCGTGAT 
GTGGCGAACA 
CACCGGCATC 
CCATGATGGC 
TCGGGTCGTC 
CAGCGCTCGT 
TCGGCGGGCC 



AGTACTCCAA 
AATGAGATGG 
CGATGGCCGC 
CCTCGAACAT 
ACCCTGGCCG 
TGACTCGACG 
TGGCGGAGAT 
CTCGCCTCCT 
GAAGCGCCAG 



CTCCGATTGC GTCATCTATG TCGGCTGCGG 
CCGAGGTGCT CATGGAGTTC CCGACCCTGA 
GAGGAGTCGA TCATGAAGCG CACCTGCCTC 
GCCAGTCGCA GCCCGTGAGG CCTCTATTTA 
AGTACTACCG TGATATGGGC AAGCATATCG 
TCTCGCTGGG CCGAGGCGCT TCGTGAGATT 
GCCGGCGGAT GGTGGCTACC CCGCCTACCT 
TCTACGAGCG CGCCGGCCTC GTCACCTGCA 
GGCTCCGTCA CGATCGTCGG TGCCGT 



'50 
100 
150 
200 
250 
300 
350 
400 
446 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 837 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837 

TTAGTCAGGC CCTCTCCAAG TACTCCAACT CCGACTGCGT CATCTATGTC 50 

GGCTGCGGCG AGCGCGGTAA TGAGATGGCT GAGGTGCTCA TGGATTTCCC 100 

AACTCTGACG ACCGTGATCG ATGGTCGCGA GGAGTCCATC ATGAAGCGCA 150 

CCTGCCTCGT GGCAAACACT TCGAACATGC CAGTCGCAGC CCGCGAGGCC 200 

TCTATTTACA CCGGCATCAC CCTGGCCGAG TACTACCGTG ATATGGGCAA 250 

GCATATTGCC ATGATGGCCG ACTCGACATC TCGCTGGGCC GAGGCGCTTC 300 

GTGAGATTTC CGGTCGTCTG GCGGAGATGC CAGCCGATGG TGGCTACCCT 350 

GCCTACCTCA GCGCTCGTCT CGCCTCCTTC TACGAGCGCG CCGGCCTCGT 400 

CACCTGCATC GGCGGGCCGA AGCGCCAGGG CTCCGTCACG ATCGTCGGTG 450 

CTGTGT 456 



2) INFORMATION FOR SEQ ID NO: 838 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Leishmania tarentolae 

(B) STRAIN: MOU-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838 

AGGCCCTCTC CAAGTACTCC AACTCCGACT GCGTCATCTA CGTCGGCTGC 50 
GGCGAGCGCG GTAATGAGAT GGCCGAGGTG CTCATGGAGT TCCCGACCCT 100 
GACGACTGTG ATTGATGGCC GTGAGGAGTC GATCATGAAG CGGACCTGCC 150 



468 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TCGTGGCCAA CACCTCCAAC 

TACACTGGTA TCACCCTGGC 

CGCCATGATG GCTGACTCGA 

TTTCGGGTCG CCTGGCGGAG 

5 CTTAGTGCTC GTCTTGCCTC 

CATCGGTGGG CCGAAGCGCC 



ATGCCAGTCG CAGCCCGTGA 
CGAATACTAC CGTGATATGG 
CGTCTCGCTG GGCGGAGGCG 
ATGCCGGCTG ATGGTGGGTA 
CTTCTACGAG CGTGCCGGCC 
AGGGCTCCGT CACGATCGTC 



PCT/CA00/01150 



A6CCTCTATT 200 

GCAAGCATAT 250 

CTTCGTGAGA 300 

CCCCGCCTAC 350 

TCGTCACCTG 400 

GGTGCCGTGT 450 



10 2) INFORMATION FOR SEQ ID NO: 839 



25 



45 



(i) SEQUENCE CHARACTERTSTTCST 

(A) LENGTH: 437 bases 

(B) TYPE: Nucleic acid 
15. (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:- Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: EATRO 795 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839 



GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG 50 

TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACTTT CCCACCCTCA 100 

CCACCATTAT TGATGGACGT GAGGAGTCTA TCATGAAGCG CACATGCCTG 150 

GTGGCAAATA CTTCCAATAT GCCTGTTGCT GCTCGTGAGG CATCTATTTA 200 

30 CACTGGTATC ACCTTAGCTG AGTATTATCG GGATATGGGA AAACACATCG 2 50 

CCATGATGGC CGATTCAACC TCCCGCTGGG CTGAGGCTTT GCGTGAGATT 3 00 

TCCGGTCGTC TTGCTGAAAT GCCTGCAGAT GGAGGTTATC CCGCGTACCT 350 

CAGCGCCCGT TTGGCCTCCT TCTACGAGCG TGCTGGCCGT GTGACATGCA 4 00 

TCGGTGGGCC GAAGCGTGAA GGGTCTGTAA CAATTGT 437 

35 



2) INFORMATION FOR SEQ ID NO: 840 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 
50 (B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION :> SEQ ID NO: 840 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

55 TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 3 00 

60 CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

469 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC 
GCAGGGCGAC AACATGATTG ACAAGTCGGA 
GCCCCACGCT GCTGGAGGCA CTCGACATGC 
AGCGACAAGC CGCTGCGCCT GCCGCTGCAG 
5 TATCGGCACC GTGCCGGTCG GTCGCGTGGA 
GCGACGTGGT GACGTTTGCG CCCGCCAACG 
ATTGAGATGC ACCACGAGCA GCTGGCCGAG 
CGGCTTCAAC GTGAAGAACG TGTCCGTGAA 
TGTGCGGCAA CTCGAAGAAC GACCCCCCAA 
10 GCGCAGGTGA TCATCCTGAA CCACCCCGGC 
GCCGGTGCTC G ACTGCCACA CCTGCCACAT 

tcgagtccaOsmcgaccgc^cgctccggca- 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG 
GCCCATGTGC GTGGAGGTGT TCAACGACTA 
15 CC 



PCT/CA00/01150 

TTCATCCCCA TCTCCGGCTG A00 

AAATATGCCG TGGTACAAGG 450 

TGGAGCCCCC GGTGCGCCCC 500 

GACGTGTACA AGATCGGCGG 550 

GACGGGCACG ATGAAGCCCG 600 

TGACGACGGA GGTGAAGTCG 650 

GCCACGCCCG GCGACAACGT 700 

GGACATCCGC CGTGGCAACG 750 

AGGAGGCGGC CGACTTCACG 800 

CAGATCGGCA ACGGCTATGC 850 

CGCGTGCAAG TTCGCCGAGA 900 

-AGGAGCTTGA-GAAGAACCCC 950 

GTGCGCATGG TGCCGCAGAA 1000 

CGCTCCTCTT GGCCGCTTTG 1050 

1052 



2) INFORMATION FOR SEQ ID NO: 841 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN : CGL-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 

35 TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

40 GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 3 00 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 400 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

45 AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

50 TGTGCGGCAA CTCGAAGAAC GATCCCCCAA AGGAGGCGGC CGACTTCACG 800 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTT GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

55 GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CCGTGCGTGA C 1061 



60 2) INFORMATION FOR SEQ ID NO: 842 

470 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 
(B ) STRAIN;- -PCU-1 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842 



GCTGAAGGCT GAGCGCGAGC GCGGCATCAC GATCGACATC GCGCTCTGGA 50 

AGTTCGAGTC GCCCAAGTCT GTGTTCACGA TCATCGACGC CCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCTCAGG CGGACGCCGC 150 

CGTCCTTGTC ATTGCGTCAT CGCAGGGTGA GTTTGAGGCG GGCATCTCGA 200 

20 AGGACGGCCA GACACGCGAG CACGCGCTGC TCGCCTTCAC GCTCGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGT CGGTGAACTT 300 

CGCCCAGGAG CGCTACGATG AGATTGTGAA GGAGGTGTCG GCGTACCTGA 350 

AGAAGGTTGG GTACAACGTG GAGAAGGTGC GCTTCATCCC CATCTCCGGC 400 

TGGCAGGGCG ACAACATGAT TGACAAGTCG GAAAATATGC CGTGGTACAA 450 

25 GGGCCCCACG CTGCTGGAGG CACTCGACAT GCTGGAGCCC CCGGTGCGCC 500 

CCAGCGACAA GCCGCTGCGC CTGCCGCTGC AGGACGTGTA CAAGATCGGC 550 

GGTATCGGCA CCGTGCCGGT CGGTCGCGTG GAGACGGGCA CGATGAAGCC 600 

CGGCGACGTG GTGACGTTTG CGCCCGCCAA CGTGACGACG GAGGTGAAGT 650 

CGATTGAGAT GCACCACGAG CAGCTGGCCG AGGCCACGCC CGGCGACAAC 700 

30 GTCGGCTTCA ACGTGAAGAA CGTGTCCGTG AAGGACATCC GCCGTGGCAA 750 

CGTGTGCGGC AACTCGAAGA ACGACCCCCC AAAGGAGGCG GCCGACTTCA 800 

CGGCGCAGGT GATCATCCTG AACCACCCCG GCCAGATCGG CAACGGCTAT 850 

GCGCCGGTGC TCGACTGCCA CACCTGCCAC ATCGCGTGCA AGTTCGCCGA 900 

GATCGAGTCC AAGATCGACC GCCGCTCCGG CAAGGAGCTT GAGAAGAACC 950 

35 CCAAGTCGAT CAAGTCCGGT GACGCCGCCA TGGTGCGCAT GGTGCCGCAG 1000 

AAGCCCATGT GCGTGGAGGT GTTCAACGAC TACGCTCCTC TTGGCCGCTT 1050 

TGCCGTGCGT GA 1062 



40 



50 



2) INFORMATION FOR SEQ ID NO: 843 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1057 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 

TACATTGACA GCTGCATTGA CCAAGGTCTG TTCGATGGGT GGTCATGGCG 50 

AGTACACTCC TTATGAAGCA ATTGACCGTG CTCCTGAGGA GCGTAAACGT 100 

GGTATTACTA TAAATTCGAC ACATGTTGAA TATGAGACTA AGAACCGTCA 150 

60 TTACGGTCAC GTGGACTGTC CAGGTCACTC TGATTATGTG AAGAACATGA 200 

471 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TATCTGGCGC TGCTCAGATG GATGGTGCCA TATTGGTTGT TTCT*FGTGTT 2^0 
GACGGTCCCA TGCCTCAGAC TAAGGAGCAC GTGTTGCTTG CTAAGCAGAT 300 
TGGTGTACCT CGTTTAGTTG TGTTTTTGAA CAAGCTTGAC ATGTTAGAGG 350 
ACTCTGAGCT ATTGGAGTTG GTGGAGTTAG AGGTTCGTGA GTTATTGAGT 4 00 

5 GAGTTTGGTT ACGACGGTGA CAACACGCCT ATCGTTCGTG GCAGTGCTAT 450 
AAAGGCATTG AACAGTAGTT CCGAGGCTGA CATTAAGCCA ATTCAGGATT 500 
TATTGGATGC GTGTGATGCC TTTTTACTGA CTCCAGAACG TAAGGATGAC 550 
ATGCCGCTCT TGGTTGCTAT TGACGATGTT CTTGCCATTC CTGGCAAGGG 600 
TACTGTTGTA ACCGGTAGGA TAGAGCAGGG CAAGATTCGT TGTGGTGACC 650 

10 CTATTGAGGT TTGCGCCGGT CCGAAGTCCG GCAAGAAGAC TGTGTGTGTT 700 
GGTCTTRARA TGTTCCGCAA GAGTCTCAGT RAGGGTATTG CTGGTGACCA 750 

GATTGGTGTT _ TTGCTCAAGG _ GTGTGAAGCG~eGAGRAGGTA-GAGGGGGGGT 800- 

TTGTATTGAT TCAACCCGGA AGTTACAAAT GTCACGGTGA ATTTGATGCT 850 
GACTTGTACG TGTTGACTAC GRAGGAAGGT GGGCGCAAGC ATCCGTTTGT 900 

15 GTCTAACTAC CGTCCTCAGG CGTTTATACG TACTGGAGAC GTTTGCTGCT 950 

CAGTTCATTT GGATRAGGGT GTTGAGATGG CAGCTCCTGG TGACAACGTG 1000 

CGTTGCAAGA TCAAGTTACT TTATCCCATG CCTGTCCATG AAGGTTTACG 1050 

ATTTGCG 1 1057 



20 



35 



2) INFORMATION FOR SEQ ID NO: 844 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

40 TTCATCAACA AAGTGGATAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 2 00 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

45 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 4 00 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 4 50 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CAACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

50 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACCGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGGGA GGAGCTGAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

55 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTCGCGTAC CCGATGCCCA TTGAAAAGGG CCTGAAGTTC ACC 943 



60 2) INFORMATION FOR SEQ ID NO: 84 5 

472 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

( B ) STRAIN :— ATCC^50 1 31 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CTTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

20 CGCCCATCGT GCGCGGCTCG GCCCTCAAAG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

25 CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

30 CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGACAAGGG TCTGAAGTT 939 

35 



2) INFORMATION FOR SEQ ID NO: 846 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 
50 (B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846 

CATTGTGGTG GCGGCCACCG ACGGCGTCAT GCCGCAGACA CGCGAGCACC 50 

55 TCCTGATCTG CTCGCAGATC GGGCTTCCGG CGCTCGTAGG GTTCATCAAC 100 

AAGGTGGACA TGACGGACGA GGACACGTGC GACCTGGTGG ACATGGAGCT 150 

GCGCGAGCAG CTGGAGAAAT ACAAGTTTCC GGCGGAGGAG ACGCCAATCG 200 

TGCGCGGCTC AGCCCTCAAA GCCGTCGAGG GCGATGCGAA GTACGAGGAG 250 

AACATTCTCG AACTGGTGCG GAAGTGTGAC GAGTGGATCC CTGACCCGCC 300 

60 GCGCAACACA GACAAGCCTT TCCTTATGGC CATCGAGCAC GTTTACGAGA 350 

473 



SUBSTITUTE SHEET (RULE 26) 
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TCGGCAAGGA CAAGAAGAGC GTTGTCGTGA CCGGCCGCGT CGATckGGGC 40# 

ATTCTGAAGC TCAACACAGA CGCCGAGCTG GCCGGCTTCA GCTCCAAGAA 450 

GTCGACGGTG AGGGTGACGG GCATCGAGAT GTACCACAAG ACGCTGAGCG 500 

AGTGCATGCC TGGTGACTCC GTCGGCGTCA GCATTGTCGG GACCGGCGAC 550 

5 ACGACCAGTC TATCCAAGGG CAACGTGGAA CGCGGCATGG TGATGGCGGC 600 

GACGGGTAGC ACGAACCTGT ACAACAAGGT GAAGGCGCAG GTGTACGTGC 650 

TGACGAAGGA TGAGGGCGGC CGCCACACTG GCTTTAGTCC TCACTACCGC 700 

CCGCAGCTCT TCTTCCATTG TGCTGACGTG ACGGCGGACA TGAGCTTCCC 750 

GGAGGCGGAG AAGCACCGCG AAGAGCTCAA CAAGAAATTC GGCCGCGGCC 800 

10 CCGAGGAGGA CAAGAAGAAA GAGGCAGCGA TGAAGGAGTT CGAGAGCAAG 850 

CTCGTCTGCA TGCCGGGCGA TAACCGCGAG CTGATCCTGA CGCTGGCGTA 900 

CCCGATGCCC "^ATTGAAAAGG~GTCTGAAGTT-CAeeATGeGT-GAGGG 945- 



15 

2) INFORMATION FOR SEQ ID NO: 84 7 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 9 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

GCACCTCCTG ATCTGCTCGC AGATCGGGCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAGGT GGACATGACG GACGAGGACA CGTGCGACCT GGTGGACATG 150 

35 GAGCTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACGCC 200 

AATCGTGCGC GGCTCAGCCC TCAAAGCCGT CGAGGGCGAT GCGAAGTACG 250 

AGGAGAACAT TCTCGAACTG GTGCGGAAGT GTGACGAGTG GATCCCTGAC 300 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTTGT CGTGACCGGC CGCGTCGATC 4 00 

40 AGGGCGTTCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCTCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATC GAGATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCTGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTATCC AAGGGCAACG TGGAACGCGG CATGGTGATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

45 CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTT AGTCCTCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGTGCTG ACGTGACGGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAAGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC AGCGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

50 GCGTACCCGA TGCCCATTGA AAAGGGTCTG AAGTTCACC 939 



2) INFORMATION FOR SEQ ID NO: 848 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

474 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848 

10 CACCGACGGC GTCATGCCGC AGACACGGGA GCACCTGCTC ATCTGCTCGC 50 

AGATCGGGCT GCCGGCGCTT GTAGGGTTCA TCAATAAAGT TGACATGACG 100 

GACGAGGATA-CGTGCGACCT-CGTGGACATG-GAGGTGeGGG-AACAGG-TGGA 1-50 

GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 2 00 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

15 GTGCGAAAGT GCGACGAGTG GATCCCCGAO CCGCCGCGCA ACACCGAAAA 300 

GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 4 00 

ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

GACGGGCATC GAGATGTATC ACAAGACACT CAATGAGTGC ATGCCCGGCG 500 

20 ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTGACACGAC CAGCTTATCC 550 

AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

GTGGCCGCCA CACCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 700 

CACTGCGCTG ACGTGACCGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

25 CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 800 

AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 850 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 933 



30 



45 



2) INFORMATION FOR SEQ ID NO: 849 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 9 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

50 TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

55 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 4 00 

GATCAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

60 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

475 
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TGTACGTGCT GACGAAGGAT GAGGGCGGCC 

CACTACCGCC CGCAGCTCTT CTTCCATTGC 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG 

5 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT 

GCTGGCGTAC CCGATGCCCA TTGAAAAGGG 



PCT/CAOO/01150 

GCCACACTGG CTTCA&CCCC 1(5 (/ 

GCTGACGTGA CAGCGGACAT 750 

GGAGCTCAAC AAGAAATTCG 800 

AGGCGGAGAT GAAGGAGTTC 850 

AACCGCGAGC TGATCCTGAC 900 

TCTGAAGTTC ACC 943 



10 2) INFORMATION FOR SEQ ID NO: 850 



~~ (T}~ SEQUENCE-CHARACTER-ISTTCS : 

(A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 1 Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 



GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 200 

30 AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 250 

AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 300 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTGCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCGCC 450 

35 AAGAAGTCGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 700 

40 ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

GCGTACCCGA TGCCCATT , 918 

45 



2) INFORMATION FOR SEQ ID NO: 851 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
60 (B) STRAIN: ATCC 50156 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 
GCGAGCATCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 
TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 
CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 
CGCCCATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 
TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 
CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 
TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 
_ GATCAGGGCG _ TGCTGAAGCT-eAAeAeAGAe-GGGGAGGTGG- CGGGGT-TCAG 
CGTCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 
CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 
ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 
AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 
TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 
CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 
GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 
GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 
GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 
GCTGGCGTAC CCGATGCCCA TTGAGAAGGG TCTGAAGTT 



50 
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2) INFORMATION FOR SEQ ID NO: 852 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852 

TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGGAGCAC 50 

CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 100 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

TGCGTGAGCA GCTGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 200 

GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 250 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 300 

CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 350 

ATCGGCAAGG ATAGGAAAAG CGTCATCGTA ACCGGCCGCG TCGATCAAGG 400 

TGTGCTGAAG CTGAACACAG ACGCCGAGCT GGCCGGCTTC AGCGCCAAGA 450 

AGTCGACGGT GAAAGTGACG GGCATTGAGA TGTACCACAA GACACTGACA 500 

GAGTGCATGC CCGGCGACTC TGTCGGCGTC AGCATTGTGG GCACTGGYGA 550 

CACGACCAGC CTCTCTAAGG ACAATGTTGA GCGTGGCATG GTACTGGCCG 600 

CTACGGGTAG CACGAACCTG TACAACAAAG TAAAGGCGCA GGTGTATGTA 650 

CTCACGAAGG ATGAGGGCGG CCGCCACACC GGCTTCAGCC CCCACTACCG 700 

TCCGCAGCTC TTCTTCCACT GCGCTGACGT AACGGCGGAC ATGAGCTTCC 750 

CGGAGGCGGA GAAGCACCGC GAGGAACTCA ATAAGAAATT CGGCCGCGGC 800 

CCCGAGGAGG ACAAGAAAAA GGAGGCGGAG ATGAAGGAGT TCGAGAGCAA 850 

GCTGGTCTGC ATGCCAGGCG ATAACCGCGA GCTGATCCTG ACATTGGCGT 900 

ACCCGATGCC TA 912 
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2) INFORMATION FOR SEQ ID NO: 853 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 



40 



55 



(i i)~MOLECULE~TYPET"Genom±c — DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853 

20 ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 2 00 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 250 

25 ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 3 00 

GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 350 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 4 00 

ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 4 50 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT ACCATAAAAT 500 

30 TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 600 

CTCTCCGCAC CAGGTGCAAC GACACTGTTC AACAAGGTCC GCGCGCAGGT 650 

GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 700 

ACTATCGTCC GCAGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 750 

35 AACTTCCCGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 800 

CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 850 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 900 

CTTGCCTATC CAATGCCAAT GGAAAAGGGA CTCAAG 936 



2) INFORMATION FOR SEQ ID NO: 854 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 934 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 



CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 

AGCACCTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 

60 ATCAATAAGT GTGACATGAT GCAAGGGCAG GAGGAAATGA TTGAGCTTGT 150 
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TGAAATGGAG GTACGTGAAC TTTTGGAGAA GTACAAGTTC CCTGCGG&GG 200 

AGACGCCATT TGTGCGGGGG TCTGCGGTGA AGGCATTGGA GGGTGATGCT 250 

GAAAATGAAG GAAAGATTTT GGAGCTTGTA AAAAAATGTG ATGAATGGAT 300 

TCCCGACCCA CCGCGTGCCA TTGAAAAACC GTTCCTTATG GCCATTGAGC 350 

5 ACGTTTTTGA GGTTGGAAAG GATAAGAAGG CCGTTGTTGT GAGCGGGCGT 400 

GTGGACCAGG GGCAGTTGAA GGTCGGCGCA GATGCAGAAC TTTCCGGGTT 450 

TAGTGCAAAG AAGCTGACGG TGAAGGTTGC TAGCATCGAA ATGTACCATA 500 

AAATTCTGGA GGATTGCATG CCTGGTGACT CTATTGGCGC GAAGATCGTT 550 

GGCAGCGGTG AAACAGTGAA CCTGTCGAAG GAAAATGTGG AACGCGGCAT 600 

10 GGTACTCTCC GCACCAGGTG CAACGACACT GTTCAACAGG GTCCGCGCGC 650 

AGGTGTACGT GTTGACAAAG GAAGAAGGCG GTCGTCACAC AGCCTTTAGT 700 



CCTCACTATC GTCCGCAGCT TTTCTTCCAC TGTGCTGATG TCACGGCGGA 750" 

TATTAACTTC CCGGAAAGCG AGAAGCTTGC AGGGGAGCTG AACAAAAAGT 800 

ATGGCCGTGA CGCGGCGGAA CAGAAGAAGA AGGAGGCAGA ACTGAAAGAG 850 

15 TTTGAAAAGA CGCTTGTCTG CATGCCTGGT GATAACCGCG AACTCCTGCT 900 

CACCGTTGCC TATCCAATGC CAATGGAAAA GGGA 934 



20 2) INFORMATION FOR SEQ ID NO: 855 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 7 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855 

35 

TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 100 

TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 200 

40 AGGAGACGCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 300 

GATTCCCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 350 

AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGCCGTTGT TGTGAGCGGG 4 00 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 450 

45 GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 500 

ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 600 

CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

CGCAGGTGTA CGTGTTGACA AAGGAAGAAG GCGGTCGTCA CACAGCCTTT 700 

50 AGTCCTCACT ATCGTCCGCA GCTTTTCTTC CACTGTGCTG ATGTCACGGC 750 

AGATATTAAC TTCCCGGAAA GCGAGAAGCT TGCAGGGGAG CTGAACAAAA 800 

AGTATGGCCG TGATGCGGCG GAACAGAAGA AGAAGGAGGC AGAACTGAAA 850 

GAGTTTGAAA AGACGCTTGT CTGCATGCCT GGTGATAACC GCGAACTCCT 900 

GCTCACCCTT GCCTATCCAA TGCCAATGGA AAAGGGA 937 

55 



2) INFORMATION FOR SEQ ID NO: 856 

60 (i) SEQUENCE CHARACTERISTICS: 

479 
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WO 01/23604 PCT/CA00/01150 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 
10 (B) STRAIN: Suarez-2 



(xi) SEQUENCE'DES CRT PTION— SEQ-ID-NO:- 856" 



CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 50 

15 CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

ACGAGATGAT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GCCAGATGAA 200 

CGAGCCGCCA GGTGCCAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 250 

CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 300 

20 AACATCTACC GTTTCACCCA GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 350 

GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC ACCGACCTTG 400 

GCGCGCTGCA GGAGCGTATC ACGACGACCA ACAAGGGCTC CATCACCTCC 450 

GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 500 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

25 CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCATCGCCA 700 

TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTTCGT CGTCGCGCGT 750 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 800 

30 ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCATCAGCG 850 

GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 888 



35 2) INFORMATION FOR SEQ ID NO: 857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 



TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 200 

55 GAACCACCAG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 250 

CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 300 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 350 

AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 400 

AGCACTCCAG GAACGCATTA CTACAACCAA CAAGGGTTCG ATTACATCAG 450 

60 TCCAGGCAGT ATACGTCCCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 
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ACCACTTTCT CGCACTTGGA 
GGAGTTGGGT ATTTACCCTG 
TGCTGTCGGC CAACATTGTA 
GTACAGAAAA TTTTACAGGA 
TCTGGGTATG GACGAGCTGT 
CCCGTAAGGT ACAGCGTTTC 
TTCACCGGCA AGCCTGGGAG 
TGTCAAGGAA ATCTGGAAGG 



TGCCACTACA GTGCTTTCTC GTTCAATTGC 
CGGTCGACCC GCTTGACTCA ACGTCACGTA 
GGACAGGAAC AGTACGATGC CGCACGTGGT 
CTACAAATCA CTGCAGGATA TCATTGCCAT 
CTGAGCAGGA CAAGTTCGTT GTAGCACGCG 
CTGTCTCAGC CGTTCCAAGT GGCTGAGGTG 
GTTCGTTGAA CTACAGGATA CCATCAGCGG 
TGAGTGTGAC GATA 
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"2TINFORHATI ON" FOR-SEQ- ID- NO: ~8 5 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double ? 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858 

TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 5 0 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT ACCATGAAAT 10 0 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 150 

ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 200 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 250 

TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 3 00 

ACCGATTTAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 350 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 4 00 

ACAAGAACGA ATTACTACGA CCAAATCTGG TTCAATCACC AGTGTACAAG 4 50 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 500 

TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGCTGAACT 550 

CGGTATTTAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT GAACACTATA GTGTAGCCCG TTCCGTGCAG 650 

AAGATACTGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 700 

TATGGATGAA CTGTCGGAAC AAGACAAAAA TATAGTAGCC CGAGCAAGGA 750 

AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTTACT 800 

GGTAAACCGG GAAGATTTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 



2) INFORMATION FOR SEQ ID NO: 859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859 

5 TGACGGCGCT GGACGTGACG GAGGACCTCG GCCGCGATGA GCCGCTGACG 50 

CTGGAGATCG TGCAGCACTT GGATGCGAAC ACCGGCCGCT GCATTGCGAT 100 

GCAGACGACG GACCTGCTGA AGCTGAAGTC GAAGGTTGTG TCGACCGGCG 150 

GCAACATCTC CGTGCCGGTG GGCCGCGAGA CACTGGGCCG CATCTTCAAC 200 

GTGCTGGGCG ACGCGATTGA CCACCGCGGC CCCGTGTGCG AGAAGATGCG 250 

10 CATGGCGATC CACGCCGAGG CGCCGAAGCT GGCGGACCAG GCTGCGGAGG 300 

ACACGATCCT GACGACCGGC ATCAAGGTGA TCGACCTGAT TCTGCCCTAC 350 

"~TGCAAGGGCG~GCAAGATCGG""CCTGTTCGGC~GGTGCCGGTG-TGGGeAAGAG 400- 

TGTGATCATC ATGGAGCTGA TCAACAACGT CGCGAAGGGG CACGGCGGCT 4 50 

TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA CGCGCGAGGG CACGGACCTG 500 

15 TACCTGGAGA TGATGCAGTC AAAGGTGATT GACCTGAAGG GCGAGTCGAA 550 

GTGCGTGCTT GTGTACGGGC AGATGAACGA GCCCCCGGGT GCGCGCGCGC 600 

GCGTTGCGCA GTCTGCGCTG ACGATGGCCG AGTACTTCCG CGACGTGGAG 650 

GGCCAGAACG TGCTGCTGTT CATCGACAAC ATCTTCCGCT TCACGCAGGC 700 

GAACTCCGAG GTGTCTGCGC TGCTGGGCCG CATCCCGGCC GCCGTGGGTT 750 

20 ACCAGCCGAC GCTTGCGGAG GATCTTGGCA TGCTGCAGGA GCGCATTACG 800 

TCGACGACGA AGGGATCGAT TACGTCTGTG CAGGCTGTGT ACGTGCCTGC 850 

GGATGATATC ACGGACCCCG CGCCCGCGAC GACGTTCTCG CACCTGGACG 900 

CGACGACTGT GCTGGACCGC GCGGTGGCGG AGTCGGGCAT CTACCCTGCC 950 

GTGAACCCGC TGGAGTGCGC GTCGCGCATC ATGGACCCCG ATGTGATCGA 1000 

25 CGTGGACCAT TACAACGTTG CACAGGATAT CGTCCAGATG CTGACCAAGT 1050 

ACAAGGAGCT GCAGGACATC ATTGCGGTGC TTGGCATCGA CGAGCTGAGC 1100 

GAGGAGGACA AGGTCGTGGT GGACCGCGCG CGCAAGGTGA CGCGGTTCCT 1150 

GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT CACCGGCATG ACGGGCCACT 1200 

ACGTGCAGCT GAGCGACACG GTGGAGTCGT TCTCTGGCCT GCTGATGGGG 1250 

30 TCGTA 1255 



2) INFORMATION FOR SEQ ID NO: 860 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860 

50 CTCGGAGGGC GTCCCGCCCG TGCTGACGGC GCTGGATGTG ACGGAGGACC 50 

TTGGCCGCGA TGAGCCGCTG ACGCTGGAGA TCGTGCAGCA CCTGGACGCG 100 

AACACCGGCC GCTGCATTGC GATGCAGACG ACGGACCTGC TGAAGCTGAA 150 

GTCGAAGGTT GTGTCGACCG GCGGCAACAT CTCTGTGCCG GTGGGCCGTG 2 00 

AGACGCTGGG CCGCATCTTC AACGTGCTGG GCGACGCGAT CGACCAGCGC 250 

55 GGCCCCGTGG GTGAGAAGAT GCGCATGGCG ATCCACGCCG AGGCCCCGAA 300 

GCTGGCGGAT CAGGCCGCGG AGGACACGAT CCTGACGACC GGCATCAAGG 350 

TGATCGACCT GATTCTGCCC TACTGCAAGG GTGGCAAGAT CGGCCTGTTT 4 00 

GGCGGCGCCG GTGTGGGCAA GACCGTGATC ATCATGGAGC TGATTAACAA 450 

CGTCGCGAAG GGCCACGGTG GTTTCTCGGT GTTTGCCGGC GTTGGCGAGC 500 

60 GCACGCGCGA GGGCACGGAC CTGTACCTGG AGATGATGCA GTCGAAGGTG 550 

482 
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ATTGACCTGA AGGGCGAGTC GAAGTGCGTG CTTGTGTACG GGCAGAT&AA " 600 

CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC GCAGTCTGCG CTGACGATGG 650 

CGGAGTACTT CCGAGACGTG GAGGGCCAGA ATGTGCTGCT GTTCATCGAC 700 

AACATCTTCC GCTTCACGCA GGCGAACTCC GAGGTCTCTG CGCTGCTGGG 750 

5 CCGCATTCCG GCCGCCGTGG GCTACCAGCC GACGCTTGCG GAGGATCTTG 800 

GTATGCTGCA GGAGCGCATC ACGTCGACGA CGAAGGGGTC GATCACGTCC 850 

GTGCAGGCCG TGTACGTGCC TGCGGATGAT ATCACGGATC CGGCGCCCGC 900 

GACGACGTTC TCGCACCTGG ACGCGACGAC TGTGCTGGAC CGCGCGGTGG 950 

CGGAGTCGGG GATCTACCCT GCCGTGAACC CGCTGGAGTG CGCGTCGCGT 1000 

10 ATCATGGACC CCGATGTGAT CGACGTGGAC CACTACAACG TTGCGCAGGA 1050 

TATCGTGCAG ATGCTGACCA AGTACAAGGA GCTGCAGGAT ATCATTGCGG 1100 

— TGCTTGGTAT^CGACGAGCTG~~AGCGAGGAGG-"AGAAGGTCGT~GGTGGACCGC 1 -1-5-0- 

GCGCGCAAGG TGACCCGGTT CCTGTCGCAG CCGTTCCAGG TTGCGGAGGT 1200 

GTTCACGGGC ATGACGGGCC AC 1222 

15 



2) INFORMATION FOR SEQ ID NO: 861 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1246 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
30 (B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861 

CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

35 CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACCG GCCGCTGCAT 100 

TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGTCGA 150 

CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 200 

TTCAACGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG TGGGCGAGAA 250 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 300 

4 0 CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 350 

CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG 4 00 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 450 

GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 500 

GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC TGAAGGGCGA 550 

45 GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 

GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA CTTCCGCGAC 650 

GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 700 

GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 

TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT GCAGGAGCGC 800 

50 ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG CCGTGTACGT 850 

GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG TTCTCGCACC 900 

TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC GGGCATCTAC 950 

CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG ACCCCGATGT 1000 

GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG CAGATGCTGA 1050 

55 CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG CATCGACGAG 1100 

CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA AGGTGACCCG 1150 

GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG GGCATGACGG 1200 

GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC TGGCCT 1246 
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2) INFORMATION FOR SEQ ID NO: 862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 

GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 100 

GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 200 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 250 

TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 300 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 350 

CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 400 

CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 450 

AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG 500 

CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 600 

CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT 700 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750 

CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 800 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 850 

CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 1200 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

TGGCCTGCTG ATGGG 1265 



2) INFORMATION FOR SEQ ID NO: 863 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 

484 
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ATGGCAAAAG GCAAGTTTGA ACGTACCAAG CCGCACGTGA ACGTGGGTAC 50 

GATTGGTCAC GTTGACCACG GCAAAACGAC GTTGACGGCG GCGATCACGA 100 

CGGTGCTGTC GAACAAGTTC GGCGGCGAGG CTCGCGGCTA CGACCAGATT 150 

5 GACGCGGCGC CGGAAGAGAA GGCGCGTGGG ATCACGATCA ACACCTCGCA 200 

CGTTGAGTAC GAGACGGAGA CGCGTCACTA CGCGCACGTT GATTGCCCGG 250 

GTCACGCTGA CTACGTGAAG AACATGATCA CGGGTGCTGC GCAGATGGAC 300 

GGCGCGATCC TGGTGGTGTC GGCCGCAGAC GGCCCGATGC CGCAGACGCG 350 

CGAGCACATT TTGCTGTCGC GCCAGGTTGG CGTGCCGTAC ATCATCGTGT 400 

10 TCCTGAACAA GGCGGACATG GTTGATGACG CGGAGCTGCT CGAGCTGGTG 450 

GAGATGGAAG TCCGCGAACT GCTGAGCAAG TACGATTTCC CGGGCGATGA 500 

CACGC'CGATC-GTGAATGGTG- CGGCCAAGGT-GGGGCTGGAA— AGGGAGAAGG 550 

GCGACCTGGG CGAGCAGGCG ATTCTGTCGC TGGCGCAAGC GCTGGACACG 600 

TACATTCCGA CGCCGGAGCG CGCGGTCGAC GGTGCGTTCC TGATGCCGGT 650 

15 GGAAGACGTG TTCTCGATCT CGGGCCGTGG CACGGTGGTG ACTGGCCGTA 700 

TCGAGCGCGG CGTGGTGAAG GTTGGCGAGG AAATCGAAAT CGTGGGCATC 750 

AAGCCGACGG TGAAGACGAC CTGCACGGGC GTGGAGATGT TCCGCAAGCT 800 

GCTGGACCAG GGCCAGGCGG GCGACAACGT GGGTATCTTG CTGCGCGGCA 850 

CCAAGCGTGA AGACGTCGAG CGTGGCCAGG TGCTGGCCAA GCCGGGTTCG 900 

20 ATCAACCCGC ACACGGACTT CACGGCCGAG GTGTACATTC TGTCCAAGGA 950 

AGAGGGTGGC CGTCACACGC CGTTCTTCAA CGGCTATCGT CCGCAGTTCT 1000 

ACTTCCGCAC GACGGACGTG ACCGGCACGA TCGACCTGCC GGCGGACAAG 1050 

GAAATGGTGC TGCCGGGCGA CAACGTGTCG ATGACCGTCA AGCTGCTGGC 1100 

CCCGATCGCC ATGGAAGAAG GTCTGCGTTT CGCCATCCGT GAAGGCGGTC 1150 

25 GTACCGTCGG TGCCGGCGTC GTCGCCAAGA TCATCAAGTA A 1191 



2) INFORMATION FOR SEQ ID NO: 864 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: LVH/75/USAMRU-K/18 

(C) ACCESSION NUMBER: extracted from U10562 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 



ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 50 

TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 

AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 200 

50 CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 300 

AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

CTCTGCGCAG GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGACAGACCC 400 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

55 TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 500 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGCGACAAC 600 

ATGATTGAGA AATCCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 650 

GGAGGCACTA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 700 

60 TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

485 
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CCCGTTGGTC GTGTGGAGAC CGGCGTGATG AAGCCTGGTG ATGTGGYGAC 800/ 

GTTTGCCCCC GCCAACGTGA CGACCGAGGT GAAATCGATC GAGATGCACC 850 
ACGAGCAGCT CGCTGAGGCG ACCCCCGGTG ACAACGTCGG CTTTAACGTG 900 
AAGAACGTTT CTGTAAAGGA CATCCGCCGT GGCAACGTCT GCGGTAACAC 950 

5 CAAGAACGAC CCCCCAAAGG AGGCCGCCGA CTTCACGGCA CAGGTGATCA 1000 

TCCTGAACCA CCCCGGACAG ATTGGAAACG GTTATGCGCC CGTGCTGGAC 1050 

TGCCACACAT CGCACATTGC CTGCAAGTTC GCGGAGATCG AGTCGAAGAT 1100 

CGACCGTCGC TCTGGCAAGG AGCTGGAGAA GGCTCCCAAG TCGATCAAGT 1150 

CTGGCGACGC CGCGATCGTG CGCATGGTGC CGCAGAAGCC TATGTGCGTG 1200 

10 GAGGTCTTCA ACGACTACGC GCCACTCGGC CGCTTTGCCG TGCGTGACAT 1250 

GCGCCAGACC GTCGCTGTCG GTATCATCAA GGCCGTGACC AAGAAGGACG 1300 

GTTCTGGTGG-TAAGGTGAeG-AAGGeTGeGG-TGAAGGGT-TG-GAAGAAATAA 1-350- 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 865 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1052 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



AAGCTCAAGG CTGAGAGAGA AAGAGGTATT ACCATCGATA TTGCTTTATG 50 

GCAATTCGAA ACCCCAAAAT ACCACTACAC TGTCATTGAT GCCCCAGGTC 100 

ACAGAGATTT CATCAAGAAT ATGATTACTG GTACCTCTCA AGCTGATGTT 150 

GCTTTATTGG TTGTCCCAGC CGATCGTTTC GAAGGTGCCT TCTCCAAGGA 2 00 

35 AGGTCAAACC AGAGAACATG CTTTATTGGC CTTCACTTTG GGTGTCAGAC 250 

AAATGATTGT CGGTATTAAC AAGATGGATA CCTGTGAATA CAAGCAATCT 300 

CGTTTTGATG AAATCTTCAA CGAAGTTGAT GGTTACCTCA AGAAGGTTGG 350 

TTACAACACC GAGAAGATCC CATTCGTTGC CATTTCTGGT TTCGTTGGTG 400 

ATAATATGGT TGAGAGATCT GACAAGATGC CATGGTATAA GGGTAAGACC 450 

40 TTAGTCGAAG CCCTCGACAC TATGGAACCA CCAAAGAGAC CAACTGACAA 500 

GCCACTCCGT CTCCCATTAC AAGATGTTTA CAAGATAGGT GGTGTAGGTA 550 

CTGTCCCAGT CGGTCGTGTT GAGACTGGTA TCATCAGACC AGGTATGAAT 60 0 

GTTACCTTCG CTCCAGCTGG TGTTACCACT GAAGTTAAGT CAGTAGAAAT 650 

GCACCATGAG CAGATGCCAG AGGCCGTCCC AGGTGACAAC GTTGGTTTCA 700 

45 ATGTTAAGAA CGTCTCCATC AAGGATATCA AGAGAGGTTT CGTTGCTTCT 750 

GATGCCAAGA ATGACCCAGC TAAGGGCTGT GAAGACTTCA CTGCTCAAGT 800 

TATCGTCCTC AACCACCCAG GTGAAATCAA GAACGGTTAC TCTCCAGTCG 850 

TTGACTGTCA CACCGCTCAC ATTTCCTGCA AATTCCAGAC TATCACTGCT 900 

AAGATGGACA AGAGATCTGG TAAGGTTTTG GAAGAAAACC CAAAGCTTAT 950 

50 CAAGTCTGGT GATGCTGCTT TGGTTGTTAT GCAACCTTTG AAGCCACTTT 1000 

GTGTTGAGGC CTTCACTGAC TACCCACCTC TAGGTCGTTT CGCTGTCCGT 1050 

GA 1052 



55 

2) INFORMATION FOR SEQ ID NO: 866 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 837 bases 
60 (B) TYPE: Nucleic acid 

486 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 

(B) STRAIN: ATCC 35552 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866 



5 



CAATGAAGTT" CCAGAAATTA ACAATGCCTT- AGTCGTAGAC- GT-TGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

15 TACAGAAGTT CGAGATAGCG GAGATAGCAT- CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

20 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 4 50 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

25 GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 
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2) INFORMATION FOR SEQ ID NO: 867 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867 



AAGGTATTCG ATGCCTTGAA AATGGAAGGC TCCGAGCTGA CCCTGGAAGT 50 

ACAACAGCAG CTGGGCGACG GCATTGTCCG TACCATTGCA CTGGGTACCT 100 

50 CCGACGGCCT GCGTCGCGGC ATGATGATCC AGAACACCGG CAAACCTATC 150 

ATGGTGCCAG TCGGTAAAGC AACCCTGGGT CGCATCATGG ACGTGCTGGG 200 

TAACCCGATC GACGAATGCG GCGCGGTCGC TCACGACCAG ATCGCTTCGA 250 

TCCACCGCGC TCCTCCTGCG TACGACGAAC TGTCGCCATC GCAAGATCTG 3 00 

CTGGAAACCG GCATTAAAGT TATTGACCTG GTGTGCCCGT TCGCCAAGGG 350 

55 CGGTAAAGTC GGTCTGTTCG GCGGTGCAGG TGTGGGCAAG ACCGTGAACA 400 

TGATGGAACT GATCAACAAC ATCGCCAAAG CACACTCGGG TCTGTCCGTG 450 

TTTGCCGGTG TGGGTGAGCG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

GATGGCTGAC GCCAAAGTGG TCGATCTGGA AAATCCAGAG AACTCCAAGG 550 

TTGCGATGGT CTACGGTCAG ATGAATGAAC CACCAGGCAA CCGTCTGCGC 600 

60 GTGGCGCTGA CCGGTCTGAC CATGGCTGAA GCATTCCGTG ACGAAGGCAA 650 

487 
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AGACGTTCTG TTCTTCGTGG 

CCGAAGTATC GGCACTGCTG 

CCTACGCTGG CCGAAGAAAT 

CAAGACCGGT TCGATCAC 



ACAACATCTA CCTgCTTCACC 
GGCCGTATGC CATCGGCTGT 
GGGTCGCCTG CAAGAGCGCA 
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CTGGCCGCSST ^00 w 
GGGTTACCAG 750 
TCACTTCGAC 800 
818 



15 



50 



2) INFORMATION FOR SEQ ID NO: 868 

10 (i) SEQUENCE CHARACTERISTICS: 

_JA) LENGTH: 778 bases 

~TB) T YPE7~Nuc leits - SRS 1" d 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 
20 (B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868 

CTATCTTAGT AGTATCTGCT GCTGATGGCC CAATGCCACA AACTCGTGAA 50 

25 CACATTCTTT TATCACGTAA CGTTGGTGTT CCAGCATTAG TTGTATTCTT 100 

AAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTAGAAA 150 

TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG TGACGATGTA 200 

CCTGTAATCT CTGGTTCTGC ATTAAAAGCT TTAGAAGGCG ACGCTGACTA 250 

TGAGCAAAAA ATCTTAGACT TAATGCAAGC TGTTGATGAC TTCATTCCAA 300 

30 CACCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG TTGAACGTGG 4 00 

TCAAATCAAA GTCGGTGAAG AAATCGAAAT CATCGGTATG CAAGAAGAAT 450 

CAAGCAAAAC AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC 500 

TACGCTGAAG CTGGTGACAA CATTGGTGCA TTATTACGTG GTGTTTCACG 550 

3 5 TGATGACGTA CAACGTGGTC AAGTTTTAGC TGCTCCTGGT ACTATTACAC 600 

CACATACAAA ATTCAAAGCG GATGTTTACG TTTTATCTAA AGATGAAGGT 650 

GGTCGTCATA CACCATTCTT CACTAACTAC CGCCCACAAT TCTATTTCCG 700 

TACTACTGAC GTAACTGGTG TTGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TTATGCCTGG CGATAACGTT GAAATGGA 778 

40 



2) INFORMATION FOR SEQ ID NO: 869 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
55 (B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869 

TGGTCCTATG CCTCAAACAC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 
60 GTGTACCATA CATCGTTGTT TTCTTAAACA AAATGGATAT GGTTGATGAC 100 
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GAAGAATTAC TAGAATTAGT TGAAATGGAA 
ATATGACTTC CCAGGCGACG ATGTTCCTGT 
AAGCTCTTGA AGGCGATGCT TCATACGAAG 
GCTGCAGTTG ACGAATACGT TCCAACTCCA 
5 ATTCATGATG CCAGTCGAAG ACGTATTCTC 
TTGCTACAGG CCGTGTTGAA CGTGGACAAG 
GAAATCGTTG GTATTGCTGA AGAAACTGCT 
TGAAATGTTC CGTAAATTGT TAGACTATGC 
GTGCATTGCT ACGTGGTGTT GCTCGTGAAG 
10 TTGGCTAAAG CTGGTACAAT CACACCTCAT 
_ __ _T_TACGTT_TTA ACAAAAG AAG AAGGTGGA CG 



PCT/CA00/01150 



GTTCGTGACT 


TATTGT&AGA 


150 


AATCGCTGGT 


TCTGCTTTGA 


200 


AAAAAATCAT 


GGAATTAATG 


250 


GAACGTGACA 


CTGACAAACC 


300 


AATCACTGGA 


CGTGGTACTG 


350 


TTCGCGTTGG 


TGACGAAGTT 


400 


AAAACAACTG 


TAACTGGTGT 


450 


TGAAGCAGGG 


GATAACATTG 


500 


ACATCCAACG 


TGGACAAGTA 


550 


ACAAAATTTA 


AAGCTGAAGT 


600 


TCACACTCCA 




640 



15 2) INFORMATION FOR SEQ ID NO: 870' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R754 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870 



GTCCTATGCC TCAAACACGT GAACACATCT TGTTATCACG TAACGTTGGT 50 

GTACCATACA TCGTTGTTTT CTTAAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTACTA GAATTAGTTG AAATGGAAGT TCGTGACTTA TTGTCAGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCTGGTTC TGCTTTGAAA 2 00 

35 GCTCTTGAAG GCGATGCTTC ATACGAAGAA AAAATCATGG AATTAATGGC 2 50 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGACACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ACGAAGTTGA 4 00 

AATCGTTGGT ATTGCTGAAG AAACTGCTAA AACAACTGTA ACTGGTGTTG 4 50 

40 AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGTGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTTAAA GCTGAAGTTT 600 

ACGTTTTAAC AAAAGAAGAA GGTGGACGTC ACACACCATT CTTC 644 



2) INFORMATION FOR SEQ ID NO: 871 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 637 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R758 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871 

TCCTATGCCT CAAACACGTG AACACATCTT GTTATCACGT AACGTTGGTG 50 

TACCATACAT CGTTGTTTTC TTAAACAAAA TGGATATGGT TGATGACGAA 100 

5 GAATTACTAG AATTAGTTGA AATGGAAGTT CGTGACTTAT TGTCAGAATA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCTGGTTCT GCTTTGAAAG 200 

CTCTTGAAGG CGATGCTTCA TACGAAGAAA AAATCATGGA ATTAATGGCT 250 

GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGACACTG ACAAACCATT 300 

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

10 CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA CGAAGTTGAA 400 

ATCGTTGGTA TTGCTGAAGA AACTGCTAAA ACAACTGTAA CTGGTGTTGA 450 

AATGTTCCGT~AAATTGTTAG-ACTATGCTGA- ftGGAGGGGA-T-AAGAT-TGGTG— — —500- 

CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTA 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTTAAAG CTGAAGTTTA 600 

15 CGTTTTAACA AAAGAAGAAG GTGGACGTCA CACTCCA 637 



2 ) INFORMATION FOR SEQ ID NO: 872 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872 

35 GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

4 0 TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 4 00 

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 4 50 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

4 5 GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT CTT 643 



50 



60 



2) INFORMATION FOR SEQ ID NO: 873 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 641 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

490 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873 



GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 
GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 
AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 
ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 
GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 
TGCAGTTGAC - GAATACGTTC~CAACTCeAGA— AGGTGATAG-T— GAGAAAGGAT- 
TCATGATGCC AGTCGAAGAC GTATT.CTCAA TCACTGGACG TGGTACTGTT 
GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 
AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 
AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 
GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 
GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 
ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT C 



50 
100 
150 
200 
250 
-300. 
350 
400 
450 
500 
550 
600 
641 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 874 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: LSPQ 2514 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874 

ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GGCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGGTT ATGCCTGGCG A 681 



2 ) INFORMATION FOR SEQ ID NO: 875 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 bases 

(B) TYPE: Nucleic acid 
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( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: R591 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875 



5 



ATTATCACGT~AACGTTGGTG-TACCAGCATT-AGTTGTATTC— TTAAACAAAG— — — 50- 

TTGACATGGT AGACGACGAA GAATTATTAG AATTAGTTGA AATGGAAGTT 100 

CGTGACTTAT TAAGCGAATA TGACTTCCCA GGTGACGATG TACCTGTAAT 150 

15 CGCTGGTTCT GCATTAAAAG CATTAGAAGG CGATGCTGAA TACGAACAAA 200 

AAATCTTAGA CTTAATGCAA GCAGTTGATG ATTACATTCC AACTCCAGAA 250 

CGTGATTCTG ACAAACCATT CATGATGCCA GTTGAGGACG TATTCTCAAT 300 

CACTGGTCGT GGTACTGTTG CTACAGGCCG TGTTGAACGT GGTCAAATCA 350 

AAGTTGGTGA AGAAGTTGAA ATCATCGGTA TGCACGAAAC TTCTAAAACA 400 

20 ACTGTTACTG GTGTAGAAAT GTTCCGTAAA TTATTAGACT ACGCTGAAGC 4 50 

TGGTGACAAC ATCGGTGCTT TATTACGTGG TGTTGCACGT GAAGACGTAC 500 

AACGTGGTCA AGTATTAGCT GCTCCTGGTT CTATTACACC ACACACAAAA 550 

TTCAAAGCTG AAGTATACGT ATTATCTAAA GATGAAGGTG GACGTCACAC 600 

TCCATTCTTC ACTAACTATC GCCCACAATT CTATTTCCGT ACTACTGACG 650 

25 TAACTGGTGT TGTAAACTTA CCAGA 675 



2) INFORMATION FOR SEQ ID NO: 876 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876 

45 TCTTATTATC ACGTAACGTT GGTGTACCAG CATTAGTTGT ATTCTTAAAC 50 

AAAGTTGACA TGGTAGACGA CGAAGAATTA TTAGAATTAG TTGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

TAATCGCTGG TTCTGCATTA AAAGCATTAG AAGGCGATGC TGAATACGAA 200 

CAAAAAATCT TAGACTTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 250 

50 AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTATGCACG AAACTTCTAA 400 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATCGGT GCTTTATTAC GTGGTGTTGC ACGTGAAGAC 500 

55 GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCTATTA CACCACACAC 550 

AAAATTCAAA GCTGAAGTAT ACGTATTATC TAAAGATGAA GGTGGACGTC 600 

ACACTCCATT CTTCACTAAC TATCGCCCAC AATTCTATTT CCGTACTACT 650 

GACGTAACTG GTGTTGTAAA CTTACCAGAA GGTACAGAAA TGGTTATGCC 700 

TGGC 704 



60 
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2) INFORMATION FOR SEQ ID NO: 877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 
15 (B) STRAIN: ATCC 35984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877 

TCTTAGTTGT ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC 50 

20 ATCTTATTAT CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA 100 

CAAAGTTGAC ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTTCGTGA CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT 200 

GTAATCGCTG GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA 250 

ACAAAAAATC TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC 300 

25 CAGAACGTGA TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC 350 

TCAATCACTG GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA 400 

AATCAAAGTT GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA 450 

AAACAACTGT TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT 500 

GAAGCTGGTG ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA 550 

30 CGTACAACGT GGTCAAGTAT TAGCTGCTCC TGGTTCTATT ACACCACACA 600 

CAAAATTCAA AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT 650 

CACACTCCAT TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC 700 

TGACGTAACT GGTGTTGTAA ACTTACCAGA AGGTACAGAA ATGGTTATGC 750 

CTGGCGACAA CGTTGAAATG 770 

35 



2) INFORMATION FOR SEQ ID NO: 878 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 
50 (B) STRAIN: ATCC 35983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878 

TTGTATTCTT AAACAAAGTT GACATGGTAG ACGACGAAGA ATTATTAGAA 50 

55 TTAGTTGAAA TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG 100 

TGACGATGTA CCTGTAATCG CTGGTTCTGC ATTAAAAGCA TTAGAAGGCG 150 

ATGCTGAATA CGAACAAAAA ATCTTAGACT TAATGCAAGC AGTTGATGAT 200 

TACATTCCAA CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT 250 

TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG 300 

60 TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT CATCGGTATG 350 
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CACGAAACTT CTAAAACAAC TGTTACTGGT 
ATTAGACTAC GCTGAAGCTG GTGACAACAT 
TTGCACGTGA AGACGTACAA CGTGGTCAAG 
ATTACACCAC ACACAAAATT CAAAGCTGAA 
TGAAGGTGGA CGTCACACTC CATTCTTCAC 
ATTTCCGTAC TACTGACGTA ACTGGTGTTG 
GAAATGGTTA TGCCTGGCGA CAACGTTGAA 
TCCAATCGCT ATCGAA 



PCT/CA00/01150 

GTAGAAATGT TCCGTAAATT 400 

CGGTGCTTTA TTACGTGGTG 450 

TATTAGCTGC TCCTGGTTCT 500 

GTATACGTAT TATCTAAAGA 550 

TAACTATCGC CCACAATTCT 600 

TAAACTTACC AGAAGGTACA 650 

ATGACAGTTG AATTAATCGC 700 

716 



2) INFORMATION FOR SEQ ID NO: 879 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY:' Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R764 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 300 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400 

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA 64 0 



2) INFORMATION FOR SEQ ID NO: 880 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

CGGCGCGATC CTGGTTTGCT CGGCTGCCGA CGGCCCCATG CCGCAGACCC 50 

494 
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GCGAGCACAT CCTGCTGTCC CGCCAGGTAG GCGTTCCCTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TGGAACTGGT 150 

CGAGATGGAA GTTCGCGATC TGCTGAACAC CTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CATCATCGGT TCCGCGCTGA TGGCGCTGGA AGGCAAGGAT 250 

5 GACAACGGCA TCGGCGTAAG CGCCGTGCAG AAGCTGGTAG AGACCCTGGA 300 

CTCCTACATT CCGGAGCCGG TTCGTGCCAT CGACCAGCCG TTCCTGATGC 350 

CGATCGAAGA CGTGTTCTCG ATCTCCGGCC GCGGTACCGT GGTAACCGGT 4 00 

CGTGTAGAGC GCGGCATCAT CAAGGTCCAG GAAGAAGTGG AAATCGTCGG 450 

CATCAAGGCG ACCACCAAGA CTACCTGCAC CGGCGTTGAA ATGTTCCGCA 500 

10 AGCTGCTCGA CGAAGGTCGT GCTGGTGAGA ACGTTGGTAT CCTGCTGCGT 550 

GGCACCAAGC GTGAAGACGT AGAGCGTGGC CAGGTTCTGG CCAAGCCGGG 600 

CACCATCAAG CCGCACACCA AGTTCGAGTG CGAAGTGTAC GTGCTGTCCA 6S0 

AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TCAAGGGCTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA YGTGACCGGT AACTGCGAAC TGCCGGAAGG 750 

15 CGTAGAGATG GTAATGCCGG GCGACAACAT. CAAGATGGTT GTCACCCTGA 800 

TCGCTCCGAT CGCCATGGAA GATGGCCTGC G 831 



20 2) INFORMATION FOR SEQ ID NO: 881 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R421 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881 

35 

CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200 

40 TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

45 ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

GTTTTAACAA AAGAAGAAGG TGGACGTCAC ACACCATTCT TC 642 

50 



2) INFORMATION FOR SEQ ID NO: 882 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 

495 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882 



CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

10 AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200 

TCTTGAAGGC "GATGCTTCAT-ACGAAGAAAA-AA-TeA-TGGAA— T-TAATGGG-TG 250- 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

15 TACAGGCCGT GTTGAACGTG GACAAGTTCG- CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

20 GTTTTAACAA AAGAAGAAGG TGGACGTCAT ACACCA 636 



2) INFORMATION FOR SEQ ID NO: 883 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R422 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 

40 GGTCCTATGC CTCAAACACG TGAACATATC 
TGTACCATAC ATCGTTGTAT TCTTAAACAA 
AAGAATTATT AGAATTAGTA GAAATGGAAG 
TACGATTTCC CAGGCGATGA TGTTCCAGTT 
AGCTTTAGAA GGCGACGAGT CTTATGAAGA 

45 CTGCAGTTGA CGAATATATC CCAACTCCAG 
TTCATGATGC CAGTCGAAGA CGTATTCTCA 
TGCTACAGGC CGTGTTGAAC GTGGTGAAGT 
AAATCGTTGG TATTAAAGAC GAAACATCTA 
GAAATGTTCC GTAAATTATT AGACTACGCT 

50 TGCTTTATTA CGTGGTGTAG CACGTGAAGA 
TAGCTAAACC AGCTACAATC ACTCCACACA 
TACGTATTAT CAAAAGAAGA AGGCGGACGT 



ID NO: 883 






TTATTATCAC 


GTAACGTTGG 


50 


AATGGATATG 


GTTGATGACG 


100 


TTCGTGACTT 


ATTATCAGAA 


150 


ATCGCAGGTT 


CTGCTTTGAA 


200 


AAAAATCTTA 


GAATTAATGG 


250 


AACGTGATAC 


TGACAAACCA 


300 


ATCACTGGAC 


GTGGTACTGT 


350 


TCGCGTTGGT 


GACGAAGTTG 


400 


AAACAACTGT 


TACAGGTGTT 


450 


GAAGCAGGCG 


ACAACATCGG 


500 


TATCGAACGT 


GGACAAGTAT 


550 


CAAAATTCAA 


AGCTGAAGTA 


600 


CACACTCCAT 


T 


641 



55 

2) INFORMATION FOR SEQ ID NO: 884 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 640 bases 
60 (B) TYPE: Nucleic acid 

496 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R575 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884 



5 



~TATGCCTCAA"A"CACGTGAAC "ATATCTTATT ~ATCACGTAAC-GTTGGTGTAe 5 0- 

CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAT CAGAATACGA 150 

15 TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 200 

TAGAAGGCGA CGAGTCTTAT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 2 50 

GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 300 

GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGT GAAGTTCGCG TTGGTGACGA AGTTGAAATC 4 00 

20 GTTGGTATTA AAGACGAAAC ATCTAAAACA ACTGTTACAG GTGTTGAAAT 450 

GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 500 

TATTACGTGG TGTTGCACGT GAAGATATCG AACGTGGACA AGTATTAGCT 550 

AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 600 

ATTATCAAAA GAAGAAGGCG GACGTCACAC TCCATTCTTC 64 0. 

25 



2) INFORMATION FOR SEQ ID NO: 885 

30. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
40 (B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885 

TGCCTCAAAC TCGTGAACAC ATCCTATTGT CTCGTCAAGT TGGTGTTCCT 50 

45 TACATCGTTG TATTCTTGAA CAAAGTAGAC ATGGTTGATG ACGAAGAATT 10 0 

ACTAGAATTA GTTGAAATGG AAGTTCGTGA CCTATTAACA GAATACGAAT 150 

TCCCTGGTGA CGATGTTCCT GTAGTTGCTG GATCAGCTTT GAAAGCTCTA 20 0 

GAAGGCGACG CTTCATACGA AGAAAAAATT CTTGAATTAA TGGCTGCAGT 250 

TGACGAATAC ATCCCAACTC CAGAACGTGA CAACGACAAA CCATTCATGA 30 0 

50 TGCCAGTTGA AGACGTGTTC TCAATTACTG GACGTGGTAC TGTTGCTACA 350 

GGTCGTGTTG AACGTGGACA AGTTCGCGTT GGTGACGAAG TTGAAGTTGT 400 

TGGTATTGCT GAAGAAACTT CAAAAACAAC AGTTACTGGT GTTGAAATGT 450 

TCCGTAAATT GTTAGACTAC GCTGAAGCTG GAGACAACAT TGGTGCTTTA 50 0 

CTACGTGGTG TTGCACGTGA AGACATCCAA CGTGGACAAG TTTTAGCTAA 550 

55 ACCAGGTACA ATCACACCTC ATACAAAATT CTCTGCAGAA GTATACGTGT 600 

TGACAAAAGA AGAAGGTGGA CGTCATACTC CA 632 



60 2) INFORMATION FOR SEQ ID NO: 886 

497 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
(B) STRATNT-R5-76 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886 



CGGCCCAATG CCTCAAACTC GTGAACACAT CCTATTGTCT CGTCAAGTTG 50 

GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT GGTTGATGAC 100 

GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTAACAGA 150 

ATACGAATTC CCTGGTGACG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA 200 

20 AAGCTCTAGA AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG 250 

GCTGCAGTTG ACGAATACAT CCCAACTCCA GAACGTGACA ACGACAAACC 300 

ATTCATGATG CCAGTTGAAG ACGTGTTCTC AATTACTGGA CGTGGTACTG 350 

TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 400 

GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 450 

25 TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCTGGA GACAACATTG 500 

GTGCTTTACT ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT 550 

TTAGCTAAAC CAGGTACAAT CACACCTCAT ACAAAATTCT CTGCAGAAGT 600 

ATACGTGTTG ACAAAAGAAG AAGGTGGACG TCATACTCCA 640 



2) INFORMATION FOR SEQ ID NO: 887 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887 



ATCCTGGTTT GCTCCGCAGC TGACGGCCCA ATGCCACAGA CCCGCGAGCA 50 

CATCCTGCTG GCCCGCCAAG TTGGCGTTCC ATACATCATC GTGTTCCTGA 100 

50 ACAAGTGCGA CCTGGTTGAC GACGCAGAAC TGCTGGAACT GGTCGAAATG 150 

GAAGTGCGTG AATTGCTGTC GAAATACGAG TTCCCAGGCG ACGACGTACC 200 

AATCATCAAG GGTTCGGCAC GTATGGCGCT GGAAGGCAAA GAAGGCGAGA 250 

TGGGCGTTGA CGCCATCATG CGTCTGGCCG ATGCACTGGA CAGCTACATC 300 

CCTACGCCAG AGCGCGCAGT CGATGGCGCC TTCCTGATGC CAGTGGAAGA 350 

.55 CGTGTTCTCG ATCTCGGGTC GCGGTACCGT TGTGACCGGT CGTATCGAGC 400 

GCGGCGTGAT CAAGGTCGGC GAAGAGATCG AAATCGTCGG CATTATCGAC 450 

ACCGTCAAAA CCACTTGCAC CGGCGTGGAA ATGTTCCGCA AGCTGCTGGA 500 

CCAGGGTCAA GCCGGCGACA ACGTTGGTCT GCTGCTGCGC GGCACCAAGC 550 

GTGAAGACGT ACAGCGTGGT CAGGTTCTGG CCAAGCCAGC GTCGATCAAG 600 

60 CCGCACAACC ACTTCACCGG CGAGATCTAC GTTCTGTCGA AAGATGAAGG 650 

498 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CGGCCGTCAC ACCCCGTTCT 
GTACGACTGA CGTGACCGGT 
GTCATGCCAG GCGACAACGT 
CGCGAT 



TCAACAACTA TCGTCCACAG 
TCGATCGAAC TGCCAGCAGA 
GTCGATCACC GTCAAGCTGA 



TTCTACTTCC " 7 OCT 
CAAAGAAATG 750 
TCAACCCGAT 800 
806 



2 ) INFORMATION FOR SEQ ID NO: 888 

10 (i) SEQUENCE CHARACTERISTICS: 
(A ) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
20 (B) STRAIN: R503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888 



TATGCCTCAA 
25 CATACATCGT 
TTATTAGAAT 
TTTCCCAGGC 
TAGAAGGCGA 
GTTGACGAAT 
30 GATGCCAGTC 
CAGGCCGTGT 
GTTGGTATTA 
GTTCCGTAAA 
TATTACGTGG 
35 AAACCAGCTA 
ATTATCAAAA 



ACACGTGAAC 
TGTATTCTTA 
TAGTAGAAAT 
GATGATGTTC 
CGAGTCTTAT 
ATATCCCAAC 
GAAGACGTAT 
TGAACGTGGT 
AAGACGAAAC 
TTATTAGACT 
TGTAGCACGT 
CAATCACTCC 
GAAGAAGGCG 



ATATCTTATT 
AACAAAATGG 
GGAAGTTCGT 
CAGTTATCGC 
GAAGAAAAAA 
TCCAGAACGT 
TCTCAATCAC 
GAAGTTCGCG 
ATCTAAAACA 
ACGCTGAAGC 
GAAGATATCG 
ACACACAAAA 
GACGTCACAC 



ATCACGTAAC 
ATATGGTTGA 
GACTTATTAT 
AGGTTCTGCT 
TCTTAGAATT 
GATACTGACA 
TGGACGTGGT 
TTGGTGACGA 
ACTGTTACAG 
AGGCGACAAC 
AACGTGGACA 
TTCAAAGCTG 
TCCA 



GTTGGTGTAC 50 

TGACGAAGAA 100 

CAGAATACGA 150 

TTGAAAGCTT 200 

AATGGCTGCA 250 

AACCATTCAT 300 

ACTGTTGCTA 350 

AGTTGAAATC 400 

GTGTTGAAAT 450 

ATCGGTGCTT 500 

AGTATTAGCT 550 

AAGTATACGT 600 
634 



40 2) INFORMATION FOR SEQ ID NO: 889 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889 

55 

TGTCTTCATC CAGGAATTGA TTGTGAGTCG TTCCACATGC TCACCTAGTT 50 
TTCGCTCGAT CTTTTCACTA ACGCAAACCA TGTAGAACAA CATTGCCAAG 100 
GCCCACGGTG GTTACTCCGT CTTCACTGGT GTTGGTGAGC GTACTCGTGA 150 
GGGTAACGAT CTGTACCACG AAATGCAGGA GACTGGTGTC ATTCAGCTCG 200 
60 AGGGTGAATC CAAGGTCGCA CTGGTGTTCG GACAGATGAA CGAGCCCCCC 250 

499 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GGTGCCCGTG CCCGTGTCGC CCTTACCGGT 
CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 
GTTTCACCCA GGCCGGTTCT GAGGTGTCTG 
TCTGCCGTCG GTTACCAGCC CACCCTGGCC 
5 GGAGCGTATC ACCACCACCA AGAAGGGTTC 



PCT/CAOO/01150 

CTGACCATTG CCGAGTACTT 3 00 

CTTCATTGAC AACATTTTCC 350 

CCCTTCTCGG TCGTATCCCC 400 

GTCGACATGG GTGGTATGCA 4 50 

TATTACCTCC GTC 493 



2) INFORMATION FOR SEQ ID NO: 890 

10 

(i) SEQUENCE CHARACTERISTICS: 
(A) — tENGTHr -466-bases — 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890 



25 TTGATTGTAC 
AAGCGTAGAA 
GGTGTCGGTG 
GGAAACTGGT 
TCGGTCAGAT 

30 GGTTTGACCA 
TCTCTTCATT 
CTGCCCTTCT 
GCCGTCGACA 
TTCCATCACC 

35 



GTCTTTACCT 
CAACATTGCC 
AACGTACTCG 
GTCATTCAGC 
GAACGAGCCC 
TTGCCGAGTA 
GACAACATTT 
GGGTCGTATC 
TGGGTATCAT 
TCCGTC 



TTCTGCCTGA 
AAGGCTCACG 
TGAGGGTAAC 
TCGAGGGTGA 
CCCGGTGCCC 
CTTCCGTGAC 
TCCGTTTCAC 
CCCTCTGCCG 
GCAGGAGCGT 



CTGTTTACGA 
GTGGTTACTC 
GATTTGTACC 
ATCCAAGGTC 
GTGCCCGTGT 
GAGGAAGGTC 
TCAGGCCGGT 
TCGGTTACCA 
ATTACCACCA 



CAACTAACGA 50 

TGTCTTCACT 100 

ACGAAATGCA 150 

GCCCTCGTGT 200 

CGCTCTTACT 250 

AGGACGTGCT 300 

TCTGAGGTGT 350 

GCCCACCCTT 400 

CCACCAAGGG 450 
466 



2) INFORMATION FOR SEQ ID NO: 891 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 478 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
50 (B) STRAIN: ATCC 42570 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891 



AGGAGCTGAT CAACAACATC 
55 ACTGGTGTCG GTGAGCGTAC 
GCAGGAGACC TCGGTCATTC 
TCTTTGGTCA GATGAACGAG 
ACTGGTCTTA CCGTCGCCGA 
TTAGTTCTCG TCCACTCATG 
60 AACGTGCCAG TGCTGCTTTT 



GCCAAGGCTC ACGGTGGTTA 
CCGTGAGGGT AACGATCTGT 
AGCTCGAGGG CGAGTCTAAG 
CCCCCGGGTG CTCGTGCCCG 
GTACTTCCGT GACCAGGAGG 
CCGAAACATG TGCGTGTTCC 
CATCGACAAC ATTTTCCGAT 
500 



CTCCGTCTTC 50 

ACCACGAAAT 100 

GTGGCCCTGG 150 

TGTCGCTCTT 200 

GTCAGGATGG 250 

GAGGCTAATC 300 

TCACACAGGC 350 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CGGTTCCGAG GTGTCTGCCC TGCTGGGTCG* t kT^CCTCT GCCGTfcGGTT 
ACCAGCCCAC CCTCGCCGTC GACATGGGTG GCATGCAGGA GCGTATCACC 
ACCACCAAGA AGGGCTCTAT CACCTCCG 



400/ 
450 
478 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 892 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 bases 

(B) TYPE: Nucleic acid 

(C) — STRANDEDNESS:— Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:^ Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892 

GTCTTTATCC AGGAGTTGAT TGTACGTCTT TACCTTTCTG CCTGACTGTT 50 

TACGACAACT AACGAAAGCG TAGAACAACA TTGCCAAGGC TCACGGTGGT 100 

TACTCTGTCT TCACTGGTGT CGGTGAACGT ACTCGTGAGG GTAACGATTT 150 

GTACCACGAA ATGCAGGAAA CTGGTGTCAT TCAGCTCGAG GGTGAATCCA 200 

AGGTCGCCCT CGTGTTCGGT CAGATGAACG AGCCCCCCGG TGCCCGTGCC 250 

CGTGTCGCTC TTACTGGTTT GACCATTGCC GAGTACTTCC GTGACGAGGA 300 

AGGTCAGGAC GTGCTTCTCT TCATTGACAA CATTTTCCGT TTCACTCAGG 350 

CCGGTTCTGA GGTGTCTGCC CTTCTGGGTC GTATCCCCTC TGCCGTCGGT 400 

TACCAGCCCA CCCTTGCCGT CGACATGGGT ATCATGCAGG AGCGTATTAC 450 

CACCACCACC AAGGGTTCCA TCACCTCCGT C 481 



2 ) INFORMATION FOR SEQ ID NO: 893 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1208 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893 

CTCGTCCGTG GTGCTAAGGC CACTGACACT GGTTSCCCCA TTACCATCCS 50 
CGTCGGCCCC GGTACCCTCG GTCGCATCAT GAACGTCACC GGTGACCCGA . 100 

TCGACGAGCG CGGTCCCATC AAGACCGACA AGTTCCGTCC CATCCACGCT 150 

GAGGCTCCCG AGTTCGTTGA CCAGTCGACC ACCGCTGAGG TTCTCGTGAC 200 

TGGTATCAAG GTCGTCGATC TGCTTGCTCC CTACGCCCGT GGTGGTAAGA 250 

TTGGTCTGTT TGGCGGTGCC GGTGTTGGCA AGACCGTGTT CATCCAGGAG 300 

CTCATCAACA ACATCGCCAA GGCCCACGGT GGTTACTCCG TCTTCACCGG 350 

TGTCGGCGAG CGTACCCGTG AGGGTAACGA TCTGTACCAC GAAATGCAGG 400 

AGACCTCTGT CATTCAGCTT GACGGTGACT CCAAGGTCGC CCTGGTGTTC 450 



501 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

GGTCAGATGA ACGAGCCCCC TGGTGCTCGT GCCCGTGTCG CCTTGAC&GG 500 

TTTGACTGTC GCTGAGTACT TCCGTGACGA GGAGGGCCAG GATGGTATGT 550 

TTTGAATTAT TTCCTTGTCG TACAGTTCCA AATCGAAGAA TTACTAACTT 600 

GTCAGTGCTT CTCTTCATCG ACAACATTTT CCGCTTCACC CAGGCCGGTT 650 

5 CTGAGGTGTC TGCCCTTCTG GGTCGTATTC CCTCCGCTGT CGGTTACCAG 700 

CCCACGCTCG CCGTGGACAT GGGTCTGATG CAGGAGCGTA TTACCACCAC 750 

CCGCAAGGGC TCAATTACCT CCGTCCAGGC CGTCTACGTG CCCGCTGACG 800 

ATCTGACGGA TCCCGCCCCC GCCACCACCT TCGCCCATCT GGACGCCACC 850 

ACTGTGCTGT CCCGAGGTAT CTCTGAGCTG GGTATCTACC CCGCTGTCGA 900 

10 CCCCCTCGAC TCCAAGTCGC GTATGCTGGA CCCCCGTATT GTCGGTGACG 950 

ACCACTACGA GACCGCCACT CGCGTCCAGC AGATCCTCCA AGAGTACAAG 1000 

TeGeTGGaGG-AeATeATeGe-eATTeTGGGT—ATGGAeGAGe-TGTeTGAGGe 1050- 

CGACAAGCTT ACAGTCGAGC GTGCTCGTAA GATCCAGCGT TTCCTGAGCC 1100 

AGCCGTTCAC GGTCGCGCAG GTCTTCACTG GTATCGAAGG CCAGCTGGTC 1150 

15 GATCTGAAGG ACACTATCGC TTCGTTCAAG GCTATCCTGA GCGGTGAGGG 1200 

TGACAGCC 1208 



20 2) INFORMATION FOR SEQ ID NO: 894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894 



TTCAGGAACT TATTGTAAGC CGCCCTCTTT ATGCATTGAG GGTGAATAAG 50 

AAGGCTGACA GGTAATAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC 100 

CGTGTTCACT GGTGTCGGTG AGCGTACCCG TGAAGGAAAC GATCTGTACC 150 

ATGAAATGCA GGAAACCCGC GTCATCCAGC TTGATGGCGA GTCTAAGGTC 200 

4 0 GCCCTTGTGT TCGGTCAGAT GAACGAGCCC CCTGGAGCCC GTGCCCGTGT 250 

CGCTCTTACT GGTCTTACCG TTGCCGAATA CTTCCGTGAC GAGGAGGGCC 300 

AAGATGGTAC GCCTTTTTAC TCTTCTTATT CTTCGGGTCG GACTACAGAA 350 

CTAACCTGCT CCAGTGCTTC TCTTCATTGA TAACATTTTC CGTTTCACAC 400 

AAGCCGGTTC TGAGGTGTCT GCCTTGCTTG GACGTATTCC CTCTGCCGTC 450 

4 5 GGTTACCAGC CCACTCTCGC CGTCGACATG GGTGGTATGC AGGAACGTAT 500 

CACAACCACC AACAAGGGTT CCATTACTTC CGTG 534 



50 2 ) INFORMATION FOR SEQ ID NO: 895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

502 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 95 

5 

CAAGGCTCAC GGTGGTTACT CCGTCTTCAC TGGTGTCGGT GAGCGTACCC 50 

GTGAGGGTAA CGATCTGTAC CACGAAATGC AGGAGACCTC GGTCATTCAG 100 

CTCGAGGGCG AGTCTAAGGT GGCCCTGGTC TTTGGTCAGA TGAACGAGCC 150 

CCCGGGTGCT CGTGCCCGTG TCGCTCTTAC TGGTCTTACC GTCGCCGAGT 200 

10 ACTTCCGTGA CCAGGAGGGT CAGGATGGTT AGTTCTCGTC CACTCATGCC 250 

GAAACATGTG CGTGTTCCGA GGCTAATCAA CGTGCCAGTG CTGCTTTTCA 300 



"TCGACAACAT~TTTC"CGATTC" ~ACCCAGGCCG~GTTCCGAGGT GTCTGCCCTG 350 
CTGGGTCGTA TCCCCTCTGC CGTCGGTTAC CAGCCCACCC TCGCCGTCGA 400 
CATGGGTGGC ATGCAGGAGC GTATCACCAC CACCAAGAAG GGCTCTAT 448 



15 



2) INFORMATION FOR SEQ ID NO: 896 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 
30 (B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896 

ATTCAGGAGC TGATTGTAAG TTGCCAATCC ATGAACTGGA GATTTGGTGT 50 

35 GACCCATAGA ACTAACAAAT TATTTAGAAC AACATCGCCA AGGCTCACGG 100 

TGGTTACTCC GTCTTCTGTG GTGTCGGTGA GCGTACTCGT GAGGGTAACG 150 

ATCTGTACCA CGAAATGCAG GAGACTGGTG TCATCCAGCT CGAGGGTGAC 200 

TCCAAGGTCG CTCTGGTCTT CGGTCAGATG AACGAGCCCC CGGGTGCCCG 250 

TGCCCGTGTC GCCCTTACCG GTCTGACCAT TGCCGAGTAC TTCCGTGACG 300 

40 AGGAGGGTCA GGACGTGCTG CTCTTCATTG ACAACATTTT CCGTTTCACC 350 

CAGGCCGGTT CTGAGGTGTC TGCCCTTCTG GGTCGTATCC CCTCTGCCGT 4 00 

CGGTTACCAG CCCACTCTGG CCGTCGACAT GGGTGGTATG CAGGAGCGTA 450 

TTACCACCAC CACCAAGGGT TCCATTACCT CCG 483 



2) INFORMATION FOR SEQ ID NO: 897 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 



503 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897 



AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

5 CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 200 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 250 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 300 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 350 

10 GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 4 00 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACG~GTG~ACSACATGCT _ TGAGCCCTCC~ 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTY CAACAAGCCC 600 

15 CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

20 CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

25 CGAGTCCTTC ACTGACTACC CCCC 1124 



2 ) INFORMATION FOR SEQ ID NO: 898 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 63 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Penicillium marneffei 

(D) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898 

45 CTCAAGGCTG AGCGTGAGCG TGGTATCACC ATCGATATTG CTCTCTGGAA 50 

GTTCCAGACT GCCAAGTACG AGGTTACCGT CATTGACGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC CGATTGCGCT .150 

ATTCTCATCA TTGCCTCTGG TACTGGTGAA TTCGAGGCTG GTATCTCCAA 2 00 

GGATGGCCAG ACTCGTGAGC ACGCTCTTTT GGCTTTCACC CTCGGTGTCC 250 

50 GTCAGCTCAT TGTTGCCCTC AACAAGATGG ACACTTGCAA GTGGTCTCAG 300 

GGTGAGTACT CGTACCTGCG TTTGGCCTTG AATATCTTAC TAATGCACCA 350 

TAGATCGTTA CAACGAAATT GTCAAGGAGA CTTCCAACTT CATCAAGAAG 400 

GTCGGATACA ACCCTAAGAA CGTTCCTTTC GTTCCTATCT CCGGTTTCAA 450 

CGGTGACAAC ATGCTTGAGC CCTCCCCCAA CTGCCCCTGG TACAAGGGTT 500 

55 GGGAGAAGGA GACCAAGGCC GGTAAGGTCA CTGGTAAGAC CCTCCTCGAG 550 

GCCATCGACG CCATTGAGCC CCCTACCCGT CCCGCCAACA AGGTTAGTCC 600 

CTCCTCGACT ACTCAAACCC TCCTCATAAG TTCATGATTA CGACTCGTTC 650 

ACAGCCCCTC CGTCTTCCCC TCCAGGATGT CTACAAGATC GGTGGTATTG 700 

GAACGGTTCC CGTCGGTCGT GTTGAGACTG GTACCATCGT TCCTGGTATG 750 

60 GTTGTCACCT TGTAAGTCAC TCTCCTCGCT TATCCTACCT GAAATCATCA 800 

504 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TGTGCTAACT TGACACTCAG CGCTCCCGCC AACGTCACCA CTGAAGTCAA 850 
GAGTGTTGAA ATGCACCACC AGCAGCTCAC TGCCGGTCAG CCCGGTGACA 900 
ACGTTGGTTT CAACGTGAAG AACGTCTCCG TCAAGGAAAT CCGTCGTGGT 950 

AACGTTGCTG GTGACAGCAA GAACGACCCC CCTGCCGGTG CTGCCTCCTT 1000 

5 CAACGCCCAG GTCATCGTCC TCAACCACCC CGGTCAGGTC GGTGCTGGTT 1050 

ACGCCCCAGT CCTCGATTGC CACACTGCCC ACATTGCTTG CAAGTTCGCT 1100 

GAGCTCCTCG AGAAGATTGA CCGTCGTACC GGAAAGTCTG TTGAGGACCA 1150 

CCCCAAGTTC ATCAAGTCCG GTGACGCTGC CATCGTCAAG ATGATTCCTT 1200 

CCAAGCCCAT GTGTGTTGAG GCTTTCACCG AGTACCCTCC TCTCGGTCGT 1250 

10 TTCGCCGTTC GCGAGTAAGT TTTATCTCCG TTGTCTATTT TCCATCCTTC 1300 

CCTTCTCC TC CG TCTTCCAT ATATATTTTT TCAGTTATAT GTGACTAACC 1350 

ACAAATCACG GGA 1363" 



15 



2) INFORMATION FOR SEQ ID NO: 899 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1147 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 99 

CAAGCTGAAA GCCGAGCGTG AGCGTGGTAT CACTATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGCAT 100 

TATCTCACTC CTCACAGAAG CACGCTCCTA ACATCACACA GACGCTCCCG 150 

35 GTCACCGTGA TTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCCGAC 2 00 

TGCGCTATCC TCATTATCGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCC TACACCCTCG 300 

GTGTGCGTCA GCTCATCGTC GCCATCAACA AGATGGACAC CACCAAGTGG 350 

TCTGAGGCCC GTTACCAGGA GATCATCAAG GAGACCTCCA ACTTCATCAA 4 00 

40 GAAGGTCGGC TACAACCCCA AGACCGTCGC TTTCGTCCCC ATCTCTGGCT 450 

TCAACGGCGA CAACATGCTT GCCCCCTCCA CCAACTGCCC CTGGTACAAG 500 

GGATGGGAGC GTGAGGTCAA GGGCAACAAG CAGACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGGCATTG AGCCCCCCAA GCGTCCTTCC GACAAGCCCC 600 

TCCGTCTTCC TCTCCAGGAT GTTTACAAGA TCGGTGGTAT CGGAACTGTT 650 

45 CCTGTCGGCC GTATCGAGAC TGGTGTCCTC AAGCCCGGTA TGGTCGTTAC 700 

CTTCGCTCCC GCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACGAGCAGCT CACTGAGGGT CTTCCCGGTG ACAACGTCGG TTTCAACGTG 800 

AAGAACGTTT CCGTCAAGGA CATCCGCCGT GGTAACGTTG CCAGTGACTC 850 

CAAGAACGAC CCCGCTCTGG GTGCCGCTTC TTTCGACGCC CAGGTCATCG 900 

50 TCCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC GGTCCTCGAT 950 

TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGATCA AGGAGAAGAT 1000 

CGACCGCCGT ACCGGCAAGT CTGTTGAGGA CGCCCCCAAG TTCATCAAGT 1050 

CTGGTGACTC TGCCATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTT 1100 

GAGGCTTTCA CCGACTACCC TCCTCTGGGC CGTTTCGCCG TCCGTGA 1147 

55 



2) INFORMATION FOR SEQ ID NO: 900 

60 (i) SEQUENCE CHARACTERISTICS: 

505 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
10 (B) STRAIN: ATCC 60735 

(xi") — SEQUENCE" DESCRTPTION-:-SEQ-rD -NO :- -900 

CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

15 GTTCGAGACT CCCAAGTACT ATGTCACCGT/ CATTGGTACG TCGACTCGCG 100 

CGAGACTGGT CGCAATTTCC ACGTCGCTAA CGTGCTTGAA CAGACGCTCC 150 

CGGCCACCGT GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG 200 

ACTGCGCTAT CCTCATTATC GCTGCCGGCA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTCG CCTACACCCT 300 

20 CGGTGTTAAG CAGCTCATCG TCGCTATCAA CAAGATGGAC ACCACCAAGT 350 

GGTCTGAGGC CCGTTTCCAG GAGATCATCA AGGAGACCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGACCGTC GCTTTCGTCC CCATCTCTGG 450 

TTTCCACGGC GACAACATGC TTTCCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGCTGGGA GAAGGAGACC AAGGCTGGCA AGTCCACCGG CAAGACCCTC 550 

25 CTTGAGGCCA TCGACTCCAT CGAGCCCCCC AAGCGCCCCA GCGACAAGCC 600 

CCTCCGCCTT CCCCTTCAGG ATGTGTACAA GATCGGCGGT ATCGGCACAG 650 

TCCCTGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG CATGGTCGTG 70 0 

ACCTTCGCTC CTTCCAACGT CACCACCGAA GTCAAGTCCG TTGAGATGCA 750 

CCACGAGCAG CTCTCCGAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

3 0 TCAAGAACGT CTCCGTCAAG GAGATCCGTC GTGGCAACGT CGCCGGTGAC 850 

TCCAAGAACG ACCCCCCTCT GGGTGCCGCT TCTTTCGATG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGGTCGGTGC TGGCTACGCC CCCGTCCTCG 950 

ACTGCCACAC CGCCCACATT GCCTGCAAGT TCGCCGAGAT CAAGGAGAAG 1000 

ATCGACCGCC GTACCGGCAA GTCTGTCGAG TCCGCCCCCA AGTTCATCAA 1050 

35 GTCTGGCGAC TCTGCCATCG TCAAGATGAT TCCCTCCAAG CCCATGTGCG 1100 

TTGAGGCTTT CACCGACTAC CCTCCTCTGG GCCGCTTCGC CGTCCGTGAC 1150 



40 2) INFORMATION FOR SEQ ID NO: 901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32075 



55. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901 



TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

60 GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

506 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ACCGATAAAC 
TATTGGCACT 
GTATGGTCGT 
GTCGAAATGC 
CGGCTTCAAC 
TTGCCGGTGA 
GCCCAGGTCA 
CCCAGTCCTC 
TCCTTGAGAA 
AAGTTCATCA 
G 



CTCTCCGTCT 
GTTCCTGTCG 
GACCTTCGCT 
ACCACCAGCA 
GTCAAGAATG 
CTCTAAGAAT 
TCGTCCTCAA 
GACTGCCATA 
GATTGATCGA 
AGTCCGGTGA 



TCCCCTCCAG GATGTCTACA AGATCTC&3G 
GACGTGTTGA GACTGGAGTC ATCAAGCCCG 
CCCGCCAACG TCACCACTGA AGTCAAGTCC 
GCTTACCGCC GGTAACCCCG GTGACAACGT 
TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 
GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 
CCACCCTGGT CAGGTTGGCG CTGGTTATGC 
CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 
CGAACCGGAA AGTCTGTTGA GAACAACCCC 
TGCTGCTATC GTCAAGATGA TTCCTTCCAA 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
751 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 7968 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902 

CGTGAGCGCG GTATCACCAT CGATATTGCT CTGTGGAAGT TCGAGACCCC 50 

CAAGTACTAC GTCACCGTCA TTGACGCCCC CGGTCATCGC GATTTCATCA 100 

AGAACATGAT CACTGGTACC TCGCAGGCCG ACTGCGCCAT TCTCATCATT 150 

GCCGCTGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG ATGGCCAGAC 200 

TCGTGAGCAC GCTCTGCTCG CCTACACCCT GGGTGTGCGG CAGCTGATCG 250 

TCGCCATCAA CAAGATGGAC ACGGCCAAGT GGGCTGAGGC TCGTTACCAG 300 

GAGATCATCA AGGAGACCTC CAACTTCATC AAGAAGGTCG GCTACAACCC 350 

CAAGACTGTT GCCTTCGTCC CCATCTCGGG CTTCCACGGC GACAACATGC 400 

TTACTCCCTC GACCAACTGC CCCTGGTACA AGGGCTGGGA GAAGGAGGGC 450 

AAGAGCGGCA AGGTTACCCG TAAGACTCTG CTGGACGCCA TTGACGCCGT 500 

CGAGCCCCCC AAGCGCCCCA CGGACAAGCC CCTGCGTCTG CCCCTCCAGG 550 

ATGTCTACAA GATCGGCGGT ATCGGCACTG TCCCTGTCGG CCGTATCGAG 600 

ACTGGTGTCC TGAAGCCCGG CATGGTCGTC ACCTTTGCCC CGTCCAACGT 650 

CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACGAGCAG CTTGTTGAGG 700 

GTGTTCCCGG CGACAACGTC GGCTTCAACG TCAAGAACGT CTCCGTCAAG 750 

GAGATCCGTC GTGGCAACGT TGCCGGTGAC TCCAAGAACG ACCCCCCCTC 800 

GGGCGCCGCC ACCTTCAACG CCCAGGTCAT TGTCCTGAAC CACCCCGGCC 850 

AGGTCGGCAA CGGCTACGCC CCGGTTCTGG ACTGCCACAC CGCCCACATT 900 

GCCTGCAAGT TCACCGAGAT CCTTGAGAAG ATCGACCGCC GTACCGGCAA 950 

GTCGGTTGAG AACAACCCCA AGTTCATCAA GTCGGGTGAC GCCGCCATTG 1000 

TCAAGCTGAC GCCCTCGAAG CCCATGTGCG TTGAGGCCTT CACTGACTAC 1050 

CCCCCT 1056 



2) INFORMATION FOR SEQ ID NO: 903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 bases 

(B) TYPE: Nucleic acid 

507 



SUBSTITUTE SHEET (RULE 26) 



5 



WO 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903 



~ _ CAA"GGCTGAG~CGTGAGCGTG _ GTATCACCAT _ CGATATTGCT~CTCTGGAAGT —50 

TCCAGACTGC CAAGTACGAG GTTACCGTCA TTGACGCCCC CGGTCACCGT 100 

GATTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCCG ATTGCGCTAT 150 

15 TCTCATCATT GCCTCTGGTA CTGGTGAATT CGAGGCTGGT ATCTCCAAGG 200 

ATGGCCAGAC TCGTGAGCAC GCTCTTTTGG CTTTCACCCT CGGTGTCCGT 250 

CAGCTCATTG TTGCCCTCAA CAAGATGGAC ACTTGCAAGT GGTCTCAGGG 300 

TGAGTACTCG TACCTGCGTT TGGCCTTGAA TATCTTACTA ATGCACCATA 350 

GATCGTTACA ACGAAATTGT CAAGGAGACT TCCAACTTCA TCAAGAAGGT 4 00 

20 CGGATACAAC CCTAAGAACG TTCCTTTCGT TCCTATCTCC GGTTTCAACG 450 

GTGACAACAT GCTTGAGCCC TCCCCCAACT GCCCCTGGTA CAAGGGTTGG 500 

GAGAAGGAGA CCAAGGCCGG TAAGGTCACT GGTAAGACCC TCCTCGAGGC 550 

CATCGACGCC ATTGAGCCCC CTACCCGTCC CGCCAACAAG GTTAGTCCCT 60 0 

CCTCGACTAC TCAAACCCTC CTCATAAGTT CAGATTACTG ACTCGTTCAC 650 

25 AGCCCCTCCG TCTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATTGGA 700 

ACGGTTCCCG TCGGTCGTGT TGAGACTGGT ACCATCGTTC CTGGTATGGT 750 

TGTCACCTTG TAAGTCACTC TCCTCGCTTA TCCTACCTGA AATCATCATG 800 

TGCTAACTTG ACACTCAGCG CTCCCGCCAA CGTCACCACT GAAGTCAAGA 850 

GTGTTGAAAT GCACCACCAG CAGCTCACTG CCGGTCAGCC CGGTGACAAC 90 0 

30 GTTGGTTTCA ACGTGAAGAA CGTCTCCGTC AAGGAAATCC GTCGTGGTAA 950 

CGTTGCTGGT GACAGCAAGA ACGACCCCCC TGCCGGTGCT GCCTCCTTCA 1000 

ACGCCCAGGT CATCGTCCTC AACCACCCCG GTCAGGTCGG TGCTGGTTAC 1050 

GCCCCAGTCC TCGATTGCCA CACTGCCCAC ATTGCTTGCA AGTTCGCTGA 110 0 

GCTCCTCGAG AAGATTGACC GTCGTACCGG AAAGTCTGTT GAGGACCACC 1150 

35 CCAAGTTCAT CAAGTCCGGT GACGCTGCCA TCGTCAAGAT GATTCCTTCC 12 00 

AAGCCCATGT GTGTTGAGGC TTTCACCGAG TACCCTCCTC TCGGTCGTTT 12 50 

CGCCGTTCGC GAGTAAGTTT TATCTCCGTT GTCTATTTTC CATCCTTCCC 1300 

TTCTCCTCCG TCTTCCATAT ATATTTTTTC AGTTATATGT GACTAACCAC 1350 

AAATCACGGG AATAGC 1366 

40 



2) INFORMATION FOR SEQ ID NO: 904 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 
55 (B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904 

CTATTATCGT TGTTGCCGCT TCCGACGGTC AAATGCCCCA GACTCGTGAG 50 
60 CATCTGCTGC TCGCCCGCCA GGTCGGTGTC CAGAAGATCG TTGTCTTCGT 100 

508 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

CAACAAGGTC GATGCTGTTG AGGACAAGGA GATGTTGGAG CTCGTCGAGA 150 
TGGAGATGCG CGAATTGCTC AGCAGCTACG GCTTCGAGGG CGACGAGACT 200 
CCCATCATCA TGGGATCTGC CCTCTGCGCC ATTGAGGGCC GCGAACCTGA 250 
GATTGGTGTC AACCGAATTG ATGAGCTGCT CGAGGCCGTT GATACTTGGA 300 
5 TCCCCACCCC TCAGCGTGAG ACCGACAAGC CTTTCCTCAT GGCCGTCGAG 350 
GATGTCTTCT CCATTGCTGG TCGTGGCACT GTCGTCTCTG GCCGTGTCGA 4 00 
GCGAGGTATC CTGAAGCGCG ATGCTGAAGT TGAGCTCGTC GGCAAGGGCA 450 
CCGCCCCCAT CAAGACCAAG GTTACCGATA TCGAGACCTT CAAGAAGTCC 500 
TGCGAGGAGT CTCGCGCTGG TGACAACTCC GGTCTCCTTC TTCGTGGTGT 550 
10 CAAGCGTGAT GAAGTCCGCC GTGGTATGGT CGTTTCCGTC CCTGGACAGG 600 
T.CAAGjGCGCA_jCAAGAAGTTC CTTGTCTCCA TGTACGTGTT GAGCAAGGAG 650 



GAAGGTGGTC GTCACACTGG CTTCGGTGAG AACTACAGGC CGCAAATGTT 700 

CATCCGCACT GCCGACGAGT CGTGTGCCCT GTACTGGCCA GAAGGCACCG 750 

AGGACGCCCA TGACAAGCTT GTTATGCCCG GTGACAACGT CGAGATGGTT 8 00 

15 TGCGAGCTCC ATGCACCACA CGTCTTGGAG CCTGGTCAAC G 841 



2) INFORMATION FOR SEQ ID NO: 905 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905 

3 5 CGGTGCTATC ATTGTCGTCG CCGCCTCCGA CGGTCAGATG TACGTTAACC 50 

TTAAAAGAAT AACTCTCCTT CAGTATATAT GCTTACACTG GCGATCAACA 100 

GGCCCCAGAC TCGTGAGCAC TTGCTGCTTG CTCGTCAGGT CGGTGTCCAG 150 

AAGATCGTTG TCTTCGTCAA CAAGGTCGAT GCTATCGATG ACCCCGAGAT 2 00 

GCTGGAGCTC GTTGAGCTGG AAATGCGCGA GCTTCTCAGC ACCTACGGAT 250 

4 0 TCGAGGGTGA GGAGACCCCC ATCGTCTTCG GCTCTGCTCT CTGCGCCATT 3 00 

GAGGACCGCC GCCCCGACAT CGGTACCGAG CGTATCGACG CTCTCCTCGA 3 50 

GGCCGTTGAC ACCTGGATCC CCACTCCCCA GCGTGACCTT GACAAGCCTT 4 00 

TCTTGATGTC CATTGAGGAA GTTTTCTCCA TCCCCGGTCG TGGTACCGTC 4 50 

GCCTCCGGCC GTGTCGAGCG TGGTCTCCTG AAGCGTGATA GCGAGGTTGA 500 

45 GATCATCGGT ACCACCAACG AGGTCATCAA GACCAAGGTT ACCGACATTG 550 

AGACCTTCAA GAAGTCCTGC TCCGAGTCCC GCGCCGGTGA CAACTCCGGT 600 

CTCCTGCTCC GTGGTGTCCG CCGTGAGGAT CTCCGCCGTG GTATGGTCAT 650 

TGCCGCTCCT GGCAGCGCCA AGGCCAACAG CAAGTTCATG GTCTCCATGT 700 

ACGTCCTGAC CGAGGCTGAR GGTGGTCGCC GTACCGGTTT CGGTGTCCAG 750 

50 TACCGTCCCC AGCTGTTCAT CCGCACTGCC GGTAAGTAAA ATTGCATTCT 800 

ATTCCGCTAC TAGGGAACCA TCTCTAATTC TATTTGCTAC AGATGAGGCT 850 

GCTGAGTTCA GCTTCCCCGA CGGAGACCAG TCCCGCCGTA TCATGCCCGG 900 

TGACAACGTC GAGATGATCG TCAAGACCCA CCGCCCCGTC GCCGCCGAGG 950 

CCGGTCAGCG CTTCAAC 967 

55 



2) INFORMATION FOR SEQ ID NO: 906 

60 (i) SEQUENCE CHARACTERISTICS: 

509 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 

(A) 
(B) 
(C) 
(D) 



PCT/CAOO/01150 



LENGTH: 852 bases 
TYPE: Nucleic acid 
STRANDEDNESS : Double 
TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolaris hawaiiensis 

(B) STRAIN: ATCC 26067 



Ixi^SEQUENCE - DESCRIPTION r-SEQ- ID-NO:- 906 



TGGTGCTATT 

15 GTGAGCATCT 
TTCGTTAACA 
CGAGATGGAG 
AGACCCCTAT 
CCCGACATTG 

20 TTGGATTCCC 
TCGAGGATGT 
GTCGAACGAG 
GGGCAGCGCA 
AGTCTTGCGA 

25 GGTGTTAAGC 
ACAGGTTAAG 
AGGAGGAAGG 
ATGTTCATCC 
CACCGAGGAT 

30 TGGTTTGCGA 
AA 



ATTGTTGTTG 
GCTGCTCGCC 
AGGTCGACGC 
ATGCGCGAAC 
CATCATGGGT 
GTGTCGAACG 
ACCCCTCAGC 
CTTCTCCATT 
GTATCCTGAA 
CCCATCAAGA 
GGAGTCCCGC 
GTGATGAAGT 
GCGCACAAGA 
TGGCCGACAC 
GCACTGCCGA 
GCCCACGACA 
GCTCCATGCA 



CCGCTTCCGA 
CGTCAGGTCG. 
TGTCGAGGAC 
TGCTCAGCAG 
TCTGCTCTCT 
AATTGACGAG 
GTGAGACCGA 
GCTGGTCGTG 
GCGCGATGCT 
CCAAGGTTAC 
GCTGGTGACA 
CCGCCGTGGT 
AGTTCCTTGT 
ACTGGCTTCG 
CGAGTCGTGT 
AGCTTGTCAT 
CCACACGTCT 



CGGTCAAATG 
GTGTTCAGAA 
AAGGAGATGT 
CTATGGCTTC 
GCGCCATTGA 
CTGCTCGAGG 
AAAGCCTTTC 
GTACCGTCGT 
GAAGTTGAGC 
CGATATCGAG 
ACTCCGGTCT 
ATGGTCGTTT 
CTCCATGTAT 
GTGAGAACTA 
GCCCTGTACT 
GCCCGGTGAC 
TGGAGACTGG 



CCCCAGACTC 
GATCGTTGTC 
TGGAGCTTGT 
GAGGGCGACG 
AGGCCGTCAA 
CTGTTGATAC 
CTCATGGCCG 
CTCTGGCCGT 
TTGTGGGCAA 
ACCTTCAAGA 
CCTTCTTCGT 
CCGTCCCTGG 
GTGCTGAGCA 
CAGGCCGCAA 
GGCCAGAAGG 
AACGTCGAGA 
TCAGCGCTTC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
852 



35 2 ) INFORMATION FOR SEQ ID NO: 907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 2 6947 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907 



GGTGCTATTG TTGTCGTTGC TGCTTCGGAT GGTCATGATG TATGGACAGG 50 

CCCTTTGCTA CTGAATGGTT TCAAGATCTC GCGCTTACAC GTATTATAAT 100 

AGGCCCCAGA CCCGGGAGCA CTTGCTGCTT GCCCGTCAGG TCGGTGTCCA 150 

GAAGATCGTC GTTTTTGTCA ACAAGATTGA TGCCGTTGAG GACCCTGAGA 200 

55 TGTTGGAGCT TGTCGAGTTG GAAATGCGCG AGCTCCTTAG CAGCTACGGC 250 

TTCGAGGGCG AAGAGACTCC CATCATCTTC GGTTCTGCTC TGTGTGCTTT 300 

GGAGGACCGT CGCCCCGACA TTGGTGCCGA GCGTATCGAC GAGCTCATGA 350 

AGGCCGTTGA CACCTGGATC CCTACCCCTC AGCGTGATCT TGACAAGCCT 400 

TTCCTCATGT CTGTCGAGGA AGTCTTCTCC ATCGCCGGTC GTGGTACCGT 450 

60 TGCCTCCGGC CGTGTCGAAC GTGGTATCCT GAAGAAGGAC AGCGAAGTCG 500 

510 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AGATCATCGG AGGTAGCTTC GATGCTACCA 
GAGACCTTCA AGAAGTCTTG TGACGAGTCC 
CTTGCTTCTC CGTGGTATCC GTCGTGAAGA 
TTGCTGCTCC TGGCAGCACC AAGGCCCACG 
5 TACGTTCTCA CTGAGGCTGA GGGTGGTCGT 
CTACCGCCCC CAGGTGTTCG TTCGCACTGC 
TCACTTAACG GTATTGATTA AGTTCTAACT 
GCTGACCTCA GCTTCCCCGA CGGTGATGAG 
TGACAACGTC GAGATGGTCC TCAAGACTCA 
10 CTGGCCAGCG CTTCAA 



PCT/CAOO/01150 



AGACCAAGGT 


CACCGACATT 


550 


CGCGCTGGTG 


ACAACTCTGG 


600 


CGTCCGCCGC 


GGAATGATCA 


650 


ACCAGTTCTT 


GGTGTCCATG 


700 


CGTACTGGCT 


TCGGCTCCAA 


750 


TGGTAAGTCA 


AGCCTTTTGC 


800 


GTTGTATCCT 


AGATGAGGCT 


850 


TCCCGGAGGG 


TGATGCCTGG 


900 


CCGCCCCATT 


GCTGCTGAGG 


950 






966 



2) INFORMATION FOR SEQ ID NO:. 908 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases ' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908 

30 GGTGCTATCA TCGTCGTTGC TGCTTCCGAT GGTCAGATGC CCCAGACCCG 50 

TGAGCACTTG CTGCTCGCCC GTCAGGTCGG TGTTCAGAAG ATCGTTGTCT 100 

TCGTCAACAA GGTCGATGCT GTCGAAGACC CGGAGATGTT GGAACTCGTC 150 

GAGATGGAGA TGCGTGAGTT ACTCACCAGC TACGGCTTCG AGGGCGACGA 200 

GACACCCATC ATCATGGGTT CCGCTCTATG CGCCATCGAG GGCCGCCAGC 250 

35 CCGAGATCGG TGTTACCAAG GTCGACGAGC TAATGGACGC TGTCGACTCA 300 

TGGATCCCCA CCCCTCAGCG TGAGACCGAG AAGCCTTTCC TCATGGCTGT 350 

TGAGGATGTC TTCTCGATTG CTGGACGTGG TACCGTCGTT TCGGGCCGTG 400 

TCGAGCGCGG TATCTTGAAG CGTGACGCTG AAGTCGAGCT TGTCGGCAAG 450 

GGCACCGCGC CAATCAAGAC CAAGGTCACT GATATTGAGA CCTTCAAGAA 500 

40 GTCGTGCGAG GAGTCGCGCG CGGGTGATAA CTCCGGTCTT CTCCTCCGTG 550 
GTGTCAAGCG TGATGACGTT CGCCGCGGTA TGGTTGTTTC CGTTCCCGGA . 600 

CAAGTCAAGG CTCACAAGAA GTTCCTTGTC TCCATGTACG TTCTAAGCAA 650 

AGAGGAGGGT GGTCGTCACA CCGGCTTCGG CGAGAACTAC AGGCCGCAAA 700 

TGTTCATCCG AACTGCCGAT GAATCCTGCG CACTTCACTT CCCAGAGGGT 750 

45 ACCGAGGATG CGCACGACAA GCTAGTTATG CCCGGTGACA ACGTCGAGAT 800 

GGTCTGCGAA CTCCACCAGC CCCACGTTCT AGAGACCGGT CAGCG 845 



50 2) INFORMATION FOR SEQ ID NO: 909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

511 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(A) ORGANISM: Penicillium marneffei 1 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909 

5 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

10 TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

GATTGGCGAA - CAGAAGATTG-ACGAGCTCAT— GAAGGGGGT-T-GATACC-TGGA 350- 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 4 00 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

15 GCGTGGTATT TTGCGCAAGG ATTCTGAGGT, TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

20 GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 

25 



2) INFORMATION FOR SEQ ID NO: 910 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910 



45 



50 



55 



60 



CGCTGTTGTT 
AGCTGCCAAT 
CACTTGCTCC 
CAACAAGGTT 
TGGAAATGCG 
CCTATCATTT 
GATTGGCGAA 
TCCCCACCCC 
GAAGTTTTCT 
GCGTGGTATT 
AGAACCCTAT 
TGCGATGAAT 
CAAGCGTGAG 
CCAAGGCTCA 
GAAGGTGGTC 
CATCCGTACT 
GCTAACGAGT 



GTCGTCGCTG 
TATGGACACT 
TCGCCCGTCA 
GATGCCGTCG 
TGAACTCTTG 
TCGGATCCGC 
CAGAAGATTG 
CCAGCGTGAC 
CCATCTCTGG 
TTGCGCAAGG 
CAAGACCAAG 
CTCGTGCTGG 
GACATTCGTC 
TGACAACTTC 
GTCGTACTGG 
GCCGGTATGT 
TATAGATGAG 



CTTCTGATGG TCAAATGTAA 
GCTGATAAGA ATAGGCCCCA 
GGTCGGTGTT CAAAAGATCG 
AGGACCCCGA GATGTTGGAA 
ACCACCTACG GTTTCGAGGG 
TCTTTGCGCC TTGGAAGGCC 
ACGAGCTCAT GAACGCCGTT 
CTTGACAAGC CCTTCTTGAT 
TCGTGGTACC GTTGCATCTG 
ATTCTGAGGT TGAGATTATC 
GTTACCGACA TTGAGACCTT 
TGACAACTCT GGCTTGCTTC 
GTGGTATGGT TATCGCTGCT 
TTGGTCTCCA TGTATGTCTT 
ATTCGGCGCC AACTACCGTC 
TCCCTTTCAA AGTCAATTAA 
GCTGCTACTC TCAGCTTCCC 
512 



CATATCCACG 50 

AACCCGTGAG 100 

TCGTCTTCGT 150 

CTTGTCGAAT 200 

TGAAGAGACC 250 

GCAAGCCCGA 300 

GATACCTGGA 350 

GTCCGTTGAG 400 

GTCGTGTTGA 450 

GGATACCAGA 500 

CAAGAAGTCT 550 

TCCGTGGTAT 600 

CCTGGAACCA 650 

GACTGAGGCT 700 

CTCAAGCTTT 750 

TGAGCGATTT 800 

CGGTGACGAT 850 
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CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGA^GAC 900* 
ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 



2) INFORMATION FOR SEQ ID NO: 911 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
10 (B) TYPE: Nucleic acid 

( C)_ STRA NDEDNESS : Singl e 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911 
GACGGMKKCA TGCCGCARAC 20 

2) INFORMATION FOR SEQ ID NO: 912 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912 
GACGGCGKCA TGCCGCARAC 2 0 

35 



2) INFORMATION FOR SEQ ID NO: 913 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913 
50 GACGGYSYCA TGCCKCAGAC 20 



2) INFORMATION FOR SEQ ID NO: 914 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
60 (D) TOPOLOGY: Linear 

513 
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35 



40 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914 
GAARAGCTGC GGRCGRTAGT G 21 



10 2) INFORMATION FOR SEQ ID NO: 915 



(T) _ SEQUENCE "XHARACTERISTICSl 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single- 
CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915 

AAACGCCTGA GGRCGGTAGT T 21 



2) INFORMATION FOR SEQ ID NO: 916 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916 
GCCGAGCTGG CCGGCTTCAG 20 

2) INFORMATION FOR SEQ ID NO: 917 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917 
TCGTGCTACC CGTYGCCGCC AT 22 

55 

2) INFORMATION FOR SEQ ID NO: 918 

60 (i) SEQUENCE CHARACTERISTICS: 

5U 
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(A) LENGTH: 1391 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
10 (E) ACCESSION NUMBER: J01672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOT~9T8 

AGAGAAGCCT GTCGGCACCG TCTGGTTTGC TTTTGCCACT GCCCGCGGTG 50 

15 AAGGCATTAC CCGGCGGGAT GCTTCAGCGG CGACCGTGAT GCGGTGCGTC 100 

GTCAGGCTAC TGCGTATGCA TTGCAGACCT TGTGGCAACA ATTTCTACAA 150 

AACACTTGAT ACTGTATGAG CATACAGTAT AATTGCTTCA ACAGAACATA 200 

TTGACTATCC GGTATTACCC GGCATGACAG GAGTAAAAAT GGCTATCGAC 250 

GAAAACAAAC AGAAAGCGTT GGCGGCAGCA CTGGGCCAGA TTGAGAAACA 300 

20 ATTTGGTAAA GGCTCCATCA TGCGCCTGGG TGAAGACCGT TCCATGGATG 350 

TGGAAACCAT CTCTACCGGT TCGCTTTCAC TGGATATCGC GCTTGGGGCA 400 

GGTGGTCTGC CGATGGGCCG TATCGTCGAA ATCTACGGAC CGGAATCTTC 450 

CGGTAAAACC ACGCTGACGC TGCAGGTGAT CGCCGCAGCG CAGCGTGAAG 500 

GTAAAACCTG TGCGTTTATC GATGCTGAAC ACGCGCTGGA CCCAATCTAC 550 

2 5 GCACGTAAAC TGGGCGTCGA TATCGACAAC CTGCTGTGCT CCCAGCCGGA 600 

CACCGGCGAG CAGGCACTGG AAATCTGTGA CGCCCTGGCG CGTTCTGGCG 650 

CAGTAGACGT TATCGTCGTT GACTCCGTGG CGGCACTGAC GCCGAAAGCG 700 

GAAATCGAAG GCGAAATCGG CGACTCTCAC ATGGGCCTTG CGGCACGTAT 750 

GATGAGCCAG GCGATGCGTA AGCTGGCGGG TAACCTGAAG CAGTCCAACA 800 

30 CGCTGCTGAT CTTCATCAAC CAGATCCGTA TGAAAATTGG TGTGATGTTC 850 

GGTAACCCGG AAACCACTAC CGGTGGTAAC GCGCTGAAAT TCTACGCCTC 900 

TGTTCGTCTC GACATCCGTC GTATCGGCGC GGTGAAAGAG GGCGAAAACG 950 

TGGTGGGTAG CGAAACCCGC GTGAAAGTGG TGAAGAACAA AATCGCTGCG 1000 

CCGTTTAAAC AGGCTGAATT CCAGATCCTC TACGGCGAAG GTATCAACTT 1050 

35 CTACGGCGAA CTGGTTGACC TGGGCGTAAA AGAGAAGCTG ATCGAGAAAG 1100 

CAGGCGCGTG GTACAGCTAC AAAGGTGAGA AGATCGGTCA GGGTAAAGCG 1150 

AATGCGACTG CCTGGCTGAA AGATAACCCG GAAACCGCGA AAGAGATCGA 1200 

GAAGAAAGTA CGTGAGTTGC TGCTGAGCAA CCCGAACTCA ACGCCGGATT 1250 

TCTCTGTAGA TGATAGCGAA GGCGTAGCAG AAACTAACGA AGATTTTTAA 1300 

40 TCGTCTTGTT TGATACACAA GGGTCGCATC TGCGGCCCTT TTGCTTTTTT 1350 

AAGTTGTAAG GATATGCCAT GACAGAATCA ACATCCCGTC G 1391 



45 2) INFORMATION FOR SEQ ID NO: 919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919 

GGICCIGART CITMIGGIAA RAC 23 



60 
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2) INFORMATION FOR SEQ ID NO: 920 



PCT/CA00/01150 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 920 



50 



TCICCVATIT CICCITCIAI YTC 23 



2) INFORMATION FOR SEQ ID NO: 921 

(i) SEQUENCE CHARACTERISTICS: 
20. (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921 
TIYRTIGAYG CIGARCAIGC 20 

30 

2) INFORMATION FOR SEQ ID NO: 922 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922 
45 TARAAYTTIA RIGCIYKICC ICC 23 



2) INFORMATION FOR SEQ ID NO: 923 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923 

60 
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GACGCIGCCA TCCTGATGAT C 21 



5 2) INFORMATION FOR SEQ ID NO: 924 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



20 



30 



35 



60 



(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924 

ACCTCAGTCG TCACGTTGGC G 21 



2) INFORMATION FOR SEQ ID NO: 925 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925 
AAGCAGATGG TTGTGTGCTG 20 

2) INFORMATION FOR SEQ ID NO: 926 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926 
CAGCTGCTCG TGGTGCATCT CGAT 24 

50 



2) INFORMATION FOR SEQ ID NO: 927 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927 
5 ACGCGGAGAA GGTGCGCTT 19 



2) INFORMATION FOR SEQ ID NO: 928 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :~22~bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928 

20 

GGTCGTTCTT CGAGTCACCG CA 22 



25 2) INFORMATION FOR SEQ ID NO: 929 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 

(B) STRAIN: ATCC 25285 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929 

40 

TTCAGCATGC CATTTCAAAA CAGGCCGAAG CCGATATCGT GATTATCGCT 50 

GCTTGTGGGG AGCGTGCAAA TGAAGTTGTG GAAATCTTTA CCGAATTTCC 100 

GGAATTGGTG GACCCGCACA CGGGACGTAA GCTGATGGAG CGTACCATTA 150 

TTATCGCAAA TACATCGAAC ATGCCGGTAG CAGCGCGTGA AGCTTCTGTG 200 

45 TATACGGCCA TGACGATTGC CGAATACTAT CGTGCCATGG GATTGAAAGT 250 

CCTGCTGATG GCAGACTCCA CTTCCCGTTG GGCGCAGGCA TTGCGTGAGA 300 

TGTCGAACCG TATGGAGGAG TTGCCCGGAC CGGATGCATT CCCGATGGAC 350 

CTGTCCTCAA TCATTTCTAA CTTCTATGGC CGTGCAGGCT ACGTGAAACT 400 

GAATAACGGC GAGAGCGGTT CTATTACCTT TATCGGTACA GTATCACC 448 

50 



2) INFORMATION FOR SEQ ID NO: 93 0 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 
5 (B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930 

GCTATCTCTA AACAAGCGGA AGCGGATATC GTGATTATCG CCGCCTGCGG 50 
10 TGAGCGTGCG AATGAGGTCG TAGAGGTATT TACGGAGTTC CCGGAATTGG 100 
TAGACCCGCA TACGGGACGT AAATTGATGG AACGTACGAT CATTATCGCC 150 



AATACATCCA ACATGCCGGT AGCCGCTCGT GAGGCATCCG~TATATACGGC 200 

GATGACCATC GCCGAGTATT ATCGCAGCAT GGGTTTGAAG GTTCTGTTGA 250 

TGGCCGACTC TACTTCCCGC TGGGCACAGG CTTTGCGTGA GATGTCCAAC 300 

15 CGTTTGGAGG AGTTGCCGGG ACCGGATGCT TTCCCGATGG ACTTGTCCGC 350 

TATCGTGGCG AACTTCTACG CTCGTGCGGG ATTCGTTCAT TTGAATAACA 400 

ACGCTACAGG CTCCGTCACT TTCATCGGTA CGGTATCG 438 



20 



30 



2) INFORMATION FOR SEQ ID NO: 931 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 453 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 

35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931 

CCTCCAGCAC GCTATCTCTA AGCAGGCGGA GGCTGATATC GTCATTATGG 50 

CAGCCTGCGG TGAGCGTGCT AATGAGGTGG TGGAGATCTT TGCCGAGTTC 100 

CCTGAGCTCG AAGACCCACA CACGGGACGC AAGCTGATGG AGCGTACGAT 150 

40 CATCATCGCT AACACGAGTA ACATGCCAGT GGCTGCTCGT GAGGCTTCGG 200 

TCTACACCGC TATGACCATC GCTGAGTACT ACCGCTCGAT GGGTCTCAAA 250 

GTACTCCTAA TGGCTGACTC GACCTCTCGC TGGGCACAGG CACTGCGTGA 300 

GATGTCTAAC CGTCTAGAGG AGCTGCCTGG ACCAGATGCA TTCCCGATGG 350 

ACTTGTCGGC TATCGTGGCA AACTTCTACG CTCGTGCCGG CTTCGTCTAT 400 

45 CTCAACAACG GTGAGACAGG TTCTGTAACC TTCATCGGTA CGGTCTCTCC 450 

AGC 453 



50 2) INFORMATION FOR SEQ ID NO: 932 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932 

5 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

GTGAACATAT CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCATGAACA AATGTGACAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA ATTCGTGATC TATTAACTGA ATATGAATTC CCTGGCGATG 200 

10 ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA AGGTGAAGCT 250 

GACTGGGAAG CTAAAATTGA CGAGTTAATG GAAGCTGTAG ATTCTTACAT 300 

TCCAACTCCW GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG 3-50- 

ATGTATTCTC AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA 400 

CGTGGACAAG TTAAAGTTGG TGACGAAGTA GAAGTTATCG GTATCGAAGA 450 

15 AGAAAGCAAA AAAGTAGTAG TAACTGGAGT, AGAAATGTTC CGTAAATTAC 500 

TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT ACGTGGTGTT 550 

GCTCGTGAAG ATATCCAACR TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 600 

TACTCCACAC ACTAACTTCA AAGCTGAAAC TTATGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT 700 

20 TTCCGTACTA CTGACGTAAC TGGTATTGTT ACACTTCCAG AAGGTACTGA 750 

AATGGTAAYG CCTGGTGATA ACATTGAGCT TGCAGTTGAA CTAATTGCAC 800 

CAATCGCTAT CGAAGACGGT ACTAAATTCT CTATC 835 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 933 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 
CATCATCGTI TTCMTGAACA ARTG 24 

2) INFORMATION FOR SEQ ID NO: 934 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934 
TCACGYTTRR TACCACGCAG IAGA 24 

55 

2) INFORMATION FOR SEQ ID NO: 935 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935 
10 GGIAARWSIC ARYTITGYCA YAC 23 



15 



2 ) INFORMATION FOR SEQ ID NO: 936 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936 

25 

TCISIYTCIG GIARRCAIGG 20 



30 2) INFORMATION FOR SEQ ID NO: 937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937 

ATIACIGARG YITTYGGIGA RTT 23 

45 2) INFORMATION FOR SEQ ID NO: 938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938 

CYIGTIGYIS WIGCRTGIGC 20 



60 
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2) INFORMATION FOR SEQ ID NO: 939 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1203 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) — ORGAN-ISM-:- Saccharomyces-cerevisiae 

(C) ACCESSION NUMBER: D10023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 9 

ATGTCTCAAG TTCAAGAACA ACATATATCA GAGTCACAGC TTCAGTACGG 50 

GAACGGTTCG TTGATGTCCA CTGTACCAGC AGACCTTTCA CAGTCAGTCG 100 

TTGATGGAAA CGGCAACGGT AGCAGCGAAG ATATTGAGGC CACCAACGGC 150 

20 TCCGGCGATG GTGGCGGATT GCAGGAGCAA GCGGAAGCGC AAGGTGAAAT 200 

GGAGGATGAA GCATACGATG AAGCTGCCTT AGGTTCGTTT GTGCCAATAG 250 

AAAAACTGCA AGTGAACGGG ATTACTATGG CGGATGTGAA AAAACTAAGG 300 

GAGAGTGGGC TTCACACTGC TGAAGCGGTA GCATATGCTC CCAGAAAGGA 3 50 

TTTATTGGAA ATCAAAGGTA TATCGGAAGC TAAGGCAGAT AAGTTGCTAA 4 00 

2 5 ACGAAGCGGC AAGGCTAGTG CCTATGGGAT TTGTCACGGC TGCTGATTTT 450 

CATATGAGAA GATCGGAGCT GATTTGTTTG ACAACGGGTT CTAAGAATTT 500 

GGACACTCTT TTGGGTGGTG GTGTGGAAAC TGGTTCTATT ACTGAGCTTT 550 

TCGGTGAATT CAGGACAGGT AAGTCCCAGC TATGTCACAC TTTGGCCGTG 600 

ACATGCCAAA TTCCATTGGA TATTGGTGGC GGTGAAGGTA AGTGTTTGTA 650 

3 0 TATCGATACC GAAGGTACTT TCAGGCCGGT AAGATTGGTA TCCATAGCTC 700 

AGCGGTTCGG ATTAGACCCG GATGATGCTT TGAACAACGT TGCGTATGCA 750 

AGAGCCTATA ACGCCGATCA TCAGTTAAGA CTTCTGGATG CTGCTGCCCA 800 

AATGATGAGC GAGTCTCGGT TTTCCTTGAT TGTGGTCGAT TCTGTTATGG 850 

CTCTATACCG TACGGATTTT TCTGGTCGTG GTGAACTAAG CGCAAGGCAA 900 

35 ATGCATTTAG CCAAATTTAT GCGTGCTTTG CAAAGGCTGG CCGACCAATT 950 

TGGTGTTGCA GTCGTCGTTA CTAACCAAGT GGTCGCCCAA GTTGATGGTG 1000 

GTATGGCTTT TAATCCAGAT CCAAAGAAGC CTATCGGTGG TAATATTATG 1050 

GCACATTCTT CCACCACGCG ATTAGGTTTC AAAAAGGGTA AGGGATGTCA 1100 

AAGATTATGC AAAGTTGTTG ACTCACCTTG CTTACCAGAG GCTGAATGTG 115 0 

40 TGTTCGCGAT CTATGAAGAT GGTGTTGGTG ACCCCAGAGA AGAAGACGAG 12 0 0 

TAG 1203 



45 2) INFORMATION FOR SEQ ID NO: 94 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: GRF88 

(C) ACCESSION NUMBER: M87549 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CGATCCAATT GCTGGTCTTA AGATGCATTT GATTGATCTA GGTATTGCC^ 50 

CTGAAGCTGA AGTCAAAGCT TACGACAAGT CCGCTAGAAA ATACGTTGAC 100 

GAACAAGTTG AATTAGCTGA TGCTGCTCCT CCTCCAGAAG CCAAATTATC 150 

5 CATCTTGTTT GAAGACGTCT ACGTGAAAGG TACAGAAACT CCAACCCTAA 200 

GAGGTAGGAT CCCTGAAGAT ACTTGGGACT TCAAAAAGCA AGGTTTTGCC 250 

TCTAGGGATT AATTAAATCG TAAGGAAAAA TAAAATAATA GTGCTGTGAT 300 

CGCATGATAT TCTTCCCTGG AAGCGCCATT TTATAGCAAG AAATGTAAGT 350 

CAAGTATATT TTAACTGTAT ATACAACAAT ATGACTCTTT TTTATGCCTT 400 

10 GTTGTTTTTC TTCGGGTTTT CCCACACATT GTGTGGAGAG ATAGTTATTA 450 

ACAGA CCGAA AATAGCCGCC CAAGGATAAA CTTTTATATA AAGGGAAGGG 500 

TAGTTGACCTC - fifiSSATTTGG~ATTCTACTTT ~CCAGATTTAC _ TTTCACCCTT 5 5 0 

TTATATTTGC TGTAGTCTGT TATGCCAATC AGGAAAGCAT TTGAACAAAT 600 

ATGTCTGTTA CAGGAACTGA GATCGATAGT GATACAGCAA AAAATATTCT 650 

15 TAGTGTAGAT GAATTACAGA ACTATGGTAT- TAATGCCTCA GATCTTCAAA 700 

AATTGAAGTC TGGTGGGATA TACACAGTCA ATGTATGTTA TAATAACATT 750 

TTTAAAACCT CTGCTGTAGA GGTTCTTTCC CCCTTTCTTT TACTAACTAA 800 

TAATTTGGAA AGGAACTTTT ATAGACCGTT TTGTCAACAA CAAGAAGACA 850 

TCTATGTAAA ATTAAAGGGT TAAGTGAGGT GAAAGTGGAA AAAATTAAAG 900 

20 AAGCTGCTGG AAAAATCATA CAGGTGGGAT TTATCCCCGC TACTGTACAA 950 

TTAGATATAA GACAGCGTGT GTACTCCTTA TCAACTGGAT CTAAGCAACT 1000 

AGATTCAATC CTAGGTGGTG GAATAATGAC AATGAGTATC ACTGAAGTAT 1050 

TTGGTGAATT TAGGTGTGGT AAGACACAGA TGTCTCATAC TTTGTGTGTT 1100 

ACCACGCAGC TTCCGAGGGA AATGGGTGGT GGTGAAGGGA AAGTAGCATA 1150 

25 TATTGATACA GAAGGCACTT TCAGGCCCGA GAGGATTAAG CAAATTGCAG 1200 

AAGGTTATGA ATTGGATCCC GAGTCATGTT TGGCAAACGT TTCATATGCT 1250 

AGAGCCTTGA ATAGTGAACA TCAAATGGAA CTTGTTGAAC AATTGGGTGA 1300 

AGAACTTAGT TCTGGAGATT ATCGCCTTAT CGTGGTAGAT TCTATAATGG 1350 

CAAACTTCAG AGTAGACTAC TGCGGTAGAG GTGAACTAAG CGAAAGACAG 1400 

30 CAAAAGCTAA ATCAACATCT TTTCAAATTG AATAGATTGG CAGAGGAATT 1450 

TAATGTTGCA GTATTTCTGA CAAACCAAGT TCAATCAGAC CCAGGTGCTT 1500 

CTGCATTATT TGCCTCGGCA GATGGTAGGA AACCAATTGG AGGGCACGTT 1550 

CTGGCACATG CGTCAGCAAC AAGGATTTTG TTGAGAAAAG GGCGTGGTGA 1600 

CGAAAGAGTT GCCAAGTTAC AAGATTCCCC AGATATGCCT GAAAAAGAAT 1650 

3 5 GTGTCTACGT AATTGGTGAA AAAGGTATTA CCGATTCAAG TGACTAGTTT 1700 

TTGTATACTT TTTTAATGAA GATGACATTG CTCCTTTATT AAACTTTTCT 1750 

TTTACTTTGT GTTACTAATA TTATTAATAT CTTGTTATGA TTCTTTGTTT 1800 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(-A-) — ORGANISM-:— Gryptococcus-humi-eol-us 

(B) STRAIN: ATCC 38294 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941 

CGTCCTTATC CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

TCTCCGTCTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGACCTG 100 

TACCACGAGA TGCGTGAGAC TGGTGTCATC AACCTCGAGG GCGACTCCAA 150 

20 GGTCGCTCTC GTCTTCGGCC AGATGAACGA GCCCCCCGGA GCCCGTGCCC 200 

GTGTCGCCCT TACCGGCCTC ACCATCGCCG AGTACTTCCG TGACGAGGAG 250 

GGTCAGGACG TGCTTCTCTT CATCGACAAC ATTTTCCGTT TCACCCAGGC 3 00 

CGGTTCCGAG GTGTCTGCCC TTCTCGGTCG TATCCCCTCG GCCGTCGGTT 350 

ACCAGCCCAC CCTCGCTACC GACATGGGTT CCATGCAGGA GCGTATCACC 400 

25 ACCACCAAGA AGGGTTCGAT TACCTCCGTC 430 



2) INFORMATION FOR SEQ ID NO: 942 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942 

45 CGTGTACGAT GCTCTTGAGG TGCAAAATGG TAATGAGCGT CTGGTGCTGG 50 

AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TGCGTACCAT CGCAATGGGT 100 

TCCTCCGACG GTCTGCGTCG CGGTCTGGAT GTAAAAGACC TCGAACACCC 150 

GATCGAAGTC CCGGTAGGTA AAGCGACTCT GGGCCGTATC ATGAACGTAC 2 00 

TGGGTGAACC GGTCGACATG AAAGGCGAGA TCGGTGAAGA AGAGCGTTGG 250 

50 GCGATTCACC GCGCAGCACC TTCCTACGAA GAGCTGTCAA ACTCTCAGGA 300 

ACTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT CCGTTCGCTA 350 

AGGGCGGTAA AGTTGGTCTG TTCGGTGGTG CGGGTGTAGG TAAAACCGTA 400 

AACATGATGG AGCTCATTCG TAACATCGCG ATCGAGCACT CCGGTTACTC 450 

TGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGGAAC GACTTCTACC 500 

55 ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT GGTGTATGGC 550 

CAGATGAACG AGCCGCCGGG AAACCGTCTG CGCGTAGCTC TGACCGGTCT 600 

GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTT CTGCTGTTCG 650 

TTGACAACAT CTATCGTTAC ACCCTGGCCG GTACGGAAGT ATCCGCACTG 700 

CTGGGCCGTA TGCCTTCAGC GGTAGGTTAT CAGCCGACCC TGGCGGAAGA 750 

60 GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAAACC GGTT 794 
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2} INFORMATION FOR SEQ ID NO: 94 3 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 



— (iipMOLECULE~TYPE:-"Genomrc-DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943 

20 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTACGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATTGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

25 AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 4 50 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

30 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 6 00 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

35 CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 8 00 

ACTGGTTCTA TCAC 814 



40 2) INFORMATION FOR SEQ ID NO: 944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944 



GTGTACGATG CTCTTGAGGT GCAAAATGGT AATGAGCGTC TGGTGCTGGA 50 

AGTTCAGCAG CAGCTCGGCG GCGGTATCGT GCGTACCATC GCAATGGGTT 100 

CCTCCGACGG TCTGCGTCGC GGTCTGGATG TAAAAGACCT CGAACACCCG 150 

ATCGAAGTCC CGGTAGGTAA AGCGACTCTG GGCCGTATCA TGAACGTACT 2 00 

60 GGGTGAACCG GTCGACATGA AAGGCGAGAT CGGTGAAGAA GAGCGTTGGG 250 
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CGATTCACCG CGCAGCACCT TCCTACGAAG AGCTGTCAAA CTCTC^GGAA 3 0/0 

CTGCTGGAAA CCGGTATCAA AGTTATCGAC CTGATGTGTC CGTTCGCTAA 350 

GGGCGGTAAA GTTGGTCTGT TCGGTGGTGC GGGTGTAGGT AAAACCGTAA 4 00 

ACATGATGGA GCTTATTCGT AACATCGCGA TCGAGCACTC CGGTTACTCT 4 50 

5 GTGTTTGCGG GCGTAGGTGA ACGTACTCGT GAGGGTAACG ACTTCTACCA 500 

CGAAATGACC GACTCCAACG TTATCGACAA AGTATCCCTG GTGTATGGCC 550 

AGATGAACGA GCCGCCGGGA AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

ACCATGGCTG AGAAATTCCG TGACGAAGGT CGTGACGTTC TGCTGTTCGT 650 

TGACAACATC TATCGTTACA CCCTGGCCGG TACGGAAGTA TCCGCACTGC 700 

10 TGGGCCGTAT GCCTTCAGCG GTAGGTTATC AGCCGACCCT GGCGGAAGAG 750 

ATGGGCGTTC TGCAGGAACG TATCACCTCC ACCAAAACCG GTTCTATC 798 



30 



15 2) INFORMATION FOR SEQ ID NO: 945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945 



ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

35 ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC GCGATCGAGC 4 50 

40 ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CTTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTATTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACCGA 700 

45 AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



50 



60 



2) INFORMATION FOR SEQ ID NO: 946 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 
, (B) STRAIN: ATCC 43768 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946 

GCGACGCTAT CCCGCATGTT TACGATGCCC TGAAATTGGA CGAGAACGGT 50 

CTGACTCTGG AAGTTCAACA ACTTCTGGGT GACGGCGTTG TCCGTACTAT 100 

TGCAATGGGT AGTTCAGACG GCCTGAAACG CGGCATGTCT GTAAGCAATA 150 

10 CTGGTGCGCC AATCACTGTG CCGGTAGGTA AAGGTACTTT GGGTCGTATT 200 

GTCGACGTAT TGGGTACGCC TGTTGATGAA GCAGGTCCGA TCGATACCGA 250 

CAAGAGCCGT-GCCATTCACe AAACTGCTCG- GAAA-T-TGGAG— GAGTTGTGTG 300- 

CAACTACCGA ATTGTTGGAA ACCGGTATTA AAGTGATCGA CTTGCTGTGT 350 

CCGTTTGCTA AAGGCGGTAA AGTAGGTCTG TTCGGTGGTG CCGGTGTAGG 4 00 

15 CAAAACCGTG AACATGATGG AATTGATCAA. CAACATCGCC AAAGCGCACA 450 

GCGGTCTGTC CGTGTTCGCA GGTGTGGGCG AGCGTACCCG TGAAGGTAAC 500 

GACTTCTACC ACGAGATGAA AGATTCCAAC GTATTGGATA AAGTGGCAAT 550 

GGTTTACGGT CAGATGAACG AACCTCCGGG CAACCGTTTG CGCGTCGCAT 600 

TGACCGGTTT GACCATGGCG GAATACTTCC GTGACGAAAA AGACGAA&AC 650 

20 GGTAAAGGTC GCGACGTATT GTTCTTCGTT GACAACATCT ACCGTTACAC 7 00 

TCTGGCCGGT ACCGAAGTAT CTGCACTGTT GGGCCGTATG CCTTCTGCAG 750 

TGGGTTACCA ACCGACATTG GCTGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACCTCTA CCCAAACCGG TTCCATTACT TC 832 



25 
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2) INFORMATION FOR SEQ ID NO: 947 



(i) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947 



TCCGCGCGAT GCCATTCCGC ATGTTTACGA CGCCCTGAAA TTGGATGCAA 50 

ACGGCCTGAC TTTGGAAGTA CAACAGCTTC TGGGCGACGG CGTGGTTCGT 100 

4 5 ACTATTGCAA TGGGTAGTTC GGACGGTCTG AAACGCGGCA TGACTGTAAG 150 

CAATACAGAT GCGCCGATTA CTGTGCCGGT AGGTAAAGGT ACTTTGGGAC 200 

GTATTGTCGA TGTGTTGGGT ACACCTGTTG ATGAAGCAGG TCCGATTGAT 250 

ACCGACAAAC ACCGTGCTAT CCATCAGACA GCTCCGAAAT TCGATGAGTT 300 

GTCTGCTACT ACCGAGCTGC TGGAAACAGG CATTAAAGTG ATTGACTTGC 350 

50 TGTGTCCGTT TGCCAAAGGC GGTAAAGTAG GTCTGTTCGG TGGTGCCGGT 4 00 

GTAGGCAAAA CCGTCAACAT GATGGAATTG ATTAACAACA TCGCCAAAGC 4 50 

GCATAGTGGT TTGTCCGTGT TCGCCGGTGT GGGGGAACGT ACCCGTGAAG 500 

GTAACGACTT CTACCACGAG ATGAAAGATT CCAACGTATT GGACAAAGTG 550 

GCGATGGTTT ACGGTCAGAT GAACGAACCT CCGGGTAACC GTCTGCGTGT 600 

55 AGCCTTGACC GGTTTGACGA TGGCCGAATA CTTCCGTGAT GAAAAAGACG 650 

AAAGCGGCAA AGGTCGCGAC GTATTGTTCT TCGTGGACAA CATTTACCGT 700 

TACACTCTGG CCGGTACAGA AGTATCCGCA TTGCTCGGTC GTATGCCTTC 750 

AGCAGTAGGT TACCAACCGA CATTGGCTGA AGAAATGGGT CGTCTGCAAG 800 

AGCGTATTAC CCTCTACTCA AACAGGCTCC ATTACTTCTA 84 0 
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2) INFORMATION FOR SEQ ID NO: 948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 
15 (B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948 

GCAGCTGGCG ACAAGCTACC TGAGATCAAT AATGCACTTG TAGTCTATAA 50 

20 AAATGACGAA AAAAAATCAA AAATCGTCCT TGAAGTAGCT CTTGAGCTTG 100 

GTGATGGAGT GGTTCGGACC ATCGCTATGG AATCAACGGA TGGGTTGACT 150 

CGTGGCATGG AAGTGCTAGA TACTGGCCGT CCAATTTCTG TGCCAGTCGG 2 00 

CAAAGAAACA CTTGGTCGCG TCTTTAACGT TTTGGGAGAT ACCATTGACT 250 

TGGATGCTCC TTTTGCGGAT GATGCAGAGC GCCAGCCAAT CCATAAGAAA 300 

25 GCTCCAACCT TTGATGAGTT GTCTACTTCT TCAGAGATCT TAGAGACAGG 350 

TATCAAGGTT ATCGACCTGT TAGCCCCTTA TCTGAAAGGT GGTAAAGTTG 400 

GACTCTTCGG TGGTGCCGGA GTTGGTAAGA CCGTCCTGAT TCAAGAATTG 450 

ATCCACAACA TTGCCCAAGA ACACGGTGGT ATTTCTGTAT TTACTGGCGT 500 

TGGGGAACGT ACCCGTGAAG GGAATGACCT TTATTGGGAA ATGAAAGAGT 550 

30 CTGGTGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAGAT GAATGAGCCG 600 

CCAGGAGCGC GTATGCGGGT TGCTTTGACT GGTTTGACGA TTGCAGAATA 650 

CTTCCGTGAT GTGGAAGGTC AAGATGTCTT GCTCTTCATT GACAACATCT 700 

TCCGTTTCAC GCAGGCAGGT TCTGAAGTTT CTGCCCTTTT GGGTCGGATG 750 

CCGTCAGCCG TTGGTTACCA ACCAACACTT GCGACAGAAA TGGGGCAATT 800 

35 GCAAGAGCGT ATCACATCGA CTAAGAAGGG TTCTGTAACC TCT 843 



2) INFORMATION FOR SEQ ID NO: 94 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 4 9456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GAGATGGTAT GGTCCGTACT ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAATGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GATGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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GCTCCAACTT TTGATGAATT GTCTACCTCT TCTGAAATCC TTGAAACAGG 350 

GATTAAGGTT ATCGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAGGTTG 4 00 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG 4 50 

ATTCACAACA TTGCCCAAGA ACACGGTGGT ATTTCAGTAT TTACCGGTGT 500 

5 TGGGGAACGT ACTCGTGAGG GTAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGTGTTAT CGAGAAAACA GCCATGGTAT TTGGTCAGAT GAATGAGCCG 600 

CCAGGAGCAC GTATGCGTGT TGCCCTAACT GGTTTGACAA TCGCCGAATA 650 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 700 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

10 CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAGCGT ATTACATCAA CTAAAAAGGG TTCTGTAACC T 841 



15 2) INFORMATION FOR SEQ ID NO: 950 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950 



GCTACCTGAG ATCAATAATG CACTTGTAGT CTATAAAAAT GACGAAAATA 50 

AATCAAAAAT CGTCCTTGAA GTAGCTCTTG AGCTTGGTGA TGGAGTGGTT 100 

CGGACCATCG CTATGGAATC AACGGATGGG TTGACTCGTG GCATGGAAGT 150 

GCTAGATACT GGTCGTCCAA TTTCTGTGCC AGTCGGCAAA GAAACACTTG 200 

35 GTCGCGTCTT TAACGTTTTG GGAGATACCA TTGACTTGGA TGCTCCTTTT 250 

GCGGATGATG CAGAGCGCCA GCCAATCCAT AAGAAAGCTC CAACCTTTGA 3 00 

TGAGTTGTCT ACTTCATCAG AGATCTTAGA GACAGGTATC AAGGTTATCG 350 

ACCTGTTAGC ACCTTATCTG AAAGGTGGTA AAGTCGGACT CTTCGGTGGT 4 00 

GCCGGAGTTG GTAAGACCGT CCTGATTCAG GAATTGATCC ACAACATTGC 450 

4 0 CCAAGAGCAT GGTGGTATTT CCGTGTTTAC CGGTGTTGGG GAACGTACCC 500 

GTGAAGGGAA TGACCTTTAC TGGGAAATGA AGGAGTCTGG CGTTATCGAG 550 

AAAACAGCCA TGGTCTTCGG TCAGATGAAT GAGCCACCAG GAGCGCGTAT 600 

GCGGGTTGCT TTGACTGGTT TGACGATTGC AGAGTACTTC CGTGATGTAG 650 

AAGGTCAAGA TGTCTTGCTC TTCATTGACA ACATCTTCCG TTTCACGCAG 700 

45 GCAGGTTCTG AAGTCTCTGC CCTTTTGGGT CGGATGCCAT CAGCCGTTGG 750 

TTACCAACCA ACACTTGCGA CTGAAATGGG ACAACTCCAA GAGCGTATTA 800 

CATCGACTAA GAAAGGTTCT GTAACTT 827 



50 

2) INFORMATION FOR SEQ ID NO: 951 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

529 
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WO 01/23604 PCT/CA00/01150 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951 

GCAGCAGGGG AAACACTTCC TGAGATTAAT AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GTGATGGTAT GGTCCGTACG ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

10 CGTGGAATGG AAGTTTTGGA CACAGGCCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACTATTGACT 250 

TGGATGCTCC--TTTCGCTGAA_GACGCTGAGC_GTCAGCaU^T— TCATAAGAAA 300 

GCTCCAACTT TTGATGAATT GTCTACCTCA TCTGAAATCT TGGAAACAGG 350 

GATTAAGGTT ATCGACCTTC TTGCCCCTTA CCTTAAAGGT GGGAAAGTTG 4 00 

15 GACTCTTCGG TGGTGCCGGA GTTGGTAAAA, CTGTCTTGAT CCAAGAGTTG 4 50 

ATTCACAACA TTGCCCAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGAGAACGT ACCCGTGAGG GGAACGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTAT TTGGTCAGAT GAATGAGCCA 600 

CCTGGAGCAC GTATGCGTGT TGCTCTTACT GGTTTGACAA TCGCCGAATA 650 

20 CTTCCGTGAT GTAGAAGGCC AAGATGTGCT TCTCTTTATC GACAATATCT 700 

TCCGTTTCAC TCAAGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGGATG 750 

CCTTCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAGAAGGG TTCTGTAACC TCTA 844 



25 



40 



60 



2) INFORMATION FOR SEQ ID NO: 952 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952 



GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

45 GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT 250 

TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 300 

GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 3 50 

50 GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 4 00 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT 500 

TGGGGAACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 600 

55 CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 650 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 7 00 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CCAAGAAGGG 830 



530 
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2) INFORMATION FOR SEQ ID NO: 953 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

20 AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 200 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

25 TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

30 CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

35 ATCACATCAA CTAAGAAGGG TTC 823 



40 



2) INFORMATION FOR SEQ ID NO: 954 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 300 

531 
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GCTCCAACTT 
GATCAAGGTT 
GACTTTTCGG 
ATTCACAACA 
TGGGGAACGT 
CAGGCGTTAT 
CCAGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCATCAGCCG 
GCAAGAACGT 



PCT/CAOO/01150 



TTGATGAGTT 
ATTGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACTCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTGGAAGGCC 
TCAGGCTGGT 
TTGGTTACCA 
ATCACATCAA 



GTCTACCTCT 
TTGCCCCTTA 
GTTGGTAAAA 
GCACGGTGGT 
GGAATGACCT 
GCCATGGTCT 
TGCCCTTACT 
AAGACGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CCAAGAAGGG 



TCTGAAATCC 
CCTTAAAGGT 
CTGTCTTAAT 
ATTTCAGTAT 
TTACTGGGAA 
TTGGTCAGAT 
GGTTTGACAA 
TCTCTTTATC 
CTGCCCTTTT 
GCTACGGAAA 
TTCTGTAACC 



TTGAAACAGG 
GGTAAAGTTG 
CCAAGAATTG 
TTGCTGGTGT 
ATGAAAGAAT 
GAATGAGCCA 
TCGCTGAATA 
GATAATATCT 
GGGTCGTATG 
TGGGTCAATT 
TCTA 



350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
844 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 955, 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 200 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 834 



2) INFORMATION FOR SEQ ID NO: 956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



532 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956 



PCT/CAOO/01150 



TTGTATATCA 
ACGTGGGTTG 
TTCCCTGAAC 
10 ACGTACATGC 
AAGCTAGTAT 



GGTTACAACT 
TCTAAGGGAA 
ATCCGGCCTA 
15 GGGATAACTG 
ACACGAACCG 



CAGGCACTCA 
TGGTGAACGT 
TATCTACTGT 
TTAGTTGCCA 
ATACACTGGT 
"GCACTCTTAT" 
ATTTCTGGTA 
TTTATCGTCA 
TCTAATTAAT 
GATCTATTAC 



GCAAATATTC 
GGGAATGAAA 
AGTTAATGAT 
ATACTTCTAA 
ATTACAATTG 
GGCCGATTCC 
GATTGGCTGA 
AGGTTGTCAG 
TTAGGCTTGA 
GGTTGTAGGA 



CGATACTGAC 
TGGCTGAGAT 
GAAAAGGTGG 
TATGCCAGTG 
CTGAATATTT 
-ACTAGCCGAT- 
AATGCCTGCA 
CTTTTTATGA 
TTAAGTGCTT 
GCAGTTTCTC 



GTAATTATAT 50 

TCTATGCGAA 100 

CCATTATGGA 150 

GCCGCTAGAG 200 

CCGTGATATG 250 

GGGCAGAGGC 300- 

GATTCTGGCT 350 

ACGTGCAGGT 400 

AGGTTCACCA 450 

CACCA 495 



20 2) INFORMATION FOR SEQ ID NO: 957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-1MSS 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957 



AGTTATTTCA CAAGCATTAA GTAAATATAG TAATTCAGAT GTTATTATTT 50 

ATGTAGGATG TGGTGAACGA GGAAATGAAA TGGCAGAAGT TCTTCGAGAT 100 

TTTCCAGCTC TTTCTATTAA AGTAGGAGAT AAAGAAGAAT CTATTATGAC 150 

AAGAACAGCA CTTGTTGCTA ATACATCTAA TATGCCTGTT GCAGCACGTG 200 

40 AAGCATCAAT TTATACTGGA ATTACATTAT CAGAATATTA TAGAGATATG 250 

GGATATAATG TTGCTATGAT GGCAGATTCA ACATCAAGAT GGGCTGAAGC 300 

ACTTAGAGAA ATTTCAGGAC GTCTTGCAGA AATGCCAGCT GATTCTGGAT 350 

ATCCAGCATA TCTTGCAGCA CGTTTAGCAT CATTTTATGA ACGTGCAGGT 400 

ATGGTTGAAT GTTTAGGATC ACCAAAAAGA ATAGGGTCAG TTTCTATTGT 450 

45 AGGAGCTGTT TCACCACCT 469 



50 



2) INFORMATION FOR SEQ ID NO: 958 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Fusobacterium nucleatum subsp. polymorphic 

533 
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(B) STRAIN: ATCC 10953 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958 

5 TACAACACCA ACTTGCTAAA TGGGCAGATG CAGAAGTAGT TGTTTATGTT 50 

GGTTGTGGGG AACGTGGAAA TGAAATGACC GATGTACTTA TGGAATTCCC ■ 100 

AGAAATTATT GACCCTAAGA CAGGACAATC TTTAATGAAG AGAACAGTTC 150 

TTATAGCTAA TACTTCTAAT ATGCCAGTTG CTGCTCGTGA GGCTTCAATC 200 

TATACTGGTA TAACTATTGC AGAATATTTT AGAGATATGG GATATTCAGT 250 

10 GGCACTTATG GCAGATTCAA CAAGTCGTTG GGCAGAAGCA CTTCGTGAAA 300 

TGTCAGGACG TTTGGAAGAA ATGCCAGGTG ATGAAGGATA TCCAGCATAT 350 

CTM i CAAOT^ GAATAGC^ 400 

TCTAGGTAAT GGAGAAGAAG GAGCATTAAC TGTAATTGGA GCAGTATCTC 450 

CA 452 

15 



2) INFORMATION FOR SEQ ID NO: 959 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

35 ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCAATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

40 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 450 

CGGTGCCGTG TCTCCGCCG 4 69 



2) INFORMATION FOR SEQ ID NO: 960 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 



534 



SUBSTITUTE SHEET (RULE 26) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 960 



TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT . 50 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

5 TTCCCGACCC TGACGACCGT GATCGATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA GATGCCGGCC GATGGTGGCT 350 

10 ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 450 

CGGTGCCGTG TCTCCGCCG — 469 



15 



25 



2) INFORMATION FOR SEQ ID NO: 961 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 69 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961 

TGTCATCAGT CAGGCCCTCT CCAAGTACTC CAACTCGGAC TGTGTCATCT 50 

ATGTCGGCTG CGGTGAACGC GGTAACGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACTGT GATCGATGGT CGCGAAGAGT CCATCATGAA 150 

35 GCGCACCTGC CTCGTGGCGA ACACCTCGAA CATGCCCGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTATACCGGC ATCACCCTTG CTGAGTACTA CCGTGATATG 250 

GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GCTGCGTGAG ATTTCGGGTC GATTGGCGGA GATGCCGGCT GATGGTGGCT 350 

ACCCTGCCTA CCTCAGCGCC CGCCTCGCCT CCTTCTACGA GCGCGCCGGT 4 00 

40 CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATCGT 4 50 

CGGTGCAGTG TCTCCACCG 469 



45 2) INFORMATION FOR SEQ ID NO: 962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962 

535 
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TGTCATTAGT CAGGCCCTCT CCAAGTACTC 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 

TTCCCGACCC TGACGACCGT GATCGATGGC 

GCGCACCTGC CTCGTGGCGA ACACCTCGAA 

5 AGGCCTCTAT TTACACCGGC ATCACCCTGG 

GGCAAGCATA TCGCCATGAT GGCTGACTCG 

GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA 

ACCCCGCCTA CCTCAGCGCT CGTCTCGCCT 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC 

10 CGGTGCCGTG TCTCCACCG 



PCT/CA00/01150 

CAACTCCGAT TGCGTCATCT 50 

TGGCCGAGGT GCTCATGGAG 100 

CGCGAGGAGT CGATCATGAA 150 

CATGCCAGTC GCAGCCCGTG 200 

CCGAGTACTA CCGTGATATG 250 

ACGTCTCGCT GGGCCGAGGC 300 

GATGCCGGCC GATGGTGGCT 350 

CCTTCTACGA GCGCGCCGGC 400 

CAGGGCTCCG TCACGATCGT 450 

469 



2) INFORMATION FOR SEQ ID NO: 963 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20. (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963 

30 TGTGATCAGT CAGGCCCTTT CCAAGTACTC CAACTCGGAC TGCGTCATCT 50 

ACGTCGGCTG TGGTGAGCGC GGGAACGAGA TGGCCGAGGT GCTCATGGAT 100 

TTCCCGACTT TGACGACTGT GATCGATGGT CGCGAGGAGT CCATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCCAA CATGCCAGTT GCAGCCCGTG 200 

AGGCTTCTAT CTATACCGGC ATCACACTGG CTGAGTACTA TCGTGATATG 250 

35 GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GTTGCGTGAG ATTTCCGGTC GGCTGGCGGA GATGCCGGCC GATGGTGGTT 350 

ACCCCGCCTA CCTCAGTGCC CGTCTCGCCT CCTTCTACGA GCGCGCTGGC 400 

CTCGTGACCT GTATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACAATTGT 450 

TGGTGCGGTG TCTCCACCG 469 

40 



2) INFORMATION FOR SEQ ID NO: 964 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
55 (B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964 

GGTCATTAGT CAGGCCCTTT CCAAGTACTC CAACTCTGAC TGCGTCATCT 50 
60 ACGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

536 
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TTCCCGACCC TGACGACCAT GATCGATGGT 

GCGCACCTGC CTCGTGGCGA ACACCTCGAA 

AGGCCTCTAT CTACACCGGC ATCACCCTCG 

GGCAAGCATA TCGCCATGAT GGCCGACTCG 

5 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA 

ACCCCGCCTA CCTCAGCGCT CGTCTCGCCT 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC 

CGGTGCCGTG TCTCCGCCG 



PCT/CA00/01150 

CGGGAGGAGT CGATCATGAA 150 

CATGCCCGTC GCAGCCCGTG 200 

CCGAGTACTA CCGTGATATG 250 

ACGTCTCGCT GGGCTGAGGC 300 

GATGCCGGCC GATGGTGGCT 350 

CCTTCTACGA GCGCGCCGGC 400 

CAGGGCTCCG TCACGATCGT 450 

469 



10 



-2-)-INFGRMA-T-I0N-F0R-SEQ -I-D-NO :- -9 65 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY:. Linear 



20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965 



TGTAATTAGT CAGGCCCTCT CCAAGTACTC 
ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 

30 TTCCCGACCC TGACGACCGT GATCGATGGT 
GCGCACCTGC CTCGTGGCGA ACACTTCGAA 
AGGCCTCTAT TTACACCGGC ATCACCCTGG 
GGCAAGCATA TCGCCATGAT GGCCGACTCG 
GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA 

35 ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT 
CTCGTCACCT GCATCGGCGG GCCGAAGCGC 
CGGTGCCGTG TCTCCGCCG 



CAACTCCGAC TGCGTCATCT 50 

TGGCCGAGGT GCTCATGGAG 100 

CGCGAGGAGT CGATCATGAA 150 

CATGCCAGTC GCAGCCCGTG 200 

CCGAGTACTA CCGTGATATG 250 

ACGTCTCGCT GGGCCGAGGC 300 

GATGCCGGCC GATGGTGGCT 350 

CCTTCTACGA GCGCGCCGGC 400 

CAGGGCTCCG TCACGATTGT 450 

469 



40 



50 



2) INFORMATION FOR SEQ ID NO: 966 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 449 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966 

CACCAGTTCG CCAAGTGGGC AGATGCTCAG ATAGTTGTAT ACGTTGGTTG 50 

TGGAGAACGT GGTAACGAGA TGACAGACGT TCTAAATGAA TTCCCAGAAC 100 

TGATTGACCC TCATACAGGC GAATCTCTAA TGAAGAGAAC AGTTCTTATA 150 

60 GCTAATACGT CAAATATGCC AGTTGCAGCC AGAGAGGCAA GTATATATAC 200 

537 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AGGTATTACA ATAGCTGAAT ATTTTAGAGA 
TAATGGCCGA CTCTACATCA AGATGGGCAG 
GGTCGTCTAG AAGAAATGCC TGGTGATGAA 
TTCTAGAGCT GCAGAGTTCT ATGAAAGAGC 
5 GTAGCGATAA TAGAGAGGGA GCCCTTACAA 



PCT/CAOO/01150 

CATGGGATAT TCAGTAGCGG 250 

AGGCCCTAAG AGAGATGTCA 300 

GGTTATCCAG CCTATCTAGG 350 

AGGTAAGGTA ATATGTAAGG 400 

TAATAGGTGC CGTGTCACC 449 



10 



2) INFORMATION FOR SEQ ID NO: 967 

(i) SEQUENCE CHARACTERISTICS: 

(-A-)— L-ENGTH-:— 8-26-ba-ses- 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bordetella pertussis 

(B) STRAIN: ATCC 9797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967 

25 CTGGTGGTGT CGGCCGCAGA CGGCCCGATG CCGCAGACGC GCGAGCACAT 50 

TTTGCTGTCG CGCCAGGTTG GCGTGCCGTA CATCATCGTG TTCCTGAACA 100 

AGGCGGACAT GGTTGATGAC GCGGAGCTGC TCGAGCTGGT GAAGATGGAA 150 

GTCCGCGAAC TGCTGAGCAA GTACGATTTC CCGGGCGATG ACACGCCGAT 200 

CGTGAAGGGT TCGGCCAAGC TGGCGCTGGA AGGCGACAAG GGCGAACTGG 250 

30 GCGAGCAGGC GATTCTGTCG CTGGCGCAAG CGCTGGACAC GTACATTCCG 300 

ACGCCGGAGC GCGCGGTCGA CGGTGCGTTC CTGATGCCGG TGGAAGACGT 350 

GTTCTCGATC TCGGGCCGTG GCACGGTGGT GACTGGCCGT ATCGAGCGCG 400 

GCGTGGTGAA GGTTGGCGAG GAAATCGAAA TCGTGGGCAT CAAGCCGACG 450 

GTGAAGACGA CCTGCACGGG CGTGGAGATG TTCCGCAAGC TGCTGGACCA 500 

35 GGGCCAGGCG GGCGACAACG TGGGTATCTT GCTGCGCGGC ACCAAGCGTG 550 

AAGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGTTC GATCAACCCG 600 

CACACGGACT TCACGGCCGA GGTGTACATT CTGTCCAAGG AAGAGGGTGG 650 

CCGTCACACG CCGTTCTTCA ACGGCTATCG TCCGCAGTTC TACTTCCGCA 700 

CGACGGACGT GACCGGCACG ATCGACCTGC CGGCGGACAA GGAAATGGTG 750 

40 CTGCCGGGCG ACAACGTGTC GATGACCGTC AAGCTGCTGG CCCCGATCGC 800 

CATGGAAGAA GGTCTGCGTT TCGCCA 826 



45 2) INFORMATION FOR SEQ ID NO: 968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: BD180 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968 

538 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CGATCCTGGT GGTGTCGGCC GCAGACGGCC CGATGCCGCA GACGCGCGAG 50 

CACATTTTGC TGTCGCGCCA GGTTGGCGTG CCGTACATCA TCGTGTTCCT 100 

GAACAAGGCG GACATGGTTG ATGACGCGGA GCTGCTCGAG CTGGTGGAGA 150 

TGGAAGTCCG CGAACTGCTG AGCAAGTACG ATTTCCCGGG CGATGACACG 200 

5 CCGATCGTGA AGGGTTCGGC CAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 250 

ACTGGGCGAG CAGGCGATTC TGTCGCTGGC GCAAGCGCTG GACACGTACA 300 

TTCCGACGCC GGAGCGCGCG GTCGACGGTG CGTTCCTGAT GCCGGTGGAA 350 

GACGTGTTCT CGATCTCGGG CCGTGGCACG GTGGTGACTG GCCGTATCGA 400 

GCGCGGCGTG GTGAAGGTTG GCGAGGAAAT CGAAATCGTG GGCATCAAGC 450 

10 CGACGGTGAA GACGACCTGC ACGGGCGTGG AGATGTTCCG CAAGCTGCTG 500 

GACCAGGGCC AGGCGGGCGA CAACGTGGGT ATCTTGCTGC GCGGCACCAA 550 

GCGTGAAGAC _ GTCGAGCGTG~GCCAGGTGCT _ GGCCAAGCCG-GGTTCGATGA 600 

ACCCGCACAC GGACTTCACG GCCGAGGTGT ACATTCTGTC CAAGGAAGAG 650 

GGTGGCCGTC ACACGCCGTT CTTCAACGGC TATCGTCCGC AGTTCTACTT 700 

15 CCGCACGACG GACGTGACCG GCACGATCGA CCTGCCGGCG GACAAGGAAA 750 

TGGTGCTGCC GGGCGACAAC GTGTCGATGA CCGTCAAGCT GCTGGCCCCG 800 

ATCGCCATGG AAGAAGG 817 



20 



30 



2) INFORMATION FOR SEQ ID NO: 969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 7 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969 

CCTATGCCAC AAACTCGTGA ACACATTCTT TTATCACGTA ACGTTGGTGT 50 

GCCATACATC GTTGTTTTCT TAAACAAAGT TGATATGGTT GACGACGAAG 100 

AATTATTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT AACTGAATAT 150 

40 GACTTCCCAG GAGACGATGT TCCTGTAATC GCTGGTTCTG CATTAAAAGC 200 

TTTAGAAGGC GACCCTGCTT ACGAAGAAAA AATCTTAGAA TTAATGGCTG 250 

CAGTTGACGA ATACATCCCA ACTCCAGAAC GTGACAACGA CAAACCATTC 300 

ATGATGCCAG TTGAAGACGT GTTCTCAATT ACTGGTCGTG GTACTGTTGC 350 

TACAGGTCGT GTTGAACGTG GACAAGTTCG TGTTGGTGAC GAAGTTGAAA 400 

45 TCGTTGGTAT CGCTGACGAA ACTTCTAAAA CAACAGTTAC TGGTGTTGAA 450 

ATGTTCCGTA AATTATTAGA TTACGCTGAA GCTGGAGACA ACATCGGTGC 500 

ATTATTACGT GGTGTGGCTC GTGAAGACAT CCAACGTGGT CAAGTATTAG 550 

CTAAACCAGG TTCAATCACT CCACATACAA AATTCACTGC TGAAGTGTAC 600 

GTTTTAACTA AAGAAGAAGG TGGACGTCAT ACTCCAT 637 

50 



2) INFORMATION FOR SEQ ID NO: 970 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

539 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970 

TATGCCTCAA ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC 50 

10 CATACATCGT TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA 150 

~ — — CTTCCCAGGC-GACGATGTTe-eTGTAATGGe-TGGT-Te-TGeT— T-TGAAAGGTG 200 

TTGAAGGCGA TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA 250 

GTTGACGAAT ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT 300 

15 GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTG CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT 500 

TGCTACGTGG GGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTAGCT 550 

20 AAAGCTGGTA CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT 600 

TTTAACAAAA GAAGAAGGTG GACGTCACAC TCCA 634 



25 2) INFORMATION FOR SEQ ID NO: 971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971 



GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC GTCAGGTTGG 50 

TGTTAAACAC CTTATCGTCT TCATGAACAA AGTTGACTTG GTTGACGACG 100 

AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT ATTGTCAGAA 150 

TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 200 

45 AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA 250 

ACACAGTTGA TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA 300 

TTGCTTCTTC CAGTCGAGGA CGTATTCTCA ATCACTGGAC GTGGTACAGT 350 

TGCTTCAGGA CGTATCGACC GTGGTATCGT TAAAGTCAAC GACGAAATCG 400 

AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT TACTGGTGTT 450 

50 GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 500 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA 550 

TCGCTAAACC AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC 600 

TACATCCTTA CTAAAGAAGA AGGTGGACGT CACACTCCAT TCTTCAACAA 650 

CTACCGTCCA CAATTCTACT TCCGTACTAC TGACGTTACA GGTTCAATCG 700 

55 AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA CGTGACAATC 750 

GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAG 787 



60 2) INFORMATION FOR SEQ ID NO: 972 

540 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(B) STRAIN: A-TGC— 35401- 



15 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972 



TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA GCACATCCTG 50 

CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC TGAACAAATG 100 

CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA ATGGAAGTTC 150 

GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC TCCGATCGTT 200 

20 CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT GGGAAGCGAA 250 

AATCCTGGAA CTGGCTGGCT TCCTGGATTC TTACATTCCG GAACCAGAGC 300 

GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT ATTCTCCATC 350 

TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG GTATCATCAA 400 

AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT CAGAAGTCTA 450 

25 CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA AGGCCGTGCT 500 

GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG AAGAAATCGA 550 

ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG CACACCAAGT 600 

TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG CCGTCATACT 650 

CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA CTACTGACGT 700 

30 GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA ATGCCGGGCG 750 

ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC GATGGACGAC 800 

GGT 803 



2) INFORMATION FOR SEQ ID NO: 973 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 762 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 4 3895 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973 

ACTGACGGCC CGATGCCGCA GACTCGTGAG CACATCCTGC TGGGTCGTCA 50 

GGTAGGCGTT CCGTACATCA TCGTGTTCCT GAACAAATGC GACATGGTTG 100 

ATGACGAAGA GCTGCTGGAA CTGGTTGAAA TGGAAGTTCG TGAACTTCTG 150 

55 TCTCAGTACG ACTTCCCGGG CGACGACACT CCGATCGTTC GTGGTTCTGC 200 

TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG GGAAGCGAAA ATCCTGGAAC 250 

TGGCTGGCTT CCTGGATTCC TACATTCCGG AACCAGAGCG TGCRATTGAC 300 

AAGCCGTTCC TGCTGCCGAT CGAAGACGTA TTCTCCATCT CCGGTCGTGG 350 

TACCGTTGTT ACCGGTCGTG TAGAACGCGG TATCATCAAA GTTGGTGAAG 400 

60 AAGTTGAAAT CGTTGGTATC AAAGAGACTC AGAAGTCTAC CTGTACTGGC 450 

541 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTTGAAATGT TCCGCAAACT GCTGGACGAA 
AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 
TACTGGCTAA GCCGGGCACC ATCAAGCCGC 
GTGTACATTC TGTCCAAAGA TGAAGGCGGC 
5 AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 
TTGAACTGCC GGAAGGCGTA GAGATGGTAA 
ATGGTTGTTA CC 



PCT/CA00/01150 

GGCCGTGCTG GTGAGAACGT 500 

AGAAATCGAA CGTGGTCAGG 550 

ACACCAAGTT CGAATCTGAA 600 

CGTCATACTC CGTTCTTCAA 650 

TACTGACGTG ACTGGTACCA 700 

TGCCGGGCGA CAACATCAAA 750 

762 



10 



2) INFORMATION FOR SEQ ID NO: 974 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 804 bases 
15 (B) TYPE: Nucleic acid - 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974 

CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

30 AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 2 00 

- TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCC TGGAACTGGC TGGCTTCCTG GATTCCTACA TTCCGGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA ACGCGGTATC 400 

35 ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCTGGTGA GAACGTAGGT GTTCTGCTGC GCGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGTCGTC 650 

40 ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGA 804 



45 



60 



2) INFORMATION FOR SEQ ID NO: 975 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



542 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975 



GCGATCCTGG TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

5 TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC 200 

TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 250 

GGGAAGCGAA AATCCTGGAA CTGGCTGGCT TCCTGGATTC YTACATTCCG 300 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 350 

10 ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG 400 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 4 50 

CAGAAGTCTA-eeTGTACTGG-GGT-TGAAATG-T-TGGGGAAAe— TGGTGGACGA —500 

AGGCCGTGCT GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG 600 

15 CACACCAAGT TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG 650 

TCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGTGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC 800 

GATG 804 

20 



2) INFORMATION FOR SEQ ID NO: 976 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 
35 (B) STRAIN: Mavi-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976 

GGCGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CGCAGACCCG 50 

40 TGAGCACGTG CTGCTCGCCC GTCAGGTCGG TGTGCCCTAC ATCCTGGTCG 100 

CCCTGAACAA GGCCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCCGCC CAGGAGTTCG ACGAGGACGC 200 

CCCGGTGGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGC GACGCCAAGT 250 

GGGTCGAGTC CGTCGAGCAG CTGATGGAGG CCGTCGACGA GTCGATCCCG 300 

45 GACCCGGTCC GCGAGACGGA GAAGCCGTTC CTGATGCCGG TGGAGGACGT 350 

CTTCACCATC ACCGGGCGTG GCACCGTGGT CACCGGTCGT GTCGAGCGCG 4 00 

GTGTGATCAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CCGCCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAR ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCCGGTGACA ACGTCGGTCT GCTGCTGCGT GGTATCAAGC 550 

50 GTGAGGACGT CGAGCGCGGC CAGGTCGTGA CCAAGCCCGG CACCACCACG 600 

CCGCACACCG AGTTCGAGGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACGC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGCTGA TCCAGCCCGT 800 

55 CGCCATGGAC GACGGTSTGC GGTTC 825 



2) INFORMATION FOR SEQ ID NO: 977 

60 



543 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977 

15 TATCCTTGTA GTAGCTTCAA CTGACGGACO AATGCCACAA ACTCGTGAGC 50 

ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAAGTTG ACTTGGTTGA CGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTATTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 200 

CAGTTATCCA AGGTTCAGCA CTTAAAGCTC TTGAAGGTGA CTCTAAATAC 250 

20 GAAGACATCG TTATGGAATT GATGAACACA GTTGATGAGT ATATCCCAGA 300 

ACCAGAACGT GACACTGACA AACCATTGCT TCTTCCAGTC GAGGACGTAT 350 

TCTCAATCAC TGGACGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 4 00 

ATCGTTAAAG TCAACGACGA AATCGAAATT GTTGGTATCA AAGAAGAAAC 450 

TCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAACTTGACG 500 

25 AAGGTCTTGC TGGAGATAAC GTAGGTGTCC TTCTTCGTGG TGTTCAACGT 550 

GATGAAATCG AACGTGGACA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAAGGTG AAGTCTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAATT CTACTTCCGT 700 

ACTACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

30 AATGCCTGGT GATAACGTGA CAATCGACGT TGAGTTGATT CACCCAATCG 800 

CCGTAGAACA AGGTACTACA 82 0 



35 2) INFORMATION FOR SEQ ID NO: 978 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: M-Gor-1 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978 



GGCGCGATCC TGGTGGTCGC CGCCACCGAT GGCCCGATGC CGCAGACCCG 50 

TGAGCACGTG CTGCTCGCCC GTCAGGTGGG CGTGCCCTAC ATCCTGGTGG 100 

CGCTGAACAA GTCCGACGCG GTCGACGACG AGGAGCTGCT CGAGCTCGTC 150 

GAGCTGGAGG TCCGCGAGTT GCTGGCCGCC CAGGACTTCG ACGAGGAAGC 200 

55 TCCGGTGGTC CGGGTCTCGG CGCTGAAGGC GCTCGAGGGC GACGCCACCT 250 

GGGTGAAGTC GGTAGAGGAC TTGATGGACG CGGTCGACGA GTCGATTCCG 300 

GACCCGGTCC GCGACACCGA CAAGCCGTTC CTGATGCCCG TCGAGGACGT 350 

CTTCACCATC ACCGGTCGTG GCACCGTCGT CACCGGCCGT GTGGAGCGCG 4 00 

GCGTGGTGAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CAAGCCGACC 450 

60 AGCACCAAGA CCACGGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500 

544 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CCAGGGTCAG GCCGGTGACA ACGTCGGTCT 
GTGAGGACGT CGAGCGCGGC CAGGTCGTCA 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA 
GCACCACCGA CGTGACCGGT GTGGTGACGC 
GTGATGCCCG GTGACAACAC CAACATCTCG 
CGCCATGGAC GACGGTCTGC GG 



PCT/CA00/01150 

GCTGCTGCGT GGTGTCAAGC 550 

TCAAGCCCGG CACCACCACT 600 

ATCCTGTCCA AGGACGAGGG 650 

CCGTCCGCAG TTCTACTTCC 700 

TGCCGGAGGG CACCGAAATG 750 

GTGAAGCTGA TCCAGCCCGT 800 

822 



10 



2 ) INFORMATION FOR SEQ ID NO: 979 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 821 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

30 TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC ACTTAAAGCT CTTGAAGGTG ACTCTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TATATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAGGACGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

35 TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CTGGAGATAA CGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTCTACA TCCTTACTAA AGAAGAAGGT 650 

4 0 GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTAC A 821 



45 



60 



2) INFORMATION FOR SEQ ID NO: 980 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: ATCC 25177 



545 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ 



PCT/CA00/01150 



ID NO: 980 



GGTGCGATCC TGGTGGTCGC 
CGAGCACGTT CTGCTGGCGC 
CGCTGAACAA GGCCGACGCA 
GAGATGGAGG TCCGCGAGCT 
CCCGGTTGTG CGGGTCTCGG 
GGGTTGCCTC TGTCGAGGAA 
GACCCGGTCC GCGAGACCGA 
CTTCACCATT ACCGGCCGCG 
GCGTGATCAA CGTGAACGAG 
AGGAGGAAGA— C-GAGGGTGAG 
CCAGGGCCAG GCGGGCGACA 
GCGAGGACGT CGAGCGTGGC 
CCGCACACCG AGTTCGAAGG 
CGGCCGGCAC ACGCCGTTCT 
GCACCACCGA CGTGACCGGT 
GTGATGCCCG GTGACAACAC 
CGCCATGGAC GAAGGTNTGC 



CGCCACCGAC 
GTCAAGTGGG 
GTGGACGACG 
GCTGGCTGCC 
CGCTCAAGGC 
CTGATGAACG 
CAAGCCGTTC 
GAACCGTGGT 
GAAGTTGAGA 
GGGTGTGGAG 
ACGTTGGTTT 
CAGGTTGTCA 
CCAGGTCTAC 
TCAACAACTA 
GTGGTGACAC 
CAACATCTCG 
GTTTCGCG 



GGCCCGATGC CCCAGACCCG 
TGTGCCCTAC ATCCTGGTAG 
AGGAGCTGCT CGAACTCGTC 
CAGGAATTCG ACGAGGACGC 
GCTCGAGGGT GACGCGAAGT 
CGGTCGACGA GTCGATTCCG 
CTGATGCCGG TCGAGGACGT 
CACCGGACGT GTGGAGCGCG 
TCGTCGGCAT TCGCCCATCG 
A-TGT-T-GGGGA-AGGTGCTCGA- 
GCTGCTGCGG GGCGTCAAGC 
CCAAGCCCGG CACCACCACG 
ATCCTGTCCA AGGACGAGGG 
CCGTCCGCAG TTCTACTTCC 
TGCCGGAGGG CACCGAGATG 
GTGAAGTTGA TCCAGCCCGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 981 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 144 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981 

CGGCGGTATC TTAGTAGTAT CTGCTGCAGA TGGCCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGCTGACAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATACGACTTC CCTGGTGACG 200 

ATGTACCAGT TATCGTTGGT TCTGCATTAA AAGCTTTAGA AGGCGACCCA 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCTGTAG ATGACTACAT 300 

CCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCAACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATCACTGA 450 

AGAAAGCAAG AAAACAACAG TTACAGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT 550 

GCACGTGAAG ACGTACAACG TGGACAAGTA TTAGCAGCTC CTGGCTCTAT 600 

TACTCCACAC ACAAAATTCA AAGCTGATGT TTACGTTTTA TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC TGGCGTTGTT CACTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTAGAAAT GACTGTTGAA TTAATCGCTC 800 

CAATCGCGAT TGAAGACGG 819 



2) INFORMATION FOR SEQ ID NO: 982 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982 

15 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAGATC GACTTGGTTG ATGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTCTTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCT CTTGAAGGTG ATACTAAGTA 250 

20 CGAAGACATC ATCATGGAAT TGATGAACAC TGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGATACTGAC AAACCTCTTC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGATG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 4 50 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAGCTTGAC 500 

25 GAAGGTCTTG CAGGGGACAA CGTAGGTGTA CTTCTTCGTG GTATCCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAGGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAGT TCTACTTCCG 700 

TACAACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

30 TAATGCCTGG TGATAACGTA ACTATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTTGAAC AAGG 814 



35 2) INFORMATION FOR SEQ ID NO: 983 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 983 



CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

55 CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

60 CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

547 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CAOO/01150 

GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC 810 



2) INFORMATION FOR SEQ ID NO: 984 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984 

TCCTTGTAGT AGCTTCAACT GACGGACCAA TGCCACAAAC TCGTGAGCAC 50 

ATCCTTCTTT CACGTCAGGT TGGTGTTAAA CACCTTATCG TCTTCATGAA 100 

CAAGATCGAC TTGGTTGATG ACGAAGAATT GCTTGAATTG GTTGAAATGG 150 

30 AAATCCGTGA CCTCTTGTCA GAATACGACT TCCCAGGTGA CGATCTTCCA 2 00 

GTTATCCAAG GTTCAGCTCT TAAAGCTCTT GAAGGTGATA CTAAGTACGA 250 

AGACATCATC ATGGAATTGA TGAACACTGT TGATGAGTAC ATCCCAGAAC 300 

CAGAACGTGA TACTGACAAA CCTCTTCTTC TTCCAGTCGA AGACGTATTC 350 

TCAATCACTG GTCGTGGTAC AGTTGCTTCA GGACGTATCG ACCGTGGTAC 400 

35 TGTTCGTGTC AACGATGAAA TCGAAATCGT TGGTATCAAA GAAGAAATCC 450 

AAAAAGCAGT TGTTACTGGT GTTGAAATGT TCCGTAAACA GCTTGACGAA 500 

GGTCTTGCAG GGGACAACGT AGGTGTACTT CTTCGTGGTA TCCAACGTGA 550 

TGAAATCGAA CGTGGTCAAG TTATCGCTAA ACCAGGTTCA ATCAACCCAC 600 

ACACTAAATT CAAGGGTGAA GTTTACATCC TTACTAAAGA AGAAGGTGGA 650 

4 0 CGTCACACTC CATTCTTCAA CAACTACCGT CCACAGTTCT ACTTCCGTAC 700 

AACTGACGTT ACAGGTTCAA TCGAACTTCC AGCAGGTACT GAAATGGTAA 750 

TGCCTGGTGA TAACGTAACT ATCGACGTTG AGTTGATCCA CCCAATCGCC 800 

GTTGAACAAG GTACTAC 817 



45 



60 



2) INFORMATION FOR SEQ ID NO: 985 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



548 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985 



GACGGACCAA TGCCACAAAC TCGTGAGCAC ATCCTTCTTT CACGTCAGGT 50 

TGGTGTTAAA CACCTTATTG TCTTCATGAA CAAAATTGAC TTGGTAGACG 100 

5 ACGAAGAATT GCTTGAATTG GTTGAAATGG AAATCCGTGA CCTCTTGTCA 150 

GAATACGACT TCCCAGGTGA CGATCTTCCA GTTATCCAAG GTTCAGCTCT 200 

TAAAGCTCTT GAAGGTGACT CTAAATACGA AGACATCATT ATGGAATTGA 250 

TGAACACTGT TGATGAGTAC ATCCCAGAAC CAGAACGTGA CACTGAAAAA 300 

CCATTGCTTC TTCCAGTCGA AGACGTATTC TCAATCACTG GACGTGGTAC 350 

10 AGTTGCTTCA GGACGTATCG ACCGTGGTAC TGTTCGTGTC AACGACGAAA 400 

TCGAAATCGT TGGTATCAAA GAAGAAACTC AAAAAGCAGT TGTTACTGGT 450 

— GTTGAAATGT-TCCGTAAACA— ACTTGAeGAA-GGTG-T-TGGGG-GAGA-TAAGGT 500 

AGGTGTCCTT CTTCGTGGTG TTCAACGTGA CGAAATCGAA CGTGGACAAG 550 

TTATCGCTAA ACCAGGTTCA ATCAACCCAC ACACTAAATT TAAAGGTGAA 600 

15 GTCTACATCC TTACTAAAGA AGAAGGTGGA , CGTCACACTC CATTCTTCAA 650 

CAACTACCGT CCACAATTCT ACTTCCGTAC TACTGACGTT ACAGGTTCAA 700 

TCGAACTTCC TGCAGGTACT GAAATGGTAA TGCCTGGTGA TAACGTGACT 750 

ATCGACGTTG AGTTGATCCA CCCAATCGCC GTAGAACAAG GTACTACA 798 



20 



2) INFORMATION FOR SEQ ID NO: 986 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986 



TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

4 0 AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT 2 00 

ATCCAAGGTT CAGCACTTAA AGCTCTTGAA GGTGACTCTA AATACGAAGA 2 50 

CATCGTTATG GAATTGATGA ACACAGTTGA TGAGTATATT CCAGAACCAG 3 00 

AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA CGTATTCTCA 3 50 

45 ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 400 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA 450 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCTGGAG ATAACGTAGG TGTCCTTCTT CGTGGTGTTC AACGTGATGA 550 

AATCGAACGT GGACAAGTTA TCGCTAAACC AGGTTCAATC AACCCACACA 600 

50 CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 700 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA 800 

GAACAAGGTA CTACA 815 

55 



2) INFORMATION FOR SEQ ID NO: 987 

60 (i) SEQUENCE CHARACTERISTICS: 

549 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 
10 (B) STRAIN: ATCC 8043 



(xi-)— SEQUENGE-DESCRIP-T-ION-:_SEQ_ID-_NO.:„a8„7_ 



CGGACAATCT TGGTTGTTTC TGCTACGGAT GGCCCAATGC CTCAAACTCG 50 

15 TGAACACATT TTACTTTCTC GCCAAGTAGG CGTGAAATAT TTGATTGTTT 100 

TCTTGAACAA AACAGATTTA GTCGATGATG AAGAATTAAT TGATCTAGTA 150 

GAAATGGAAG TTCGTGAACT ATTAAGCGAA TATGGATTTC CAGGTGATGA 200 

TACACCTGTT ATCAAAGGAT, CAGCATTAAA AGCATTACAA GGTGATCCTG 250 

ATGCAGAAGC AGCTATCATG GAATTGATGG ACACAGTCGA TGAATACATC 300 

20 CCAACACCAG AACGTGATAC GGACAAACCA TTATTGTTAC CCGCCGAAGA 350 

TGTATTTTCA ATTACTGGAC GTGGAACAGT AGCATCTGGT CGGATTGACC 400 

GTGGGGCTGT TAGAGTCGGT GACGAAATCG AAATCGTAGG GATCAAACCA 450 

GAAACGCAAA GAGCCGTAGT AACAGGAGTT GAAATGTTCC GCAAAACGCT 500 

TGATTACGGT GAAGCAGGGG ATAACGTAGG TGTGTTATTA CGTGGGATTC 550 

2 5 AAAGAGAAGA CATCGAACGT GGCCAAGTGA TTGCCAAACC TGGTTCAATT 600 

ACACCTCATA CTAAATTCAA AGCAGAAGTT TAGGTTTTGA CTAAAGAAGA 650 

AGGCGGACGT CATACACCAT TCTTCAATAA TTATCGACCA CAATTTTATT 700 

TCCGCACAAC TGACGTAACA GGAACAATTG TTTTGCCAGA AGGAACGGAA 750 

ATGGTCATGC CTGGCGACAA CGTAACGATC GATGTAGAAT TGATTCATCC 800 

30 TGTTGCTATT GAAAACGGGA CGACTTTCTC CA 832 



2) INFORMATION FOR SEQ ID NO: 988 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988 

50 TGGTGCGATC TTAGTTGTTT CTGCGACAGA TGGACCAATG CCTCAAACTC 50 

GCGAGCATAT CCTTTTATCA CGTCAAGTTG GTGTAAAATA TTTGATTGTA 100 

TTTTTGAATA AAGTTGATTT GGTCGATGAT GAAGAATTGA TCGATCTTGT 150 

AGAAATGGAA GTTCGTGAAT TACTGAATGA ATATGGTTTC CCAGGTGATG 2 00 

ACACACCTGT CATCAAAGGC TCTGCATTGA AAGCATTGCA AGGTGACCCA 250 

55 GAAGCAGAAG CTGCGATCAA TGAATTGATG GAAACAGTGG ATGACTATAT 300 

CCCAACACCA GAACGTGATA CCGACAAACC ATTGCTTTTA CCAGTTGAAG 350 

ATGTTTTCTC AATCACTGGT CGCGGAACGG TAGCATCTGG TCGTATCGAC 400 

CGTGGAGCCG TTCGTGTCGG TGATGAAATC GAGATCATCG GAATCAAACC 450 

TGAAACGAAA AAAGCGGTTG TGACAGGGGT AGAGATGTTC CGTAAAACGT 500 

60 TAGATTATGG CGAAGCTGGA GATAACGTAG GAATCTTGTT ACGTGGTATC 550 

550 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CAAAGAGAAG ATATTGAACG 
CACACCACAT ACAAAATTCA 
AAGGCGGACG TCATACACCA 
TTCCGCACAA CAGATGTAAC 
5 AATGGTCATG CCTGGAGACA 
CAGTGGCAAT CGAACAAGGA 



TGGACAAGTA ATTGCGAAAC 
AAGCGGAAGT TTATGTATTG 
TTCTTCAATA ACTACCGCCC 
AGGTACGATC GTGTTGCCAG 
ACGTAACCAT CGAGGTAGAG 
ACGACTTTCT CTATT 



PCT/CA00/01150 

CTGGTTCAAT 600 
ACGAAAGAAG 650 
ACAATTTTAT 700 
AAGGAACAGA 750 
TTGATCCATC 800 
835 



10 2) INFORMATION FOR SEQ ID NO: 989 

(4-)— SEQUENGE-GHARAGT-ER-I-SIICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 4 9427 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989 



TGCGATCTTA GTAGTATCTA CTACTGATGG ACCAATGCCT CAAACACGTG 50 

AGCACATTTT GTTATCACGC CAAGTAGGTG TTAAATACTT GATCGTCTTC 100 

TTGAACAAAG TTGATTTAGT CGACGATGAA GAATTGATTG ATTTAGTTGA 150 

AATGGAAGTA CGTGAGTTAC TTTCAGAATA TGGTTTCCCA GGCGATGATA 200 

3 0 TTCCTGTTCT TAAAGGTTCA GCTCTGAAAG CTTTAGAAGG CGATCCTGAA 250 

CAAGAACAAG TAATCATGGA CCTAATGGAT ACGGTTGACG AATATATCCC 300 

AACACCAGAA CGTGACACTG ACAAACCATT CTTGTTACCA GTGGAAGATG 350 

TTTTCTCAAT CACAGGACGT GGGACTGTTG CATCTGGTCG TATTGATCGT 400 

GGGGAAGTTA AAGTCGGTGA CGAAGTTGAA ATTATCGGGA TCAAACCTGA 450 

35 AGTTCAAAAG GCTGTCGTAA CTGGACTTGA AATGTTCCGT AAAACATTGG 500 

ATTATGGTGA AGCTGGAGAT AACGTTGGGG TTCTATTACG TGGTATTACA 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCTAAACCAG GTTCAATTAC 600 

ACCACATACG AAATTCAGTG CAGAAGTTTA TGTGTTGACG AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC TTTAACAACT ATCGTCCTCA ATTCTACTTC 700 

40 CGTACAACAG ACGTTACCGG TAATATCGTG TTGCCAGAAG GTACTGAAAT 750 

GGTCATGCCT GGCGATAACG TAACAATCGA CGTTGAATTA ATCCATCCAA 800 

TCGCCGTAGA AAAAGGAACA ACTTTCTCTA TT 832 



45 



55 



2) INFORMATION FOR SEQ ID NO: 990 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 154 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990 

551 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGATCCTGTA TATGCACAAA AACTAGGTGT 

TATCACAACC TGATACAGGG GAGCAAGGTT 

GTACGAAGTG GTGCGGTTGA TATTATCGTA 
5 TGTA 



TAACATCGAT GAATTACTAT 50 
TAGAAATCGC AGAAGCACTT 100 
ATTGACTCTG TAGCAGCTCT 150 

154 



10 



2) INFORMATION FOR SEQ ID NO: 991 

(i) SEQUENCE CHARACTERISTICS: 

( A) — -LENGTH-:— 384— bases- 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991 

25 GCCATTGCAG AGGCACAGAA GCAGGGCGGT ATTGCAGCCT TCATTGATGC 50 

TGAGCACGCC TTCGACCGTT TCTATGCAGA GAAGTTAGGT GTGGATGTTG 100 

ATAACCTTTG GGTTTCACAG CCAGACAATG GTGAGCAGGC TTTAGAGATT 150 

GCCGACCAGC TGATTCGCTC TTCCGCTATT GACATTCTCG TTGTCGACTC 200 

AGTTGCAGCC TTGACTCCAA AGAAGGAGAT TGAGGGTGAC ATGGGTGACT 250 

3 0 CTGCAGTAGG TTTACAAGCA CGACTGATGA GTCAGGCATT GCGTAAACTT 3 00 

ACCTCAACAA TCGCAAAAAC TAATACTTGC TGCATCTTCA TCAACCAGTT 350 

GCGTGAGAAG ATTGGTGTGA TGTTTGGTAA TCCA 384 



35 



45 



2) INFORMATION FOR SEQ ID NO: 992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terococcus casseliflavus 

(B) STRAIN: R760 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992 

ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC CATACATCGT 50 

TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA TTACTAGAAT 100 

TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA CTTCCCAGGC 150 

55 GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC TTGAAGGCGA 200 

TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA GTTGACGAAT 250 

ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT GATGCCAGTC 300 

GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA CAGGCCGTGT 3 50 

TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC GTTGGTATTG 400 

60 CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT GTTCCGTAAA 4 50 

552 



SUBSTITUTE SHEET (RULE 26) 
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TTGTTAGACT ATGCTGAAGC AGGGGATAAC 
TGTTGCTCGT GAAGACATCC AACGTGGACA 
CAATCACACC TCATACAAAA TTTAAAGCTG 
GAAGAAGGTG GACGTCATAC ACCA 



PCT/CA00/01150 

ATTGGTGCAT TGCTACGTGG 500 
AGTATTGGCT AAAGCTGGTA 550 
AAGTTTACGT TTTAACAAAA 600 

624 



2) INFORMATION FOR SEQ ID NO: 993 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 bases 
(B) — TYPE:— Nucleic— acid 



15 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
20 (C) ACCESSION NUMBER: extracted from U40453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993 

ATGGAAAACA ATAAAAAAGT ATTGAAGAAA ATGGTATTTT TTGTTTTAGT 50 

25 GACATTTCTT GGACTAACAA TCTCGCAAGA GGTATTTGCT CAACAAGACC 100 

CCGATCCAAG CCAACTTCAC AGATCTAGTT TAGTTAAAAA CCTTCAAAAT 150 

ATATATTTTC TTTATGAGGG TGACCCTGTT ACTCACGAGA ATGTGAAATC 2 00 

TGTTGATCAA CTTTTATCTC ACGATTTAAT ATATAATGTT TCAGGGCCAA 250 

ATTATGATAA ATTAAAAACT GAACTTAAGA ACCAAGAGAT GGCAACTTTA 300 

30 TTTAAGGATA AAAACGTTGA TATTTATGGT GTAGAATATT ACCATCTCTG 350 

TTATTTATGT GAAAATGCAG AAAGGAGTGC ATGTATCTAC GGAGGGGTAA 400 

CAAATCATGA AGGGAATCAT TTAGAAATTC CTAAAAAGAT AGTCGTTAAA 450 

GTATCAATCG ATGGTATCCA AAGCCTATCA TTTGATATTG AAACAAATAA 500 

AAAAATGGTA ACTGCTCAAG AATTAGACTA TAAAGTTAGA AAATATCTTA 550 

35 CAGATAATAA GCAACTATAT ACTAATGGAC CTTCTAAATA TGAAACTGGA 600 

TATATAAAGT TCATACCTAA GAATAAAGAA AGTTTTTGGT TTGATTTTTT 650 

CCCTGAACCA GAATTTACTC AATCTAAATA TCTTATGATA TATAAAGATA 700 

ATGAAACGCT TGACTCAAAC ACAAGCCAAA TTGAAGTCTA CCTAACAACC 750 

AAGTAA 756 

40 



2) INFORMATION FOR SEQ ID NO: 994 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994 
55 TGGACTAACA ATCTCGCAAG AGG 23 



2) INFORMATION FOR SEQ ID NO: 995 

60 

553 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995 

10 

ACATTCTCGT GAGTAACAGG GT 22 



15 2) INFORMATION FOR SEQ ID NO: 996 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996 

ACAAATCATG AAGGGAATCA TTTAG 25 



30 



40 



45 



60 



2) INFORMATION FOR SEQ ID NO: 997 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997 
CTAATTCTTG AGCAGTTACC ATT 23 

2) INFORMATION FOR SEQ ID NO: 998 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998 
GGAGGGGTAA CAAATCATGA AGG 23 



554 
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2) INFORMATION FOR SEQ ID NO: 999 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999 
15 TTGACCTTGT TGATGACGAA GAG . 23 



20 



2) INFORMATION FOR SEQ ID NO: 1000 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000 

30 

TTAGTGTGTG GGTTGATTGA ACT 23 



35 2) INFORMATION FOR SEQ ID NO: 1001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001 

AAGAGTTGCT TGAATTAGTT GAG 23 



50 

2) INFORMATION FOR SEQ ID NO: 1002 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
55. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS :' Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

60 

555 
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WO 01/23604 PCT/CA00/01150 
(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: ATCC 700294 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002 

AACATGATCA CTGGTGCCGC TCAAATGGAC GGAGCTATCC TTGTAGTTGC 50 
TTCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC 100 
GTCAGGTTGG TGTTAAACAC CTTATCGTGT TCATGAACAA AGTTGACCTT 150 

10 GTTGATGACG AAGAGTTGCT TGAATTAGTT GAGATGGAAA TTCGTGACCT 200 
TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTTCCAGTT ATCCAAGGTT 250 

eAGCTCTTAA-AGGTGT^GAA--GGCGACACTA-AATT-TGAAGA-CATCATCATG 30.0_ 



GAATTGATGG ATACTGTTGA TTCATACATT CCAGAACCAG AACGCGACAC 350 

TGACAAACCA TTGCTTCTTC CAGTCGAAGA CGTATTCTCA ATTACAGGTC 400 

15 GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTACTGT TCGTGTCAAC 450 

GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTAAAA AAGCTGTTGT 500 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCAGGAG 550 

ACAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA AATCGAACGT 600 

GGTCAAGTTA TTGCTAAACC AAGTTCAATC AACCCACACA CTAAATTCAA 650 

20 AGGTGAAGTA TATATCCTTT CTAAAGACGA AGGTGGACGT CACACTCCAT 700 

TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACAAC TGACGTAACA 750 

GGTTCAATCG AACTTCCAGC AGGTACAGAA ATGGTTATGC CTGGTGATAA 800 

CGTGACAATC AACGTTGAGT TGATCCACCC AATCGCCGTA GAACAAGGTA 850 

CTACTTTCTC AATCCGTGAA GGTGGACGTA CTGTTGGTTC AGGT 8 94 

25 



2) INFORMATION FOR SEQ ID NO: 1003 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
40 (B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTAC 50 

4 5 TATTATCACA GCCTGATACA GGGGAGCAAG GATTAGAAAT CGCGGAAGCA 10 0 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA CATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTCGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CAAAAACAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 300 

50 AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AA 332 



2) INFORMATION FOR SEQ ID NO: 1004 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

556 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004 

10 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 
GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 
JT-TOVAAIGGT-AAAGTXTATCG 



ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

15 TTATGATTCC ACTGCAACTA TGGTTAATGA, TATTCCTTAT AACTATC CGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 4 00 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

20 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

25 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 1000 

30 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 

CAAATACAAC CC 1212 

35 



2) INFORMATION FOR SEQ ID NO: 1005 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

55 TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 2 00 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

60 GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

557 
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CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

5 TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

10 ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

— eTATCGTTGG-AGATGGTTTC -GTAGTTGGAG GTAAAGTTTA -TGGCTGAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

15 CGTGGAGCTC ACCTGCTCGA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA TC 1212 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1006 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006 



ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

40 TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

45 ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCCAACG GTGGAATTTA CCACAAACCC ATGTATATCA 600 

50 ATAAGGTCGT CTTCAGTGAC GGTAGTAAAA AAGAATTTTC AGATGTAGGT 650 

ACACGAGCTA TGAAAGAAAC AACTGCTTAC ATGATGACCG AAATGATGAA 700 

AACTGTCTTG GCATACGGAA CTGGTCGTGG AGCCTATCTC CCATGGTTAG 750 

CGCAAGCTGG TAAGACAGGT ACTTCTAACT ACACAGATGA TGAAATTGAA 800 

AAACACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

55 TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTGTC TGAAGGAAGC AATCCAGAGG ATTGGAATAT 1000 

ACCAGAGGGG CTCTACAGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1050 

GTTCTACGTG GAGCTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1100 

60 TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAACT CAACCACTCC 1150 

558 
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AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 
CAAATACAAC CCC 1213 



2) INFORMATION FOR SEQ ID NO: 1007 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1218 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

— — — (D) — TOPOLOGY-:— Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

25 ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 2 00 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

30 TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 4 50 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

35 ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

40 TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 1000 

GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

4 5 AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 12 00 

CAAATACAAC CCCTGATC 1218 



50 2) INFORMATION FOR SEQ ID NO: 1008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

559 
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(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008 

5 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA 150 

AGTAACGTTT CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG 200 

10 GGGTTCTGCT ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG 250 

GTGTTTATGA TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT 300 

GGGGGAAGAA— GGAGAGGTGT— CTACAACTGG-GATAGAGCAT -ATT-TGGGTAA — — —350 

TATTACTCTG CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 4 00 

AGACTTTGAA TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 450 

15 CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 500 

TACAACAGAA TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG 550 

CTGCTTATGC TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT 600 

ATCCATAAAG TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT 650 

CGGAACTCGT GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA 700 

20 TGAAAACAGT CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG 750 

CTTCCTCAAG CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT 800 

TGAAAACCAC ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG 850 

TTGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

25 TCGCTCAATG ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA 1000 

CGATGCCAGA CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA 1050 

GCTCGCCCAA TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 1200 

3 0 CAATCAAATA CAACCCCTGA TCA 1223 



2) INFORMATION FOR SEQ ID NO: 1009 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009 

50 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

55 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

60 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

560 
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TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 1000 

10 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TGAAGG-TCAT— GATGAGAT-AG-TTCAACT-T.CA-CAGT.CTA6CT--.CAAC.CACT.ee 1.15_0_ 



AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 
CAAATACAAC CCCT 1214 



15 



2) INFORMATION FOR SEQ ID NO: 1010 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
30 (B) STRAIN: StrR-07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010 

CCAAGAAGCT CAAAAACATC TGTGGGATAT CTACAACTCC GATCAATACG 50 

35 TCTCTTACCC TGACGATGAT TTGCAAGTCG CATCTACGGT CGTAGATGTT 100 

TCAAATGGTA AAGTCATCGC ACAACTTGGT GCTCGTCATC AAGCAAGTAA 150 

TGTTTCATTC GGTACCAACC AGGCCGTAGA AACCAATCGT GACTGGGGAT 2 00 

CATCAATGAA ACCAATCACT GACTATGCTC CCGCTTTAGA ATATGGAGTC 250 

TATGACTCTA CTGCTTCTAT TGTACATGAT GTCCCTTATA ACTATCCTGG 300 

40 CACTGATACT CCACTCTACA ACTGGGATCA TGTCTACTTT GGAAACATTA 3 50 

CAATCCAGTA TGCTCTTCAA CAATCACGAA ATGTCACAGC CGTTGAGACT 4 00 

TTGAATAAGG TCGGTCTAGA TAGAGCTAAA ACCTTCCTTA ATGGTCTTGG 4 50 

TATCGACTAT CCAAGCATGC ATTATGCAAA CGCCATTTCA AGTAACACAA 500 

CTGAATCCAA CAAAAAATAT GGTGCAAGTA GTGAAAAAAT GGCTGCTGCC 550 

45 TACGCTGCTT TTGCTAATGG TGGTATTTAT CACAAACCAA TGTATATCAA 600 

TAAAATCGTC TTTAGTGATG GTAGCGAAAA AGAATTTTCT GATGCTGGTA 650 

CACGAGCTAT GAAAGAGACT ACTGCCTATA TGATGACTGA AATGATGAAA 700 

ACTGTTTTAA CTTACGGAAC AGGACGTGGA GCCTACCTAC CATGGCTTCC 750 

ACAAGCAGGT AAGACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA 800 

50 AGTATATCAA GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG 850 

TATACCCGTA AATATGCAAT GGCTGTTTGG ACAGGATACT CAAATCGTCT 900 

AACTCCAATC ATCGGAGATG GTTTCCTTGT TGCTGGTAAA GTCTATCGTT 950 

CAATGATAAC TTACCTTTCT GAAGATGACC AACCTGGAGA TTGGACAATG 1000 

CCAGATGGCT TGTATAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG 1050 

55 TTCTACGTGG AGCTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT 1100 

CAAGCTCATC ATCAGATAGT TCAACTTCAC AGTCTAACTC AACCACTCCA 1150 

AGCACAAATA ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC 12 00 

AAATACAACC CCTGATCAAC AAA 1223 



561 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1011 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1207 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL S OURCE: . 



15 



45 



55 



(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

20 TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

25 CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

3 0 TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT CGGAACTCGT 650 

GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA TGAAAACAGT 700 

CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT TGAAAACCAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

35 TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGCCCAA 1050 

TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA AAGTTCAAGC 1100 

4 0 TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCC 1207 



2 ) INFORMATION FOR SEQ ID NO: 1012 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1201 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012 

562 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 
TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 
TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 
5 CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 
ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 
TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 
GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 
CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 
10 TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 
ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 
.T.CTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCT.G CTGCTTATGC 550- 



TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

15 GCTATGAAAG AAACAACTGC TTACATGATG- ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

20 CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGTTCTA 1050 

CGTGGAACTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

25 AAATAATAGT ACGACTACCG ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

C 1201 



30 2) INFORMATION FOR SEQ ID NO: 1013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1220 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATCTAC AACTCCGATC 50 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC TACGGTCGTA 100 

GATGTTTCAA ATGGTAAAGT CATCGCACAA CTTGGTGCTC GTCATCAAGC 150 

AAGTAATGTT TCATTCGGTA CCAACCAGGC CGTAGAAACC AATCGTGACT 200 

50 GGGGATCATC AATGAAACCA ATCACTGACT ATGCTCCCGC TTTAGAATAT 250 

GGAGTCTATG ACTCTACTGC TTCTATTGTA CATGATGTCC CTTATAACTA 300 

TCCTGGCACT GATACTCCAC TCTACAACTG GGATCATGTC TACTTTGGAA 350 

ACATTACAAT CCAGTATGCT CTTCAACAAT CACGAAATGT CACAGCCGTT 400 

GAGACTTTGA ATAAGGTCGG TCTAGATAGA GCTAAAACCT TCCTTAATGG 450 

55 TCTTGGTATC GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA 500 

ACACAACTGA ATCCAACAAA AAATATGGTG CAAGTAGTGA AAAAATGGCT 550 

GCTGCCTACG CTGCTTTTGC TAATGGTGGT ATTTATCACA AACCAATGTA 600 

TATCAATAAA ATCGTCTTTA GTGATGGTAG CGAAAAAGAA TTTTCTGATG 650 

CTGGTACACG AGCTATGAAA GAGACTACTG CCTATATGAT GACTGAAATG 700 

60 ATGAAAACTG TTTTAACTTA CGGAACAGGA CGTGGAGCCT ACCTACCATG 750 

563 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCTTCCACAA GCAGGTAAGA CAGGTACTTC 
TTGAAAAGTA TATCAAGAAC ACTGGCTACG 
GTGGGTTATA CTCGTAAGTA TTCTATGGCT 
TCGTTTAACT CCTATCGTTG GAGATGGTTT 
5 ATCGCTCTAT GATGACCTAC CTGTCTGAAG 
AATATACCAG AGGGGCTCTA CAGAAATGGA 
TGCTCGTTCT ACGTGGAGCT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
10 ACAATCAAAT ACAACCCCTG 



PCT/CA00/01150 

TAACTATACT GACGAAGAAA 800 

TAGCTCCAGA TGAAATGTTT 850 

GTATGGACAG GTTATTCGAA 900 

CCTAGTTGCA GCTAAAGTTT 950 

GAAGCAATCC AGAGGATTGG 1000 

GAATTCGTAT TTAAAAATGG 1050 

ACAACAACCC CCATCAACTG 1100 

CTTCACAGTC TAGCTCAACC 1150 

AATCCTAACA ATAATACGCA 1200 

1220 



2) INFORMATION FOR SEQ ID NO: 1014 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 bases 

(B) TYPE: Nucleic acid 
"(C) STRANDEDNESS : Double 

20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

} (B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014 

30 CAAAAACATC TGTGGGATAT TTACAATACA GACGAATACG TTGCCTATCC 50 

AGACGATGAA TTGCAAGTCG CTTCTACCAT TGTTGATGTT TCTAACGGTA 100 

AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA TGTTTCCTTC 150 

GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 2 00 

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA 250 

35 CTGCTACTAT CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACC 300 

CCTGTTTATA ACTGGGATAG GGGCTACTTT GGCAACATCA CCTTGCAATA 350 

CGCCCTGCAA CAATCGCGAA ACGTCCCAGC CGTGGAAACT CTAAACAAGG 400 

TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTCGG AATCGACTAC 450 

CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 500 

40 CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT 550 

TTGCAAATGG TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC 600 

TTTAGTGATG GGAGTGAAAA AGAGTTCTCT AATGTCGGAA CTCGTGCCAT 650 

GAAGGAAACG ACAGCCTATA TGATGACCGA. CATGATGAAA ACTGTCTTAG 700 

TATACGGAAT CGGACGTGGA GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 750 

45 AAAACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA AGTATATCAA 800 

GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG TATACCCGCA 850 

AATATGCAAT GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT 900 

GTAGGCGATG GCCTTACGGT CGCTGCTAAA GTTTACCGCT CTATGATGAC 950 

CTACCTGTCT GAAGGAAGCA ATCCAGAGGA TTGGAATATA CCAGAGGGGC 1000 

50 TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG TTCTACGTGG 1050 

AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1100 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA 1150 

ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC AAATACAAC 1199 



55 

2) INFORMATION FOR SEQ ID NO: 1015 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1211 bases 

564 



SUBSTITUTE SHEET (RULE 26) 



10 



40 



50 



WO 01/23604 PCT/CAOO/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 

(xi) SEQUENCE DESCRIPTION:, SEQ ID NO: 1015 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

15 TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG, TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 3 00 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

2 0 CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

25 TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC . 850 

30 TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

35 TCATCATCAG ATAGTTCAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 12 00 

CAACCCCTGA T 1211 



2) INFORMATION FOR SEQ ID NO: 1016 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA 150 

60 AGTAATGTTT CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG 200 

565 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GGGATCATCA ATGAAACCAA TCACTGACTA 
GAGTCTATGA CTCTACTGCT TCTATTGTAC 
CCTGGCACTG ATACTCCACT CTACAACTGG 
CATTACAATC CAGTATGCTC TTCAACAATC 
5 AGACTTTGAA TAAGGTCGGT CTAGATAGAG 
CTTGGTATCG ACTATCCAAG CATGCATTAT 
CACAACTGAA TCCAACAAAA AATATGGTGC 
CTGCCTACGC TGCTTTTGCT AATGGTGGTA 
ATCAATAAAA TCGTCTTTAG TGATGGTAGC 

10 TGGTACACGA GCTATGAAAG AGACTACTGC 
TGAAAACTGT TTTAACTTAC GGAACAGGAC 
CTTCCACAAG CAGGTAAGAC AGGTACTTCT 
TGAAAAGTAT ATCAAGAACA CTGGCTACGT 
TGGGTTATAC TCGTAAGTAT TCTATGGCTG 

15 CGTTTAACTC CTATCGTTGG AGATGGTTTC 
TCGCTCTATG ATGACCTACC TGTCTGAAGG 
ATATACCAGA GGGGCTCTAC AGAAATGGAG 
GCTCGTTCTA CGTGGAGCTC ACCTGCTCCA 
AAGTTCAAGC TCATCATCAG ATAGTTCAAC 

20 CTCCAAGCAC AAATAATAGT ACGACTACCA 
CAATCAAATA CAACCCCTGA TC 



PCT/CA00/01150 

TGCTCCCGCT TTAGAATATG 250 

ATGATGTCCC TTATAACTAT 300 

GATCATGTCT ACTTTGGAAA 350 

ACGAAATGTC ACAGCCGTTG 4 00 

CTAAAACCTT CCTTAATGGT 450 

GCAAACGCCA TTTCAAGTAA 500 

AAGTAGTGAA AAAATGGCTG 550 

TTTATCACAA ACCAATGTAT 600 

GAAAAAGAAT TTTCTGATGC 650 

CTATATGATG ACTGAAATGA 700 

GTGGAGCCTA CCTACCATGG 750 

AACTATACTG ACGAAGAAAT 800 

AGCTCCAGAT GAAATGTTTG 850 

TATGGACAGG TTATTCGAAT 900 

CTAGTTGCAG CTAAAGTTTA 950 

AAGCAATCCA GAGGATTGGA 1000 

AATTCGTATT TAAAAATGGT 1050 

CAACAACCCC CATCAACTGA 1100 

TTCACAGTCT AGCTCAACCA 1150 

ATCCTAACAA TAATACGCAA 12 00 

1222 



25 2) INFORMATION FOR SEQ ID NO: 1017 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG 50 

AATACGTTGC CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT 100 

GATGTTTCTA ACGGTAAAGT CATTGCCCAG CTAGGAGCAC GCCATCAGTC 150 

AAGTAATGTT TCCTTCGGAA TTAACCAAGC AGTAGAAACA AACCGCGACT 200 

45 GGGGATCAAC TATGAAACCG ATCACAGACT ATGCTCCTGC CTTGGAGTAC 250 

GGTGTCTACG ATTCAACTGC TACTATCGTT CACGATGAGC CCTATAACTA 3 00 

CCCTGGGACA AATACCCCTG TTTATAACTG GGATAGGGGC TACTTTGGCA 350 

ACATCACCTT GCAATACGCC CTGCAACAAT CGCGAAACGT CCCAGCCGTG 4 00 

GAAACTCTAA ACAAGGTCGG ACTCAACCGC GCCAAGACTT TCCTAAATGG 4 50 

50 TCTCGGAATC GACTACCCAA GTATTCACTA CTCAAATGCC ATTTCAAGTA 500 

ACACAACCGA ATCAGACAAA AAATATGGAG CAAGTAGTGA AAAGATGGCT 550 

GCTGCTTACG CTGCCTTTGC AAATGGTGGA ACTTACTATA AACCAATGTA 600 

TATCCATAAA GTCGTCTTTA GTGATGGGAG TGAAAAAGAG TTCTCTAATG 650 

TCGGAACTCG TGCCATGAAA GAAACAACTG CTTACATGAT GACCGAAATG 700 

55 ATGAAAACTG TCCTGGCATA CGGAAGTGGT CGTGGAGCCT ATCTCCCATG 750 

GTTAGCGCAA GCTGGTAAGA CAGGTACTTC TAACTACACA GATGATGAAA 800 

TTGAAAAACA CATCAAGAAC ACTGGCTATG TAGCTCCAGA TGAAATGTTT 850 

GTTGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 900 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

60 ATCGCTCAAT GATAACGTAT CTATCAAAAG AT ACT CATC C AGAAGACTGG 1000 

566 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ACGATGCCAG ACGGACTTTT CAGAAACGGG 
AGCTCGTTCT ACGTGGAACT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
5 ACAATCAAAT ACAACCCCTG ATCAACAAA 



PCT/CA00/01150 

GAATTTGTAT TCAAAAATGG 10 50 
ACAACAACCC CCATCAACTG 1100 
CTTCACAGTC TAACTCAACC 1150 
AATCCTAACA ATAATACGCA 1200 

1229 



2) INFORMATION FOR SEQ ID NO: 1018 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases " " 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1018 

25 ATGTAGACCA AGAGGCTCAA AAACGTCTGT GGGATATCTA CAACTCCGAT 50 

CAATACGTCT CTTACCCTGA CGATGATTTG CAAGTCGCAT CTACGGTCGT 100 

AGATGTTTCA AATGGTAAAG TCATCGCCCA ACTTGGAGCT CGTCACCAAG 150 

CAAGTAACGT TTCATTTGGT ACCAACCAAG CTGTGGAAAC CAACCGTGAC 200 

TGGGGATCAA GCATGAAACC AATCACTGAT TATGCCCCAG CCTTAGAATA 250 

30 TGGTGTATAT GATTCCACTG CAACTATGGT TAATGATATT CCTTATAACT 300 

ATCCGGGCAC AAGCACACCT GTCTACAACT GGGATCGAGC ATATTTTGGT 350 

AATATTAGCC TGCAATATGC CCTTCAACAA TCTCGTAACG TGCCTGCCGT 4 00 

TGAAACACTA AACAAGGTTG GTTTAGATAG AGCCAAAACT TTCCTAAATG 450 

GTTTGGGAAT TGACTATCCA AGTATTCACT ACTCAAATGC TATTTCAAGT 500 

35 AATACAACTG AATCTAGTAA ACAGTACGGG GCAAGCAGTG AGAAAATGGC 550 

TGCGGCTTAC GCTGCATTCG CTAATGGCGG TATTTACCAC AAACCAATGT 600 

ACATCAATAA AGTTGTCTTT AGTGATGGTA GCGAAAAAGA ATTTTCTGAT 650 

GCTGGTACAC GAGCTATGAA AGAGACTACT GCCTATATGA TGACTGAAAT 700 

GATGAAAACT GTTTTAACTT ACGGAACAGG ACGTGGAGCC TACCTACCAT 750 

40 GGCTTCCACA AGCAGGTAAG ACAGGTACTT CTAACTATAC TGACGACGAA 800 

ATTGAAAAGT ATATCAAGAA CACTGGCTAC GTAGCTCCAG ATGAAATGTT 85 0 

TGTGGGTTAT ACTCGTAAGT ATTCTATGGC TGTATGGACT GGATACTCAA 900 

ATCGTTTAAC TCCAATCATT GGAGATGGTT TCCTAGTTGC TGCCAAAGTT 950 

TATCGCTCAA TGATATCGTA TCTATCAGAA GATGACCATC CTGGAGATTG 1000 

45 GACAATGCCT GAGGGAGTAT ACAGAAGTGG AGAATTCGTA TTTAAAAATG 1050 

GTGCTCGTTC TACGTGGAGC TCCCCTGCTC CACAACAACC CCCATCAACT 1100 

GAAAGTTCAA GCTCATCATC AGATAGTTCA ACTTCACAGT CTAGCTTAAC 1150 

CACTCCAAGC ACAAATAATA GTACGACTAC CAATCCTAAC AACAATACGC 12 00 

AACAATCAAA TACAACCCCT GATCA 122 5 

50 



2) INFORMATION- FOR SEQ ID NO: 1019 

55 • (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1439 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 

567 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
5 (B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

10 TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 

TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 3 00 

15 AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 350 

TTTGCTGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 400 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 450 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAACGGGAG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 550 

20 AGGCTGCGAC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 650 

TTCTTGGTAT AAATTGGCAT ATGGATCTTT TCCTATTACA GCTGTGGAAG 700 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 750 

ATGGGCCAGA CCTATCAACC AAATATGTTT GTTGGAACCA GCAATTTGGA 800 

25 AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTCCCAAA 900 

GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

30 AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT ATATCCGACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

ATCAGGTTGC CCATGGTACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 12 00 

AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 1250 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 13 00 

35 CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 13 50 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 14 00 

GTATCAAAAA TACCATCCAA TGAACTAGAA AGGAAATTA 1439 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1020 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1441 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 10 0 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

60 GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 
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< TGAAGAGACC 
AATATGGCAA 
AACAATATCA 
TTTGCTGAAA 
5 ATTCTGAAGG 
TTATCCATGT 
TGATTCCTTG 
AGGCTGCGAC 
CAAACCTTAA 

10 TTCTTGGTAT 
CCTTGGAGTA 
ATGGGCCAGA 
AACAGCTATG 
CTGCGACCGG 

15 GAGTATAGCT 
CTATACGCCC 
GTGTTCGTGT 
AAGGGAGGAC 
TAAGGTCAAT 

20 ATCAGGTTGC 
AATGGTGCCT 
TGTGGCAGAT 
CATCTGATAA 
AATCTAACAA 

25 GTATCAAAAA 



TTACAAGGAA 
TATGGAAAGC 
AACTGACCAT 
AGTTATTTCA 
TGTCTATGCA 
CAGGGATCAA 
GGAACGGTAA 
CATCAGCTCA 
CAGATCAGCC 
AAATTGGCAT 
TTCATCCAAT 
CCTATCAACC 
GGAAAACTTC 
AATTGACCTA 
TTGCTAATTA 
ATGCAGTTGG 
GGCTCCTCGT 
TGGGTGACTT 
ATATCCGACT 
CCATGGTACT 
TGGTATCCAT 
GGTCAGCAAG 
TCCCCAAATC 
ATGGTGTAGG 
TACCATCCAA 



AACGCTCGGT 
GTGGACACAA 
TGATTTGGCC 
ATTCCGAGCT 
GTCGCCCTTA 
ACATGACCTG 
CCAATGTCTT 
GGTTGGGAAA 
TATTGTTTTC 
ATGGATCTTT 
GCTTACATGG 
AAATATGTTT 
GTGCGACCTT 
CCAGATGAAT 
CATCACCAAT' 
CTCAGTATGT 
ATTGTTGAAG 
GATTCAGCAA 
CCGATATGAG 
AGTGGATTGA 
TAGCGGAAAA 
CAACCAATAC 
GCTGTCGCAG 
ACCTTCCATT 
TGAACTAGAA 



AAAAGAAATC 
TTGAGGAAGG 
TTCCAAGATA 
AGGAAATGGT 
ACCCAAAAAC 
AAAACGGGAG 
TGTCCCAGGT 
ATGGTGTTTT 
CAAGGTTCAG 
TCCTATTACA 
TTCAAACCGC 
GTTGGAACCA 
TGGCGAATAT 
CTACTGGATT 
TCCTTTGGGC 
AGCAACTATT 
GCATTTATGG 
CTGCAACCGA 
CATCTTGCAC 
CAACTGGACG 
ACAGGTACAG 
CAATGCGGTG 
TGGTCTTTCC 
GCGCGTGACA 
AGGAAATTAT 



PCT/CA00/01150 

CATCTGGATA 250 

TAGTAAGGGA 300 

GCGTGGATGC 350 

GGAGCTAAAT 4 00 

AGGTGCTGTT 4 50 

AGTTGACTCC 500 

TCGGTTGTCA 550 

ATCAGGAAAC 600 

CTCCAATTTA 650 

GCTGTGGAAG 700 

TCTTGGAATC 750 

GCAATTTGGA 800 

GGCTTGGGGG 850 

TGTTCCCAAA 900 

AGTTTGATAA 950 

GCAAATAATG 1000 

TAATAATGAT 1050 

CAGAGATGAA 1100 

CAAGGTTTTT 1150 

TGCCTTTTCA 12 00 

CCGAAAGCTA 12 50 

GCCTATGCCC 13 00 

TCATAATACC 13 50 

TTATCAATCT 1400 

G 1441 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1021 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1396 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 



45 TATTGCCTCT 
GGGATCGAAA 
TCCAGTGAAA 
AAAAGGCTAT 
AATATGAAGA 

50 GATAAATATG 
GGGAAACAAT 
ATGCTTTGCT 
AAATATTCTG 
TGTTTTATCC 

55 CTCCTGATTC 
GTCAAGGCTG 
AAACCAAACC 
TTTATTCTTG 
GAAGCCTTGG 

60 AATCATGGGC 



ATTTCAAAGG 
GGTTTTGGAA 
AAGCTGGTCT 
TCTCTAAATG 
GACCTTACAA 
GCAATATGGA 
ATCAAACTGA 
GAAAAGTTAT 
AAGGTGTCTA 
ATGTCAGGGA 
CTTGGGAACG 
CGACCATCAG 
TTAACAGATC 
GTATAAATTG 
AGTATTCATC 
CAGACCTATC 



AGATGCCTGG 
ACTTCCCTTT 
CCCAGCGGAA 
ACCGTGTTGG 
GGAAAACGCT 
AAGCGTGGAC 
CCATTGATTT 
TTCAATTCCG 
TGCAGTCGCC 
TCAAACATGA 
GTAACCAATG 
CTCAGGTTGG 
AGCCTATTGT 
GCATATGGAT 
CAATGCTTAC 
AACCAAATAT 



ID NO: 1021 




CATTAGTATT 


TCTACTTCTT 


50 


CTTCTATAGT 


AGGGAGTGTA 


100 


GAAGCAGAAT 


CCTATCTTAA 


150 


AACCTCCTAT 


TTGGAAAAGC 


200 


CGGTAAAAGA 


AATCCATCTG 


250 


ACAATTGAGG 


AAGGTAGTAA 


300 


GGCCTTCCAA 


GATAGCGTGG 


350 


AGCTAGGAAA 


TGGTGGAGCT 


400 


CTTAACCCAA 


AAACAGGTGC 


450 


CCTGAAAACG 


GGAGAGTTGA 


500 


TCTTTGTCCC 


AGGTTCGGTT 


550 


GAAAATGGTG 


TTTTATCAGG 


600 


TTTCCAAGGT 


TCAGCTCCAA 


650 


CTTTTCCTAT 


TACAGCTGTG 


700 


ATGGTTCAAA 


CCGCTCTTGG 


750 


GTTTGTTGGA 


ACCAGCAATT 


800 
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TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

5 AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

10 GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATT 1396 



15 



2) INFORMATION FOR SEQ ID NO: 1022 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1428 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022 

AAAGGAGATG CCTGGCATTA GTATTTCTAC TTCTTGGGAT CGAAAGGTTT 50 

TGGAAACTTC CCTTTCTTCT ATAGTAGGGA GTGTATCCAG TGAAAAAGCT 100 

GGTCTCCCAG CGGAAGAAGC AGAATCCTAT CTTAAAAAAG GCTATTCTCT 150 

35 AAATGACCGT GTTGGAACCT CCTATTTGGA AAAGCAATAT GAAGAGACCT 200 

TACAAGGAAA ACGCTCGGTA AAAGAAATCC ATCTGGATAA ATATGGCAAT 2 50 

ATGGAAAGCG TGGACACAAT TGAGGAAGGT AGTAAGGGAA ACAATATCAA 300 

ACTGACCATT GATTTGGCCT TCCAAGATAG CGTGGATGCT TTGCTGAAAA 350 

GTTATTTCAA TTCCGAGCTA GGAAATGGTG GAGCTAAATA TTCTGAAGGT 4 00 

40 GTCTATGCAG TCGCCCTTAA CCCAAAAACA GGTGCTGTTT TATCCATGTC 4 50 

AGGGATCAAA CATGACCTGA AAACGGGAGA GTTGACTCCT GATTCCTTGG 500 

GAACGGTAAC CAATGTCTTT GTCCCAGGTT CGGTTGTCAA GGCTGCGACC 550 

ATCAGCTCAG GTTGGGAAAA TGGTGTTTTA TCAGGAAACC AAACCTTAAC 600 

AGATCAGCCT ATTGTTTTCC AAGGTTCAGC TCCAATTTAT TCTTGGTATA 650 

4 5 AATTGGCATA TGGATCTTTT CCTATTACAG CTGTGGAAGC CTTGGAGTAT 700 

TCATCCAATG CTTACATGGT TCAAACCGCT CTTGGAATCA TGGGCCAGAC 750 

CTATCAACCA AATATGTTTG TTGGAACCAG CAATTTGGAA ACAGCTATGG 800 

GAAAACTTCG TGCGACCTTT GGCGAATATG GCTTGGGGGC TGCGACCGGA 850 

ATTGACCTAC CAGATGAATC TACTGGATTT GTTCCCAAAG AGTATAGCTT 900 

50 TGCTAATTAC ATCACCAATT CCTTTGGGCA GTTTGATAAC TATACGCCCA 950 

TGCAGTTGGC TCAGTATGTA GCAACTATTG CAAATAATGG TGTTCGTGTG 1000 

GCTCCTCGTA TTGTTGAAGG CATTTATGGT AATAATGATA AGGGAGGACT 1050 

GGGTGACTTG ATTCAGCAAC TGCAACCGAC AGAGATGAAT AAGGTCAATA 1100 

TATCCGACTC CGATATGAGC ATCTTGCACC AAGGTTTTTA TCAGGTTGCC 1150 

55 CATGGTACTA GTGGATTGAC AACTGGACGT GCCTTTTCAA ATGGTGCCTT 1200 

GGTATCCATT AGCGGAAAAA CAGGTACAGC CGAAAGCTAT GTGGCAGATG 1250 

GTCAGCAAGC AACCAATACC AATGCGGTGG CCTATGCCCC ATCTGATAAT 1300 

CCCCAAATCG CTGTCGCAGT GGTCTTTCCT CATAATACCA ATCTAACAAA 13 50 

TGGTGTAGGA CCTTCCATTG CGCGTGACAT TATCAATCTG TATCAAAAAT 140 0 

60 ACCATCCAAT GAACTAGAAA GGAAATTA 1428 

570 
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2) INFORMATION FOR SEQ ID NO: 1023 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

-(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023 

2 0 TGCCTCTATT TCAAAGGAGA TGCCTGGCAT TAGTATTTCT ACTTCTTGGG 50 

ATCGAAAGGT TTTGGAAACT TCCCTTTCTT CTATAGTAGG GAGTGTATCC 100 

AGTGAAAAAG CTGGTCTCCC AGCGGAAGAA GCAGAATCCT ATCTTAAAAA 150 

AGGCTATTCT CTAAATGACC GTGTTGGAAC CTCCTATTTG GAAAAGCAAT 200 

ATGAAGAGAC CTTACAAGGA AAACGCTCGG TAAAAGAAAT CCATCTGGAT 250 

25 AAATATGGCA ATATGGAAAG CGTGGACACA ATTGAGGAAG GTAGTAAGGG 300 

AAACAATATC AAACTGACCA TTGATTTGGC CTTCCAAGAT AGCGTGGATG 350 

CTTTGCTGAA AAGTTATTTC AATTCCGAGC TAGGAAATGG TGGAGCTAAA 400 

TATTCTGAAG GTGTCTATGC AGTCGCCCTT AACCCAAAAA CAGGTGCTGT 450 

TTTATCCATG TCAGGGATCA AACATGACCT GAAAACGGGA GAGTTGACTC 500 

30 CTGATTCCTT GGGAACGGTA ACCAATGTCT TTGTCCCAGG TTCGGTTGTC 550 

AAGGCTGCGA CCATCAGCTC AGGTTGGGAA AATGGTGTTT TATCAGGAAA 600 

CCAAACCTTA ACAGATCAGC CTATTGTTTT CCAAGGTTCA GCTCCAATTT 650 

ATTCTTGGTA TAAATTGGCA TATGGATCTT TTCCTATTAC AGCTGTGGAA 700 

GCCTTGGAGT ATTCATCCAA TGCTTACATG GTTCAAACCG CTCTTGGAAT 750 

35 CATGGGCCAG ACCTATCAAC CAAATATGTT TGTTGGAACC AGCAATTTGG 800 

AAACAGCTAT GGGAAAACTT CGTGCGACCT TTGGCGAATA TGGCTTGGGG 850 

GCTGCGACCG GAATTGACCT ACCAGATGAA TCTACTGGAT TTGTTCCCAA 900 

AGAGTATAGC TTTGCTAATT ACATCACCAA TTCCTTTGGG CAGTTTGATA 950 

ACTATACGCC CATGCAGTTG GCTCAGTATG TAGCAACTAT TGCAAATAAT 1000 

40 GGTGTTCGTG TGGCTCCTCG TATTGTTGAA GGCATTTATG GTAATAATGA 1050 

TAAGGGAGGA CTGGGTGACT TGATTCAGCA ACTGCAACCG ACAGAGATGA 1100 

ATAAGGTCAA TATATCCGAC TCCGATATGA GCATCTTGCA CCAAGGTTTT 1150 

TATCAGGTTG CCCATGGTAC TAGTGGATTG ACAACTGGAC GTGCCTTTTC 1200 

AAATGGTGCC TTGGTATCCA TTAGCGGAAA AACAGGTACA GCCGAAAGCT 1250 

45 ATGTGGCAGA TGGTCAGCAA GCAACCAATA CCAATGCGGT GGCCTATGCC 1300 

CCATCTGATA ATCCCCAAAT CGCTGTCGCA GTGGTCTTTC CTCATAATAC 1350 

CAATCTAACA AATGGTGTAG GACCTTCCAT TGCGCGTGAC ATTATCAATC 14 00 

TGTATCAAAA ATACCATCCA ATGAACTAGA AAGGAAATTA TG 1442 



50 



2) INFORMATION FOR SEQ ID NO: 1024 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

571 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCCTATTT GGAAAAGCAA 200 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

15 GCTTTGCTGA AAAGTTATTT CAATTCCGAG, CTAGGAAATG GTGGAGCTAA 400 

ATATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCTG 450 

TTTTATCCAT GTCAGGGATC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CAGGTTGGGA AAATGGTGTT TTATCAGGAA 600 

20 ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

25 GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGAGTATAG CTTTGCTAAT TACATCACCA ATTCCTTTGG GCAGTTTGAT 950 

AACTATACAC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATAA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

30 AATAAGGTCA ATATATCCGA CTCCGATATG AGCATCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCCCATGGTA CTAGTGGATT GACAACTGGA CGTGCCTTTT 1200 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACAGGTAC AGCCGAAAGC 1250 

TATGTGGCAG ATGGTCAGCA AGCAACCAAT ACCAATGCGG TGGCCTATGC 1300 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 1350 

35 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTATCAAA AATACCATCC AATGAACTAG AAAGGAAATT ATGCT 1445 



40 2) INFORMATION FOR SEQ ID NO: 1025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGA TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCGTATTT GGAAAAGCAA 200 

60 TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 
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TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 300 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCCAA 400 

GTATTCTGAG GGTGTGTATG CAGTCGCCCT TAACCCCAAA ACAGGTGCTG 450 

5 TTTTGTCTAT GTCAGGACTC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CTGGCTGGGA AAATGGTGTT TTATCAGGAA 600 

ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

10 AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAG CTTTGCTAAT TACATCACCA ATGCCTTTGG GCAGTTTGAT 950 

15 AACTATACGC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTATCTTGC ACCAAGGATT 1150 

TTACCAAGTA TCGCATGGAA CTAGTCCCCT TACGACAGGA CGGGCGTTTT 1200 

20 CAGATGGCGC CACTGTTTCT ATCAGTGGTA AGACCGGTAC AGGTGAAAGC 1250 

TATGTAGCTG GTGGTCAAGA AGCTAATAAT ACCAATGCCG TGGCCTATGC 1300 

TCCAACAGAA AATCCTCAAA TTGCAGTTGC AGTAGTCTTT CCTCATAATA 1350 

CCAATTTAAC CAAAAATGTT GGGCCAGCAA TTGCTCGCGA CATTATCAAT 1400 

TTATATAACC AACACCATCC AATGAATTAG AAAGGAAGCC A 1441 

25 



2) INFORMATION FOR SEQ ID NO: 1026 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

45 GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

50 GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

55 GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

60 TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 
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GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

5 TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

10 GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTA 1443 



15 



2) INFORMATION FOR SEQ ID NO: 1027 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1443 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) . ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

35 AAAAGGCTAT TCTCTAAATG ATCGTGTAGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGG AAACAAT ATCAAACTGA CCATTGATTT GGCTTTCCAA GATAGCGTGG 35 0 

ATGCTTTACT GAAAAGTTAT TTCAATTCCG AGCTAGAAAA TGGTGGAGCC 400 

4 0 AAGTATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 45 0 

TGTTTTGTCT ATGTCAGGGA TTAAACATGA CTTGAAAACG GGAGATTTAA 500 

CACCTGATTC CTTGGGAACA GTAACCAATG TCTTTGTCCC GGGTTCTGTT 550 

GTCAAGGCGG CGACCATCAG CTCTGGTTGG GAGAATGGAG TCTTATCAGG 600 

AAATCAGACC TTGACAGACC AACCGATTGT CTTCCAAGGT TCAGCTCCGA 650 

4 5 TTAATTCTTG GTACACTCAG GCTTACGATT CATTTCCGAT TACAGCGGTG 700 

GAAGCCTTGG AGTATTCTTC TAATGCCTAT ATGGTCCAAA CAGCTCTAGG 750 

TCTTATGGGG CAGACCTACC AACCCAATAT GTTTGTCGGC ACCAGCAATC 800 

TAGAGTCTGC TATGGGGAAA TTGCGTTCAA CCTTTGGTGA ATATGGCTTG 850 

GGCTCTGCGA CTGGGATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

50 CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 95 0 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTG GAAGGCATTT ATGGCAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

55 TTTTATCAGG TTGCTCATGG GACTAGCGGA TTGACAACAG GTCGTGCCTT 1200 

TTCCAATGGT GCAGCTGTAT CCATTAGTGG AAAAACAGGT ACCGCCGAAA 1250 

GTTATGTAGC AGGTGGCCAA GAAGCCAACA ATACTAATGC TGTAGCCTAT 1300 

GCACCATCAG ATAATCCTCA AATAGCTGTT GCTGTTGTCT TCCCTCATAA 1350 

CACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GATATTATCA 14 00 

60 ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAT TTA 1443 

574 
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2) INFORMATION FOR SEQ ID NO: 1028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACCTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCTCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG GACTTCTTAC CTGGAAAAAC 200 

AATACGAGGA AACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAGTATTCTG AAGGCGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGACTTGA 500 

CGCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGGTCAGTA 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC TAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGC GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTT GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGC 144 6 



2) INFORMATION FOR SEQ ID NO: 1029 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1423 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

575 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 9 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATAGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTTG GGAGTGTATC 100 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAAGCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTAGGAA CCTCCTATTT GGAAAAGCAA 200 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

TAAATATGGC AATATGGAAA GCGTGGATAC AATTGAGGAA GGTAGTAAGG 300 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

15 GCTTTACTGA AAAGTTATTT CAATTCTGAG, CTAGAAAATG GTGGAGCCAA 400 

GTATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCGG 4 50 

TTTTGTCTAT GTCAGGGATT AAACATGACT TGAAAACGGG AGAGTTGACG 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTTCCAG GTTCGGTTGT 550 

CAAGGCGGCG ACCATCAGCT CAGGTTGGGA AAATGGAGTC TTGTCAGGAA 600 

20 ACCAGACCTT GACAGACCAG TCCATTGTCT TCCAAGGTTC AGCTCCCATC 650 

AATTCTTGGT ATACTCAGGC TTACGGTTCA TTCCCTATCA CAGCGGTCCA 700 

AGCTCTGGAG TATTCATCCA ATGCTTATAT GGTCCAAACA GCCTTAGGTC 750 

TTATGGGGCA GACCTATCAA CCCAATATGT TTGTCGGCAC CAGCAATCTA 800 

GAGTCTGCTA TGGGTAAATT GCGTTCAACC TTTGGCGAAT ATGGCTTGGG 850 

25 GGCTGCGACT GGGATTGATC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAA CTTTGCCAAT TTCATTACCA ATGCCTTTGG GCAGTTTGAT 950 

AACTATACCC CAATGCAATT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

30 AATAAGGTCA ATATATCCGA CTCCGATATG AGTGTCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCTCATGGGA CTAGTGGGTT GACAACTGGC CGTGCCTTTT 1200 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACGGGTAC AGCCGAAAGC 1250 

TATGTGGCAG ATGGTCAGGA AGCAACCAAT ACCAATGCGG TGGCCTATGC 1300 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 1350 

35 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTATCAAA AATACCATCC AAT 1423 



40 2) INFORMATION FOR SEQ ID NO: 1030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 47 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1030 



TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 200 

60 AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

576 
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GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

5 TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

10 GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

15 ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

20 TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGCT 144 7 

25 



2) INFORMATION FOR SEQ ID NO: 1031 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) , TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACCTCTTG 50 

4 5 GGATAGAAAG GTTTTGGAAA CTTCTCTTTC TTCTATAGTA GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

AAAGGCTATT CTCTAAATGA CCGTGTTGGG ACTTCTTACC TGGAAAAACA 200 

ATACGAGGAA ACCTTACAAG GAAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGCGTGGATA CAATTGAGGA AGGTAGTAAG 300 

50 GGAAACAATA TCAAACTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 350 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCTA 400 

AGTATTCTGA AGGCGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

■ GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGACTTGAC 500 

GCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGGTCAGTAG 550 

55 TTAAGGCCGC TACCATCAGC TCAGGTTGGG AAAATGGTGT TTTATCAGGA 600 

AACCAAACCT TAACAGATCA GCCTATTGTT TTCCAAGGTT CAGCTCCAAT 650 

TTATTCTTGG TATAAATTGG CATATGGATC TTTTCCTATT ACAGCTGTGG 700 

AAGCCTTGGA GTATTCATCT AATGCTTACA TGGTTCAAAC CGCTCTTGGA 750 

ATCATGGGCC AGACCTATCA ACCAAATATG TTTGTTGGAA CCAGCAATTT 800 

60 GGAAACAGCT ATGGGAAAAC TTCGTGCGAC CTTTGGCGAA TATGGCTTGG 850 

577 
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GGGCTGCGAC CGGAATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

AAAGAGTATA GCTTTGCTAA TTACATTACT AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGTAATAAT 1050 

5 GATAAGGGAG GACTGGGTGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCCGATAT GAGCATCTTG CACCAAGGTT 1150 

TTTATCAGGT TGCCCATGGT ACTAGTGGAT TGACAACTGG ACGTGCCTTT 1200 

TCAAATGGCG CCTTGGTATC CATTAGCGGA AAAACAGGTA CAGCCGAAAG 1250 

CTATGTGGCA GATGGTCAGC AAGCAACCAA TACCAATGCG GTGGCCTATG 1300 

10 CCCCATCTGA TAATCCCCAA ATCGCTGTTG CAGTGGTCTT TCCTCATAAT 1350 

ACCAATCTAA CAAATGGTGT AGGACCTTCC ATTGCGCGTG ACATTATCAA 1400 

TCTGTATCAA AAATACCATC CAATGAACTA GAAAGGAAAT TATGCT 1446 



15 



2) INFORMATION FOR SEQ ID NO: 1032 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1446 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACTTCTTG 50 

GGATAGAAAG GTTTTGGAAA CTTCCCTTTC TTCTATAGTT GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

35 AAAGGCTATT CTCTAAATGA TCGTGTTGGA ACCTCCTATT TGGAAAAGCA 200 

ATATGAAGAG ACCTTACAAG GGAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGTGTGGATA CAATTGAGGA AGGTAGTAAG 300 

GGAAACAATA TCAAGCTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 350 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCCA 400 

4 0 AGTATTCTGA GGGTGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGAGTTGAC 500 

TCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGTTCGGTTG 550 

TCAAGGCTGC GACCATCAGC TCTGGCTGGG AAAATGGAGT CTTATCAGGA 600 

AACCAGACCT TGACAGACCA GTCCATTGTC TTTCAAGGTT CAGCTCCCAT 650 

45 CAATTCTTGG TATACTCAGG CTTACGGTTC ATTCCCTATC ACAGCAGTCC 700 

AAGCTCTGGA GTATTCATCT AATGCCTATA TGGTCCAAAC AGCTTTAGGT 750 

CTTATGGGGC AGACCTACCA ACCTAATATG TTTGTCGGCA CCAGCAACCT 800 

AGAGTCTGCT ATGGGGAAAT TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 850 

GTTCTGCGAC CGGGATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

50 AAAGACTATA GCTTTGCTAA TTACATCACC AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGAAATAAT 1050 

GATAAGGGAG GCCTAGGCGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCTGATAT GAGTATTTTG CACCAAGGTT 1150 

55 TTTATCAGGT TGCTCATGGG ACTAGTGGAT TGACAACTGG ACGTGCCTTT 1200 

TCAAATGGCG CAGCGGTATC CATTAGTGGA AAAACAGGTA CTGCCGAAAG 1250 

TTATGTTGAG GGTGGTCAAG AAGCTAACAA TACTAATGCT GTGGCCTATG 130 0 

CACCATCAGA TAATCCTCAA ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 1350 

ACCAACCTTA CAAATGGTGT CGGACCTTCC ATTGCGCGCG ATATTATCAA 1400 

60 CCTCTATAAC CAACATCATC CAATGAATTA GAAAGGAACA TATGCT 1446 

578 
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2) INFORMATION FOR SEQ ID NO: 1033 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033 

20 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCGTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

25 GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAGCTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCC 4 00 

AAGTATTCTG AGGGTGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

30 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

35 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

40 GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCTCATGG GACTAGTGGA TTGACAACTG GACGTGCTTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGTGG GAAAACAGGT ACTGCCGAAA 1250 

45 GTTATGTTGC AGGTGGTCAA GAAGCCAACA ATACCAACGC GGTGGCCTAT 13 00 

GCCCCATCAG ATAATCCTCA AATCGCTGTT GCCGTTGTCT TCCCTCATAA 13 50 

CACCAATCTA ACAAATGGTG TTGGACCTTC TATTGCACGC GATATTATCA 1400 

ACCTCTATAA CCAACACCAT CCAATGAATT AGAAAGGAAC TTATGC 1446 



50 



2) INFORMATION FOR SEQ ID NO: 1034 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

579 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034 



GGAAACTGCA GAGGTCAAGG GGATTGATTT TACAACCAGT CCCAATCGTA 50 

GTTATCCAAA CGGACAATTT GCTTCTAGTT TTATCGGACT AGCTCAGCTC 100 

10 CATGAAAATG AAGATGGCAG CAAGAGCTTA CTGGGAACCT CTGGAATGGA 150 

GAGTTCCTTG AACAGTATTC TTGCAGGGAC AGACGGTATT ATTACCTATG 200 

AAAAAGACCG TGTAGGAAAT ATCGTACCAG " GTACAGAACT GGTATCGCAft 250 

CAAACTGTGG ATGGCAAGGA TGTTTATACA ACATTGTCTA GTCCGCTACA 3 00 

ATCTTTCATG GAAACTCAGA TGGATGCCTT TCTAGAAAAA GTAAAAGGTA 350 

15 AGTATATGAC CGCGACCTTG GTCAGTGCAA, AGACCGGTGA AATCCTCGCT 400 

ACCACCCAAC GACCTACCTT TAATGCAGAT ACTAAAGAAG GAATCACTGA 4 50 

GGACTTTGTT TGGCGTGATA TTCTTTATCA AAGTAACTAT GAACCAGGAT 500 

CAGCCATGAA GGTTATGACG TTAGCTTCTT CTATTGATAA TAATACCTTC 550 

CCAAGTGGAG AATACTTCAA TAGCAGTGAA TTCAAAATAG CGGATGCGAC 600 

20 GACTCGAGAT TGGGATGTTA ATGATGGTTT GACTACTGGT GGGATGATGA 650 

CTTTCTTACA AGGTTTCGCT CACTCCAGTA ATGTTGGAAT GAGTCTACTT 700 

GAACAAAAAA TGGGAGATGC TACTTGGTTG GATTATCTAA AACGCTTTAA 750 

ATTTGGGGTT CCAACTCGCT TTGGCTTGAC AGATGAATAC GCTGGTCAAC 800 

TTCCAGCTGA TAATATTGTT AGTATTGCTC AAAGCTCATT TGGGCAAGGA 850 

2 5 . ATTTCAGTGA CACAAACACA AATGCTTCGT GCCTTTACAG CTATTGCTAA 900 

TGATGGAGTT ATGCTGGAGC CAAAATTTAT AAGTGCTATT TATGATACTA 950 

ACAATCAGTC TGTACGTAAG TCACAAAAAG AAATAGTAGG AAATCCTGTT 1000 

TCCAAAGAGG CAGCAAGCAC AACTCGAAAT CACATGATCT TAGTTGGGAC 1050 

GGACCCTCTA TATGGAACTA TGTATAATCA CTACACAGGA AAGCCAATTA 1100 

30 TAACAGTTCC TGGACAAAAT GTAGCAGTTA AATCCGGTAC GGCTCAAATC 1150 

GCTGATGAGA AAAATGGAGG ATACTTGGTT GGTTCTACCA ATTATATTTT 12 00 

CTCAGTTGTG ACTATGAATC CTGCTGAAAA TCCTGATTTT ATCTTGTATG 1250 

TAACGGTTCA ACAGCCTGAG CATTATTCAG GTATCCAGTT GGGAGAATTT 1300 

GCCACCCCAA TCTTGGAGCG GGCTTCAGCT ATGAAAGAAT CTCTCAATCT 13 50 

35 TCAATCTCCA GCCAAAAATT TAGATAAAGT TACGACAGAA TCTTCTTATG 14 00 

CAATGCCTAG CATCAAGGAT ATTTCACCTG GTGAGTTGGC GGAAGCCTTA 14 50 

CGCCGAAATA TTGTGCAACC AATCGTTGTA GGTACTGGAA CAAAGATTAA 1500 

AGAGACTTCT GTAGAAGAAG GGACCAATCT TGCACCAAAC CAACAAGTTC 1550 

TCCTTTTATC GGATAAGGTA GAAGAAATTC CAGACATGTA TGGCTGGAAA 1600 

4 0 AAAGAGACTG CCGAGACCTT TGCTAAATGG TTGGATATTG AACTGGAATT 1650 

TGAAGGTTCA GGTTCCGTTG 1670 



45 2) INFORMATION FOR SEQ ID NO: 1035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1683 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1035 

580 
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AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA CAACCAGTCC 50 

CAATCGTAGT TACCCAAATG GACAATTTGC TTCTAGTTTT ATCGGTCTAG 100 

CTCAGCTCCA TGAAAATGAA GATGGAAGCA AGAGTTTGCT GGGAACCTCT 150 

GGAATGGAGA GTTCCTTGAA CAGTATTCTT GCAGGGACAG ACGGCATTAT 200 

5 TACCTATGAA AAGGATCGTC TGGGCAATAT TGTACCCGGA ACAGAACAAG 250 

TTTCCCAACA AACGGTGGAT GGCAAGGATG TTTACACAAC CATTTCCAGC 300 

CCCCTCCAGT CCTTCATGGA AACTCAGATG GATGCCTTTC TAGAAAAAGT 350 

AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG ACCGGTGAAA 400 

TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC TAAAGAAGGA 450 

10 ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA GTAACTATGA 500 

ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT ATTGATAATA 550 

ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT CAAAATAGCG 600 

GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA CTACTGGTGG 650 

GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT GTTGGAATGA 700 

15 GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA TTATCTAAAA 750 

CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG ATGAATACGC 800 

TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA AGCTCATTTG 850 

GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC CTTTACAGCT 900 

ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA GTGCTATTTA 950 

2 0 TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA ATAGTAGGAA 1000 

ATCCTGTTTC CAAAGAGGCA GCAAGCACAA CTCGAAATCA CATGATCTTA 1050 

GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT ACACAGGAAA 1100 

GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA TCCGGTACGG 1150 

CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG TTCTACCAAT 12 00 

25 TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC CTGATTTTAT 1250 

CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT ATCCAGTTGG 1300 

GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT GAAAGAATCT 1350 

CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA CGACAGAATC 1400 

TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT GAGTTGGCGG 1450 

30 AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG TACTGGAACA 1500 

AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG CACCAAACCA 1550 

ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA GACATGTATG 1600 

GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT GGATATTGAA 1650 

CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAG 1683 

35 



2) INFORMATION FOR SEQ ID NO: 1036 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036 

TCAAAAAAGA ATTGGAAACT GCAGAGGTCA AGGGGATTGA TTTTACAACC 50 

55 AGTCCCAATC GTAGTTACCC AAATGGACAA TTTGCTTCTA GTTTTATCGG 100 

TCTAGCTCAG CTCCATGAAA ATGAAGATGG AAGCAAGAGT TTGCTGGGAA 150 

CCTCTGGAAT GGAGAGTTCC TTGAACAGTA TTCTTGCAGG GACAGACGGC 200 

ATTATTACCT ATGAAAAGGA TCGTCTGGGC AATATTGTAC CCGGAACAGA 250 

ACAAGTTTCC CAACAAACGG TGGATGGCAA GGATGTTTAC ACAACCATTT 300 

60 CCAGCCCCCT CCAGTCCTTC ATGGAAACTC AGATGGATGC CTTTCTAGAA 350 

581 
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AAAGTAAAAG GTAAGTATAT GACCGCGACC TTGGTCAGTG CAAAGACCGG 4 00 

TGAAATCCTC GCTACCACCC AACGACCTAC CTTTAATGCA GATACTAAAG 450 

AAGGAATCAC TGAGGACTTT GTTTGGCGTG ATATTCTTTA TCAAAGTAAC 500 

TATGAACCAG GATCAGCCAT GAAGGTTATG ACGTTAGCTT CTTCTATTGA 550 

5 TAATAATACC TTCCCAAGTG GAGAATACTT CAATAGCAGT GAATTCAAAA 600 

TAGCGGATGC GACGACTCGA GATTGGGATG TTAATGATGG TTTGACTACT 650 

GGTGGGATGA TGACTTTCTT ACAAGGTTTC GCTCACTCCA GTAATGTTGG 700 

AATGAGTCTA CTTGAACAAA AAATGGGAGA TGCTACTTGG TTGGATTATC 750 

TAAAACGCTT TAAATTTGGG GTTCCAACTC GCTTTGGCTT GACAGATGAA 800 

10 TACGCTGGTC AACTTCCAGC TGATAATATT GTTAGTATTG CTCAAAGCTC 850 

ATTTGGGCAA GGAATTTCAG TGACACAAAC ACAAATGCTT CGTGCCTTTA 900 

CAGCTATTGC TAATGATGGA GTTATGCTGG AGCCAAAATT TATAAGTGCT 950 

ATTTATGATA CTAACAATCA GTCTGTACGT AAGTCACAAA AAGAAATAGT 1000 

AGGAAATCCT GTTTCCAAAG AGGCAGCAAG CACAACTCGA AATCACATGA 1050 

15 TCTTAGTTGG GACGGACCCT CTATATGGAA, CTATGTATAA TCACTACACA 1100 

GGAAAGCCAA TTATAACAGT TCCTGGACAA AATGTAGCAG TTAAATCCGG 1150 

TACGGCTCAA ATCGCTGATG AGAAAAATGG AGGATACTTG GTTGGTTCTA 1200 

CCAATTATAT TTTCTCAGTT GTGACTATGA ATCCTGCTGA AAATCCTGAT 1250 

TTTATCTTGT ATGTAACGGT TCAACAGCCT GAGCATTATT CAGGTATCCA 1300 

20 GTTGGGAGAA TTTGCCACCC CAATCTTGGA GCGGGCTTCA GCTATGAAAG 1350 

AATCTCTCAA TCTTCAATCT CCAGCCAAAA ATTTAGATAA AGTTACGACA 1400 

GAATCTTCTT ATGCAATGCC TAGCATCAAG GATATTTCAC CTGGTGAGTT 1450 

GGCGGAAGCC TTACGCCGAA ATATTGTGCA ACCAATCGTT GTAGGTACTG 1500 

GAACAAAGAT TAAAGAGACT TCTGTAGAAG AAGGGACCAA TCTTGCACCA 1550 

25 AACCAACAAG TTCTCCTTTT ATCGGATAAG GTAGAAGAAA TTCCAGACAT 1600 

GTATGGCTGG AAAAAAGAGA CTGCCGAGAC CTTTGCTAAA TGGTTGGATA 1650 

TTGAACTGGA ATTTGAAGGT TCAGGTTCCG TT 1682 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1037 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1681 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037 

CAAAAAAGAA TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAATCG TAGTTATCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TACTGGGAAC 150 

50 CTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGTA 200 

TTATTACCTA TGAAAAAGAC CGTGTAGGAA ATATCGTACC AGGTACAGAA 250 

CTGGTATCGC AACAAACTGT GGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

55 GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

60 GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 
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ATGAGTCTAC 
AAAACGCTTT 
ACGCTGGTCA 
TTTGGGCAAG 
5 AGCTATTGCT 
TTTATGATAC 
GGAAATCCTG 
CTTAGTTGGG 
GAAAGCCAAT 

10 ACGGCTCAAA 
CAATTATATT 
TTATCTTGTA 
TTGGGAGAAT 
ATCTCTCAAT 

15 AATCTTCTTA 
GCGGAAGCCT 
AACAAAGATT 
ACCAACAAGT 
TATGGCTGGA 

2 0 TGAACTGGAA 



TTGAACAAAA 
AAATTTGGGG 
ACTTCCAGCT 
GAATTTCAGT 
AATGATGGAG 
TAACAATCAG 
TTTCCAAAGA 
ACGGACCCTC 
TATAACAGTT 
TCGCTGATGA 
TTCTCAGTTG 
TGTAACGGTT 
TTGCCACCCC 
CTTCAATCTC 
TGCAATGCCT 
TACGCCGAAA 
AAAGAGACTT 
TCTCCTTTTA 
AAAAAGAGAC 
TTTGAAGGTT 



AATGGGAGAT 
TTCCAACTCG 
GATAATATTG 
GACACAAACA 
TTATGCTGGA 
TCTGTACGTA 
GGCAGCAAGC 
TATATGGAAC 
CCTGGACAAA 
GAAAAATGGA 
TGACTATGAA 
CAACAGCCTG 
AATCTTGGAG 
CAGCCAAAAA 
AGCATCAAGG 
TATTGTGCAA 
CTGTAGAAGA 
TCGGATAAGG 
TGCCGAGACC 
CAGGTTCCGT 



GCTACTTGGT 
CTTTGGCTTG 
TTAGTATTGC 
CAAATGCTTC 
GCCAAAATTT 
AGTCACAAAA 
ACAACTCGAA 
TATGTATAAT 
ATGTAGCAGT 
GGATACTTGG 
TCCTGCTGAA 
AGCATTATTC 
CGGGCTTCAG 
TTTAGATAAA 
ATATTTCACC 
CCAATCGTTG 
AGGGACCAAT 
TAGAAGAAAT 
TTTGCTAAAT 
T 



PCT/CA00/01150 

TGGATTATCT 750 

ACAGATGAAT 800 

TCAAAGCTCA 850 

GTGCCTTTAC 900 

ATAAGTGCTA 950 

AGAAATAGTA 1000 

ATCACATGAT 1050 

CACTACACAG 1100 

TAAATCCGGT 1150 

TTGGTTCTAC 1200 

AATCCTGATT 1250 

AGGTATCCAG 1300 

CTATGAAAGA 1350 

GTTACGACAG 1400 

TGGTGAGTTG 1450 

TAGGTACTGG 1500 

CTTGCACCAA 1550 

TCCAGACATG 1600 

GGTTGGATAT 1650 
1681 



25 



2) INFORMATION FOR SEQ ID NO: 1038 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038 

4 0 AAAAAAGAAT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTATCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT ACTGGGAACC 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGTAT 200 

TATTACCTAT GAAAAAGACC GTGTAGGAAA TATCGTACCA GGTACAGAAC 250 

4 5 TGGTATCGCA ACAAACTGTG GATGGCAAGG ATGTTTATAC AACATTGTCT 300 

AGTCCGCTAC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 350 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 4 00 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

50 TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

55 AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

60 GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 
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TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 1200 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 1250 

5 TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 1300 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAA 1350 

TCTCTCAATC TTCAATCTCC AGCCAAAAAT TTAGATAAAG TTACGACAGA 1400 

ATCTTCTTAT GCAATGCCTA GCATCAAGGA TATTTCACCT GGTGAGTTGG 1450 

CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC CAATCGTTGT AGGTACTGGA 1500 

10 ACAAAGATTA AAGAGACTTC TGTAGAAGAA GGGACCAATC TTGCACCAAA 1550 

CCAACAAGTT CTCCTTTTAT CGGATAAGGT AGAAGAAATT CCAGACATGT 1600 

ATGGCTGGAA AAAAGAGACT GCCGAGACCT TTGCTAAATG GTTGGATATT 1650 

GAACTGGAAT TTGAAGGTTC AGGTTCCGTT GTTCA 1685 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1039 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1679 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039 



AAAAGAATTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGACTA 100 

35 GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTAC TGGGAACCTC 150 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGTATTA 2 00 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 350 

4 0 TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

45 GGATGCGACG ACTCGAGATT GGGATGTTAA TGATGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

50 GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

55 AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

60 TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1400 
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CTTCTTATGC AATGCCTAGC ATCAAGGATA 
GAAGCCTTAC GCCGAAATAT TGTGCAACCA 
AAAGATTAAA GAGACTTCTG TAGAAGAAGG 
AACAAGTTCT CCTTTTATCG GATAAGGTAG 
5 GGCTGGAAAA AAGAGACTGC CGAGACCTTT 
ACTGGAATTT GAAGGTTCAG GTTCCGTTG 



PCT/CAOO/01150 

TTTCACCTGG TGAGTTGGCG 1450 
ATCGTTGTAG GTACTGGAAC 1500 
GACCAATCTT GCACCAAACC 1550 
AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 

1679 



10 2) INFORMATION FOR SEQ ID NO: 1040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:, Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040 



TAAAAAGGAT CTAAAAGACG CTAGTGTTGA AGGAATTGAC TTCACAACTA 50 

GCCCTAATAG AAGCTATCCA AATGGACAAT TCGCTTCTAG TTTTATTGGT 100 

TTGGCCCAAC TCCATGAAAA TGAGGATGGT AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGGATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 200 

30 TTATTACCTA TGAAAAAGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 4 00 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

35 AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGAAGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

40 ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

45 TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 12 00 

50 CAATTATATT TTCTCAGCTG TGACTATGAA TCCTGCTGAA AATCCTGATT 12 50 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 14 50 

55 GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT CGTTC 1685 
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2) INFORMATION FOR SEQ ID NO: 1041 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041 



GTCTATCAAA 
20 CAACCAGTCC 
ATCGGACTAG 
GGGAACCTCT 
ACGGTATTAT 
ACAGAACTGG 

2 5 ATTGTCTAGT 

TAGAAAAAGT 
ACCGGTGAAA 
TAAAGAAGGA 
GTAACTATGA 
30 ATTGATAATA 
CAAAATAGCG 
CTACTGGTGG 
GTTGGAATGA 
TTATCTAAAA 

3 5 ATGAATACGC 

AGCTCATTTG 
CTTTACAGCT 
GTGCTATTTA 
ATAGTAGGAA 

40 CATGATCTTA 
ACACAGGAAA 
TCCGGTACGG 
TTCTACCAAT 
CTGATTTTAT 

45 ATCCAGTTGG 
GAAAGAATCT 
CGACAGAATC 
GAGTTGGCGG 
TACTGGAACA 

50 CACCAAACCA 
GACATGTATG 
GGATATTGAA 



AAAGAATTGG 
CAATCGTAGT 
CTCAGCTCCA 
GGAATGGAGA 
TACCTATGAA 
TATCGCAACA 
CCGCTACAAT 
AAAAGGTAAG 
TCCTCGCTAC 
ATCACTGAGG 
ACCAGGATCA 
ATACCTTCCC 
GATGCGACGA 
GATGATGACT 
GTCTACTTGA 
CGCTTTAAAT 
TGGTCAACTT 
GGCAAGGAAT 
ATTGCTAATG 
TGATACTAAC 
ATCCTGTTTC 
GTTGGGACGG 
GCCAATTATA 
CTCAAATCGC 
TATATTTTCT 
CTTGTATGTA 
GAGAATTTGC 
CTCAATCTTC 
TTCTTATGCA 
AAGCCTTACG 
AAGATTAAAG 
ACAAGTTCTC 
GCTGGAAAAA 
CTGGAATTTG 



AAACTGCAGA 
TATCCAAACG 
TGAAAATGAA 
GTTCCTTGAA 
AAAGACCGTG 
AACTGTGGAT 
CTTTCATGGA 
TATATGACCG 
CACCCAACGA 
ACTTTGTTTG 
GCCATGAAGG 
AAGTGGAGAA 
CTCGAGATTG 
TTCTTACAAG 
ACAAAAAATG 
TTGGGGTTCC 
CCAGCTGATA 
TTCAGTGACA 
ATGGAGTTAT 
AATCAGTCTG 
CAAAGAGGCA 
ACCCTCTATA 
ACAGTTCCTG 
TGATGAGAAA 
CAGTTGTGAC 
ACGGTTCAAC 
CACCCCAATC 
AATCTCCAGC 
ATGCCTAGCA 
CCGAAATATT 
AGACTTCTGT 
CTTTTATCGG 
AGAGACTGCC 
AAGGTTCAGG 



GGTCAAGGGG 
GACAATTTGC 
GATGGCAGCA 
CAGTATTCTT 
TAGGAAATAT 
GGCAAGGATG 
AACTCAGATG 
CGACCTTGGT 
CCTACCTTTA 
GCGTGATATT 
TTATGACGTT 
TACTTCAATA 
GGATGTTAAT 
GTTTCGCTCA 
GGAGATGCTA 
AACTCGCTTT 
ATATTGTTAG 
CAAACACAAA 
GCTGGAGCCA 
TACGTAAGTC 
GCAAGCACAA 
TGGAACTATG 
GACAAAATGT 
AATGGAGGAT 
TATGAATCCT 
AGCCTGAGCA 
TTGGAGCGGG 
CAAAAATTTA 
TCAAGGATAT 
GTGCAACCAA 
AGAAGAAGGG 
ATAAGGTAGA 
GAGACCTTTG 
TTCCGTTGTT 



ATTGATTTTA 


50 


TTCTAGTTTT 


100 


AGAGCTTACT 


150 


GCAGGGACAG 


200 


CGTACCAGGT 


250 


TTTATACAAC 


300 


GATGCCTTTC 


350 


CAGTGCAAAG 


400 


ATGCAGATAC 


450 


CTTTATCAAA 


500 


AGCTTCTTCT 


550 


GCAGTGAATT 


600 


GATGGTTTGA 


650 


CTCCAGTAAT 


700 


CTTGGTTGGA 


750 


GGCTTGACAG 


800 


TATTGCTCAA 


850 


TGCTTCGTGC 


900 


AAATTTATAA 


950 


ACAAAAAGAA 


1000 


CTCGAAATCA 


1050 


TATAATCACT 


1100 


AGCAGTTAAA 


1150 


ACTTGGTTGG 


1200 


GCTGAAAATC 


1250 


TTATTCAGGT 


1300 


CTTCAGCTAT 


1350 


GATAAAGTTA 


1400 


TTCACCTGGT 


1450 


TCGTTGTAGG 


1500 


ACCAATCTTG 


1550 


AGAAATTCCA 


1600 


CTAAATGGTT 


1650 


CAGAAG 


1696 



55 



60 



2) INFORMATION FOR SEQ ID NO: 1042 

. <i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 
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10 



WO 01/23604 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042 



AAAAAAGAGT 
TCCCAATCGT 
TAGCTCAGCT 

15 TCTGGAATGG 
TATTACCTAT 
AAGTTTCCCA 
AGCACCCTTC 
AGTAAAAGGT 

20 AAATCCTCGC 
GGAATCACTG 
TGAACCAGGA 
ATAATACCTT 
GCGGATGCGA 

25 TGGGATGATG 
TGAGTCTACT 
AAACGCTTTA 
CGCTGGTCAA 
TTGGGCAAGG 

30 GCTATTGCTA 
TTATGATACT 
GAAATCCTGT 
TTAGTTGGGA 
AAAGCCAATT 

35 CGGCTCAAAT 
AATTATATTT 
TATCTTGTAT 
TGGGAGAATT 
TCTCTCAATC 

40 AAGTCCTTAT 
CAGAAGAATT 
ACGAAGATTA 
TCAGCAAGTC 
ATGGTTGGAC 

45 GAACTTGAAT 



TGGAAACTGC 
AGTTACCCAA 
CCATGAAAAT 
AGAGTTCCTT 
GAAAAGGATC 
ACAAACTGTA 
AGTCCTTCAT 
AAGTATATGA 
TACCACCCAA 
AGGACTTTGT 
TCAGCCATGA 
CCCAAGTGGA 
CGACTCGAGA 
ACTTTCTTAC 
TGAACAAAAA 
AATTTGGGGT 
CTTCCAGCTG 
AATTTCAGTG 
ATGATGGAGT 
AACAATCAGT 
TTCCAAAGAG 
CGGACCCTCT 
ATAACAGTTC 
CGCTGATGAG 
TCTCAGTTGT 
GTAACGGTTC 
TGCCACCCCA 
TTCAAACAAC 
CCTATGCCTA 
GCGTCGCAAT 
AAAACAGTTC 
CTTATCTTAT 
AAAGGAGACT 
TTCAAGGTTC 



AGAGGTCAAG 
ACGGACAATT 
GAAGATGGCA 
GAACAGTATT 
GTCTGGGAAA 
GATGGCAAGG 
GGAAACTCAG 
CCGCGACCTT 
CGACCTACCT 
TTGGCGTGAT 
AGGTTATGAC 
GAATACTTCA 
TTGGGATGTT 
AAGGTTTCGC 
ATGGGAGATG 
TCCAACTCGC 
ATAATATTGT 
ACACAAACAC 
TATGCTGGAG 
CTGTACGTAA 
GCAGCAAGCA 
ATATGGAACT 
CTGGACAAAA 
AAAAATGGAG 
GACTATGAAT 
AACAGCCTGA 
ATCTTGGAGC 
AGCTAAAGCT 
GTGTCAAGGA 
CTTGTACAAC 
TGCTGAAGAA 
CTGATAAAGT 
GCTGAGACCC 
GGGTTCTACT 



GGGATTGATT 
TGCTTCTAGT 
GCAAGAGCTT 
CTTGCAGGGA 
TATTGTCCCC 
ATGTTTATAC 
ATGGATGCCT 
GGTCAGTGCA 
TTAATGCAGA 
ATTCTTTATC 
GTTAGCTTCT 
ATAGCAGTGA 
AATGATGGTT 
TCACTCCAGT 
CTACTTGGTT 
TTTGGCTTGA 
TAGTATTGCT 
AAATGCTTCG 
CCAAAATTTA 
GTCACAAAAA 
CAACTCGAAA 
ATGTATAATC 
TGTAGCAGTT 
GATACTTGGT 
CCTGCTGAAA 
GCATTATTCA 
GGGCTTCAGC 
TTGGAGCAAG 
TATTTCACCT 
CCATCGTTGT 
GGGAAGAATC 
AGAGGAAGTT 
TTGCTAAGTG 
GTGCAGAAG 



TTACAACCAG 50 

TTTATCGGAC 100 

GCTGGGAACC 150 

CAGACGGCAT 200 

GGAACGGAAC 250 

GACTATTTCC 300 

TTCTAGAAAA 350 

AAGACCGGTG 400 

TACTAAAGAA 450 

AAAGTAACTA 500 

TCTATTGATA 550 

ATTCAAAATA 600 

TGACTACTGG 650 

AATGTTGGAA 700 

GGATTATCTA 750 

CAGATGAATA 800 

CAAAGCTCAT 850 

TGCCTTTACA 900 

TAAGTGCTAT 950 

GAAATAGTAG 1000 

TCACATGATC 1050 

ACTACACAGG 1100 

AAATCCGGTA 1150 

TGGTTCTACC 1200 

ATCCTGATTT 1250 

GGTATCCAGT 1300 

TATGAAAGAC 1350 

TAAGTCAACA 1400 

GGTGATTTAG 1450 

GGGAACAGGA 1500 

TTGCCCCGAA 1550 

CCAGATATGT 1600 

GCTCAATATA 1650 
1689 



50 



2) INFORMATION FOR SEQ ID NO: 1043 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus pneumoniae 

587 
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(B) STRAIN: StrR-10 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043 

5 CAAAAAAGAG TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCTAATCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGT 100 

CTAGCTCAAC TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 2 00 

TTATTACTTA TGAAAAGGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

10 CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTC CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

15 ATGAACCAGG ATCAGCCATG AAGGTTATGA' CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

20 AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

25 GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

30 TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 1300 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

35 AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 1690 



40 



2) INFORMATION FOR SEQ ID NO: 1044 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

< 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044 



AGTTGGAAAC TGCAGAGGTC AAGGGGATTG ATTTTACAAC CAGTCCTAAT 50 

CGTAGTTACC CAAACGGACA ATTTGCTTCT AGTTTTATCG GACTAGCTCA 100 

60 GCTCCATGAA AATGAAGATG GCAGCAAGAG CTTGCTGGGA ACCTCTGGGA 150 

588 
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TGGAGAGTTC TTTAAATAGA ATTCTTGCAG GGACAGACGG CATTATTACC 200 

TATGAAAAGG ATCGTCTGGG AAATATTGTC CCCGGAACGG AACAAGTTTC 250 

CCAACAAACG GTAGATGGCA AGGATGTTTA TACGACTATT TCCAGCACCC 300 

TTCAGTCCTT CATGGAGACC CAGATGAATG CCTTTCAAGA AAAAGTAAAA 350 

5 GGCAAGTATA TGACGGCTAC CTTGGTCAGT GCTAAAACAG GGGAAATTCT 400 

TGCAACAACG CAACGGCCGA CCTTCGATGC TGATACTAAG GAAGGACTTA 450 

CCAAGGACTT TGTTTGGCGT GATATCCTCT ATCAAAGTAA CTATGAGCCA 500 

GGGTCAACCA TGAAGGTCAT GACGCTTGCT GCTGCTATTG ATAATAACAC 550 

TTTCCCAGGA GGAGAAGTTT TCAATAGTAG TGAATTAAAA ATAGCGGATG 600 

10 CGACAATTCG AGATTGGGAT GTTAATGATG GTTTGACGAC TGGTGGGATG 650 

ATGACTTTCT TACAAGGTTT CGCTCACTCC AGTAATGTTG GAATGAGTCT 700 

ACTTGAACAA AAAATGGGAG ATGCTACTTG GTTGGATTAT CTAAACCGCT 750 

TTAAGTTTGG GGTGCCGACG CGTTTTGGTC TGACTGATGA GTATTCAGGT 800 

CAATTGCCTG CAGATAATAT TGTTAATATT GCCATGAGTG CATTTGGTCA 850 

15 GGGGATTTCA GTGACCCAGA CGCAAATGAT TCGTGCCTTT ACAGCTATTG 900 

CTAATGATGG TGTTATGCTG GAGCCTAAAT TTATTAGTGC CATTTATGAT 950 

CCAAATGATC AAACTGCTCG GAAATCTCAA AAAGAAATTG TGGGAAATCC 1000 

TGTTTCTAAA GATGCAGCTA* GTCTAACTCG GACTAACATG ATTTTGGTAG 1050 

GGACGGATCC GGTTTATGGA ACCATGTATA ACCACAGCAC AGGTAAGCCA 1100 

2 0 ACTGTAACTG TTCCTGGGCA AAATGTAGCC CTCAAGTCTG GTACGGCTCA 1150 

GATTGCTGAC GAGAAAAATG GTGGTTATCT AGTCGGGTTA ACCAACTATA 1200 

TTTTCTCGGC TGTATCGATG AATCCGGCTG AAAATCCTGA TTTTATCTTG 1250 

TATGTAACGG TACAGCAACC TGAACATTAT TCAGGTATCC AGTTGGGAGA 1300 

ATTTGCCACC CCAATCTTGG AGCGGGCTTC AGCTATGAAA GAATCTCTCA 1350 

25 ATCTTCAATC TCCAGCCAAG AATTTAGATA AAGTTACGAC AGAATCTTCT 1400 

TATGCAATGC CTAGCATCAA GGATATTTCA CCTGGTGAGT TGGCGGAAGC 14 50 

CTTACGCCGA AATATTGTGC AACCAATCGT TGTAGGTACT GGAACAAAGA 1500 

TTAAAGAGAC TTCTGTAGAA GAAGGGACCA ATCTTGCACC AAACCAACAA 1550 

GTTCTCCTTT TATCGGATAA GGTAGAAGAA ATTCCAGACA TGTATGGCTG 1600 

30 GAAAAAAGAG ACTGCTGAAA CCTTTGCTAA ATGGTTGGAT ATTGAGCTGG 1650 

AATTTGAAGG GTCAGGTT 1668 



35 2) INFORMATION FOR SEQ ID NO: 1045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045 



TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA GTCCCAATCG 50 

TAGTTACCCA AATGGACAAT TTGCTTCTAG TTTTATCGGT CTAGCTCAGC 100 

TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC CTCTGGAATG 150 

GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA TTATTACCTA 200 

55 TGAAAAGGAT CGTCTGGGCA ATATTGTACC CGGAACAGAA CAAGTTTCCC 2 50 

AACAAACGGT GGATGGCAAG GATGTTTACA CAACCATTTC CAGCCCCCTC 3 00 

CAGTCCTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA AAGTAAAAGG 350 

TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT GAAATCCTCG 4 00 

CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA AGGAATCACT 4 50 

60 GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT ATGAACCAGG 500 

589 
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ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT AATAATACCT 550 

TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT AGCGGATGCG 600 

ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG GTGGGATGAT 650 

GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA ATGAGTCTAC 700 

5 TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT AAAACGCTTT 750 

AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT ACGCTGGTCA 800 

ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA TTTGGGCAAG 850 

GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC AGCTATTGCT 900 

AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA TTTATGATAC 950 

10 TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA GGAAATCCTG 1000 

TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT CTTAGTTGGG 1050 

ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG GAAAGCCAAT 1100 

TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT GCGGCTCAAA 1150 

TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC CAATTATATT 1200 

15 TTCTCAGTTG TGACTATGAA TCCTGCTGAA. AATCCTGATT TTATCTTGTA 1250 

TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG TTGGGAGAAT 1300 

TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA ATCTCTCAAT 1350 

CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG AATCTTCTTA 1400 

TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG GCGGAAGCCT 1450 

20 TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG AACAAAGATT 1500 

AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA ACCAACAAGT 1550 

TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG TATGGCTGGA 1600 

AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT TGAACTGGAA 1650 

TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 1680 

25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1046 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 



AAAAAAGAGT 

45 TCCTAATCGT 
TAGCTCAACT 
TCTGGAATGG 
TATTACTTAT 
AAGTTTCCCA 

50 AGTCCGCTCC 
AGTAAAAGGT 
AAATCCTCGC 
GGAATCACTG 
TGAACCAGGA 

55 ATAATACCTT 
GCGGATGCGA 
TGGGATGATG 
TGAGTCTACT 
AAACGCTTTA 

60 CGCTGGTCAA 



TGGAAACTGC 
AGTTACCCAA 
CCATGAAAAT 
AGAGTTCCTT 
GAAAAGGATC 
ACAAACGGTA 
AATCTTTCAT 
AAGTATATGA 
TACCACCCAA 
AGGACTTTGT 
TCAGCCATGA 
CCCAAGTGGA 
CGACTCGAGA 
ACTTTCTTAC 
TGAACAAAAA 
AATTTGGGGT 
CTTCCAGCTG 



AGAGGTCAAG 
ACGGACAATT 
GAAGATGGAA 
GAACAGTATT 
GTCTGGGTAA 
GATGGCAAGG 
GGAAACTCAG 
CCGCGACCTT 
CGACCTACCT 
TTGGCGTGAT 
AGGTTATGAC 
GAATACTTCA 
TTGGGATGTT 
AAGGTTTCGC 
ATGGGAGATG 
TCCAACTCGC 
ATAATATTGT 

590 



ID NO: 1046 




GGGATTGATT 


TTACAACCAG 


50 


TGCTTCTAGT 


TTTATCGGTC 


100 


GCAAGAGTTT 


GCTGGGAACT 


150 


CTTGCAGGGA 


AAGACGGTAT 


200 


TATTGTCCCT 


GGAACAGAAC 


250 


ATGTTTATAC 


AACATTGTCT 


300 


ATGGATGCCT 


TTCTAGAAAA 


350 


GGTCAGTGCA 


AAGACCGGTG 


400 


TTAATGCAGA 


TACTAAAGAA 


450 


ATTCTTTATC 


AAAGTAACTA 


500 


GTTAGCTTCT 


TCTATTGATA 


550 


ATAGCAGTGA 


ATTCAAAATA 


600 


AATGATGGTT 


TGACTACTGG 


650 


TCACTCCAGT 


AATGTTGGAA 


700 


CTACTTGGTT 


GGATTATCTA 


750 


TTTGGCTTGA 


CAGATGAATA 


800 


TAGTATTGCT 


CAAAGCTCAT 


850 
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TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

5 TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 1200 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 1250 

TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 1300 

10 TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAA 1350 

TCTCTCAATC TTCAATCTCC AGCCAAAAAT TTAGATAAAG TTACGACAGA 1400 

ATCTTCTTAT GCAATGCCTA GCATCAAGGA TATTTCACCT GGTGAGTTGG "1450 

CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC CAATCGTTGT AGGTACTGGA 1500 

ACAAAGATTA AAGAGACTTC TGTAGAAGAA GGGACCAATC TTGCACCAAA 1550 

15 CCAACAAGTT CTCCTTTTAT CGGATAAGGT AGAAGAAATT CCAGACATGT 1600 

ATGGCTGGAA AAAAGAGACT GCCGAGACCT TTGCTAAATG GTTGGATATT 1650 

GAACTGGAAT TTGAAGGTTC AGGTTCCGTT GTTCAGAAG 1689 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1047 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047 

CAAAAAAGAA TTGGAAGCTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAACCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TGCTGGGAAC 150 

4 0 TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA 200 

TTATTACCTA TGAAAAGGAT CGTCTGGGTA ATATTGTACC CGGAACAGAA 250 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTACA CAACCATTTC 300 

CAGCCCCCTC CAGTCCTTTA TGGAAACCCA GATGGATGCT TTTCAAGAGA 350 

AGGTAAAAGG AAAGTACACG ACAGCGACTT TGGTCAGTGC TAAAACAGGG 400 

4 5 GAAATTCTGG CAACAACGCA ACGACCGACC TTTGATGCAG ATACAAAAGA 4 50 

AGGCATTACA GAGGACTTTG TTTGGCGTGA TATCCTTTAC CAAAGTAACT 500 

ATGAGCCAGG TTCCCCTATG AAAGTGATGA TGTTGGCTGC TGCTATTGAT 550 

AATAATACCT TTCCAGGGGG AGAAGTCTTC AATAGTAGTG AGTTAAAAAT 600 

TGCAGATGTC ACGATTCGAG ATTGGGACGT CAATGAAGGA TTGACTGGTG 650 

50 GCAGAATGAT GACCTTTTCT CAAGGGTTCG CTCACTCAAG TAACGTTGGG 700 

ATGACGCTTC TTGAGCAAAA GATGGGAGAT GCTACATGGC TTGATTATCT 750 

AAATCGCTTT AAATTTGGTG TTCCGACTCG TTTTGGCTTG ACGGATGAAT 8 00 

ATGCAGGCCA ACTTCCAGCT GACAATATCG TAAATATTGC TCAGAGTTCA 850 

TTTGGACAAG GGATTTCAGC GACCCAGACG CAAATGATTC GTGCCTTCAC 900 

55 GGCTATTGCC AACGATGGAG TCATGTTAGA ACCTAAATTT ATCAGTGCCA 950 

TTTATGATCC AAATGATCAA ACTGCTCGGA AATCACAAAA AGAAGTTGTG 1000 

GGAAATCCTG TGTCTAAAGA TGCAGCGAGC TTGACGCGAA CGCATATGGT 1050 

TTTAGTCGGT ACCGATCCAG TATATGGAAC TATGTATAAT CATAAGACAG 1100 

GGAAACCAAC TGTAACTGTT CCTGGGCAAA ATGTAGCCCT CAAGTCTGGT 1150 

60 ACGGCTCAGA TTGCCGATGA GAAAAATGGA GGTTACTTAG TTGGTACGAC 1200 

591 
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CAATTACATT TTTTCGGCTG 
TTATTCTCTA TGTGACGGTT 
TTGGGGGAAT TTGCCAATCC 
TTCCCTTAAC CTCCAATCTA 
5 AAAGCGCTTA TGCCATGCCT 
GCGGAAGCCT TACGTCGCAA 
AACAAAAATT AAAGAATCAT 
ACCAGCAAGT TCTTCTCTTA 
TATGGTTGGA CAAAAGAGAC 
10 AGAACTTGAA TTTGAAGGTT 



TATCGATGAA CCCTGCTGAA 
CAACAGCCTG AGCATTATTC 
TATCTTGGAA AGGGCAGTGG 
CCGCTAAAAC GTTAAATCAG 
AGCATCAAGG ACATTTCACC 
TATTGTGCAA CCAATCGTTG 
CTGTAGAAGA AGGGACGGAT 
TCTGATAAAG CAGAGGAAGT 
TGCTGAGACC TTTGCTAAGT 
CGGGCTCTAC TGTGCAGAAG 



PCT/CA00/01150 

AATCCTGATT 1250 

AGGTATTCAG 1300 

CTATGAAAGA 1350 

GTAACCAATC 14 00 

TGGCGATTTG 1450 

TAGGAACAGG 1500 

CTTGCACCTA 1550 

TCCAGATATG 1600 

GGCTCAATAT 1650 
1690 



2) INFORMATION FOR SEQ ID NO: 1048 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048 

30 AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 
CCAATCGTAG TTACCCAAAC GGACAATTTG CTTCTAGCTT TATTGGCTTA . 100 

GCCCAACTTC ATGAAAATGA GGATGGTAGT AAGAGTTTGT TAGGGACTTC 150 

TGGTTTGGAG AGTTCTTTAA ATACCATTCT TGCTGGGACA GACGGTATTA 200 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

35 GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 350 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

40 AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAAGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 7 00 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

45 ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

50 AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

55 TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1400 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 1450 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1500 

60 AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 1550 

592 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AACAAGTTCT CCTTTTATCG GATAAGGTAG AAGAAATTCC AGACATGTAT 
GGCTGGAAAA AAGAGACTGC TGAAACCTTT GCTAAATGGT TGGATATTGA 
GTTGGAATTT GAAGGTTCAG GTTCCGTCGT TC 



PCT/CA00/01150 

1600 
1650 
1682 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 1049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : 1 Enterococcus faecium 

(B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049 



GCAGCGTTGC 
TGAAAGGAGA 

25 GGTTGCTCAG 
CGCTAACATT 
AATCTGGTGT 
GACAATTGCT 
ACTTGTTAAA 

3 0 TTTCAGCTTT 
TTTGAATTGT 
AATTTGTATG 
TAGCTACTCC 
GCTACGTTTA 

35 CTTCGGTGTG 
AATCGGCAAG 
GGCTGTGAGG 
TGGCGAGGTG 
AGGAAGTCGA 

40 GCAGACCTTT 
AATATATAAA 
TACAAGATAA 
TTCACGTCAT 
ACTTCCCGAA 

45 AAGCATGGAA 
GTTGGGACGC 
GACGGTTATG 



GTGATACCGT 
CAGGAGCATG 
AGGAGCATGA 
AATAAAGAAA 
ATGGAAAATG 
ATTCAGCTGT 
AAGAACCATG 
GCATGGCAAG 
CCGGTATCCC 
GACAAATCGT 
CGCCTTTTGG 
CCTATCCTGT 
AAAAAAGTCA 
ACAATATGAC 
TCGGTTGTGC 
GACCAAATCA 
GCCGGAAAAA 
CAGCAGAGGA 
GCGCTCGGCT 
CGGCCGCATT 
ACAGTCGTTA 
CTGATTGACC 
ATAGGATTTA 
TAAATATGCC 
AAGTAAATCG 



TGAAAAAACC 
AATAGAATAA 
CGTATCGGTA 
AATACGAGCC 
TGCGAAAAAC 
ACTCTCGCCG 
AATATGAAAT 
TCAGGTGAAG 
TTTTGTAGGC 
TGACATACAT 
GTTATTAATA 
TTTTGTTAAG 
ATAGCGCGGA 
AGCAAAATCT 
GGTATTGGGA 
GGCTGCAGTA 
GGCTCTGAAA 
GCGAGGACGG 
GTAGAGGTCT 
GTACTGAACG 
TCCCCGTATG 
GCTTGATCGT 
CTTTTTTAGA 
ACTTGGGATA 
CATTGTAGGG 



ID NO: 1049 




ATTAAAAACT 


GTTTGGATTT 


50 


AAGTTGCAAT 


ACTGTTTGGG 


100 


AAATCTGCAA 


TAGAGATAGC 


150 


GTTATACATT 


GGAATTACGA 


200 


CTTGCGCGGA 


ATGGGAAAAC 


250 


GATAAAAAAA 


TGCACGGATT 


300 


CAACCATGTT 


GATGTAGCAT 


350 


ATGGATCCAT 


ACAAGGTCTG 


400 


TGCGATATTC 


AAAGCTCAGC 


450 


CGTTGCGAAA 


AATGCTGGGA 


500 


AAGATGATAG 


GCCGGTGGCA 


550 


CCGGCGCGTT 


CAGGCTCATC 


600 


CGAATTGGAC 


TACGCAATTG 


650 


TAATTGAGCA 


GGCTGTTTCG 


700 


AACAGTGCCG 


CGTTAGCTGT 


750 


CGGAATCTTT 


CGTATTCATC 


800 


ACGCAGTTAT 


AACCGTTCCC 


850 


ATACAGGAAA 


CGGCAAAAAA 


900 


AGCCCGTGTG 


GATATGTTTT 


950 


AAGTCAATAC 


TCTGCCCGGT 


1000 


ATGGCCGCTG 


CAGGTATTGC 


1050 


ATTAGCGTTA 


AAGGGGTGAT 


1100 


TGAAATAGTA 


CACGGTGTTC 


1150 


ATTTCACCGG 


AAAACCGGTT 


1200 


ACATACGAAT 


T 


1241 



50 



60 



2) INFORMATION FOR SEQ ID NO: 1050 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1249 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

593 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

10 GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

15 CAGGTGAAGA TGGATCCATA CAAGGTCTGT- TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

20 TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

25 CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

30 TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCG 124 9 



35 2) INFORMATION FOR SEQ ID NO: 1051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1272 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R481 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051 



TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 50 

TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 100 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 150 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 200 

55 GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 250 

TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 300 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 350 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 4 00 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 450 

60 AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 500 

594 
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ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 
CCGGTGGCAG CTACGTTTAC CTATCCTGTT 
AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 
ACGCAATTGA ATCGGCAAGA CAATATGACA 
GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 
GTTAGTTGTT GGCGAGGTGG ACCAAATCAG 
GTATTCATCA GGAAGTCGAG CCGGAAAAAG 
ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 
GGCAAAAAAA ATATATAAAG CGCTCGGCTG 
ATATGTTTTT ACAAGATAAC GGCCGCATTG 
CTGCCCGGTT TCACGTCATA CAGTCGTTAT 
AGGTATTGCA CTTCCCGAAC TGATTGACCG 
AGGGGTGATA AGCATGGAAA TAGGATTTAC 
ACGGTGTTCG TTGGGACGCT AAATATGCCA 
AAACCGGTTG ACGGTTATGA AGTAAATCGC 
GGCTTGAATC GCTTTTTGAA GG 



PCT/CAOO/01150 

TTATTAATAA AGATGATAGG 550 

TTTGTTAAGC CGGCGCGTTC 600 

TAGCGCGGAC GAATTGGACT 650 

GCAAAATCTT AATTGAGCAG 700 

GTATTGGGAA ACAGTGCCGC 750 

GCTGCAGTAC GGAATCTTTC 800 

GCTCTGAAAA CGCAGTTATA 850 

CGAGGACGGA TACAGGAAAC 900 

TAGAGGTCTA GCCCGTGTGG 950 

TACTGAACGA AGTCAATACT 1000 

CCCCGTATGA TGGCCGCTGC . 1050 

CTTGATCGTA TTAGCGTTAA 1100 

TTTTTTAGAT GAAATAGTAC 1150 

CTTGGGATAA TTTCACCGGA 1200 

ATTGTAGGGA CATACGAATT 1250 

1272 



20 2) INFORMATION FOR SEQ ID NO: 1052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R4 92 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052 



TCCCCCGGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

40 ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 4 00 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 4 50 

45 TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

50 GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

55 TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 110 0 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

60 CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTA 1237 

595 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2 ) INFORMATION FOR SEQ ID NO: 1053 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053 

20 CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG TTGAAAAAAC 50 

CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT GAATAGAATA 100 

AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG ACGTATCGGT 150 

AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA AAATACGAGC 200 

CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 250 

25 CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC 300 

GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA 350 

TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA GTCAGGTGAA 400 

GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC CTTTTGTAGG 450 

CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG TTGACATACA 500 

30 TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 550 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA 600 

GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG 650 

ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA CAGCAAAATC 700 

TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG CGGTATTGGG 750 

35 AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC AGGCTGCAGT 800 

ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 850 

AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG 900 

GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC 950 

TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT TGTACTGAAC 1000 

4 0 GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT ATCCCCGTAT 1050 

GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC CGCTTGATCG 1100 

TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 1150 

ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC CACTTGGGAT 1200 

AATTTCACCG GAAAACCGGT TGACGGGTAT GAAAGTAAAT CGCATTGTAG 1250 

45 GGACATTCGA ATT 12 63 



50 



2) INFORMATION FOR SEQ ID NO: 1054 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus faecalis 

596 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054 

5 CGGCCTATTA TNCCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT 50 

AAAAACTGTT TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG 100 

TTGCAATACT GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA 150 

TCTGCAATAG AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT 200 

ATACATTGGA ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT 250 

10 GCGCGGAATG GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT 300 

AAAAAAATGC ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA 350 

CCATGTTGAT GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG 400 

GATCCATACA AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC 450 

GATATTCAAA GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT 500 

15 TGCGAAAAAT GCTGGGATAG CTACTCCCGC , CTTTTGGGTT ATTAATAAAG 550 

ATGATAGGCC GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG 600 

GCGCGTTCAG GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA 650 

ATTGGACTAC GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA 700 

TTGAGCAGGC TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC 750 

2 0 AGTGCCGCGT TAGTTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG 800 

AATCTTTCGT ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG 850 

CAGTTATAAC CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA 900 

CAGGAAACGG CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC 950 

CCGTGTGGAT ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG 1000 

25 TCAATACTCT GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG 1050 

GCCGCTGCAG GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT 1100 

AGCGTTAAAG GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA 1150 

AATAGTACAC GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT 1200 

TCACCGGAAA ACCGGTTGAC GGTTATAAGT AA 1232 

30 



2) INFORMATION FOR SEQ ID NO: 1055 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
45 (B) STRAIN: R684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055 

TACCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT AAAAACTGTT 50 

50 TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG TTGCAATACT 100 

GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA TCTGCAATAG 150 

AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT ATACATTGGA 200 

ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT GCGCGGAATG 250 

GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT AAAAAAATGC 300 

55 ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA CCATGTTGAT 350 

GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG GATCCATACA 400 

AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC GATATTCAAA 450 

GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT TGCGAAAAAT 500 

GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG ATGATAGGCC 550 

60 GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG GCGCGTTCAG 600 

597 
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GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGACTAC 650 

GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA TTGAGCAGGC 700 

TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC AGTGCCGCGT 750 

TAGCTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG AATCTTTCGT 800 

5 ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG CAGTTATAAC 850 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG 900 

CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT 950 

ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG TCAATACTCT 1000 

GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG GCCGCTGCAG 1050 

10 GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT AGCGTTAAAG 1100 

GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 1150 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA 1200 

ACCGGTTGAC GGTTAGAA 1218 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 1056 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1056 



AATCACACCG CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG 50 

TTGAAAAAAC CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT 100 

35 GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG 150 

ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 200 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT 250 

GTGCGAAAAA CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG 300 

TACTCTCGCC GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT 350 

40 GAATATGAAA TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA 400 

GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC 450 

CTTTTGTAGG CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG 500 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG 550 

GGTTATTAAT AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG 600 

45 TTTTTGTTAA GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC 650 

AATAGCGCGG ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA 700 

CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG 750 

CGGTATTGGG AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC 800 

AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA 850 

50 AGGCTCTGAA AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG 900 

AGCGAGGACG GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC 950 

TGTAGAGGTC TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT 1000 

TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT 1050 

ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC 1100 

55 CGCTTGATCG TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT 1150 

ACTTTTTTAG ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC 1200 

CACTTGGGAT AATTTCACCG GAAAACCGGT TGACGGTTAT GAAGTAAATC 1250 

GCATTGTAGG GACAT 1265 



598 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1057 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:" 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



15 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 



TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

20 ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

25 ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

30 TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

35 GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

40 CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCGC 1250 

ATTGTAGGGA CATACGAAT 1269 



2) INFORMATION FOR SEQ ID NO: 1058 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1169 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R420 



599 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

5 ATTCTCCAGA ATACTCAGTG TCACTAACCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTAACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 300 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

10 CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 4 00 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

15 GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 7 00 

AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

TTGACGATTG GTGCTTGTGA TGCGATTTCT CTTGTCGACG GTTTTTTTGA 800 

TTTTGAAGAG AAATACCAAT TAATCAGCGC CACGATCACT GTCCCAGCAC 850 

20 CATTGCCTCT CGCGCTTGAA TCACAGATCA AGGAGCAGGC ACAGCTGCTT 900 

TATCGAAACT TGGGATTGAC GGGTCTGGCT CGAATCGATT TTTTCGTCAC 950 

CAATCAAGGA GCGATTTATT TAAACGAAAT CAACACCATG CCGGGATTTA 1000 

CTGGGCACTC CCGCTACCCA GCTATGATGG CGGAAGTCGG GTTATCCTAC 1050 

GAAATATTAG TAGAGCAATT GATTGCACTG GCAGAGGAGG ACAAACGATG 1100 

25 AACACATTAC AATTGATCAA TAAAAACCAT CCATTGAAAA AAAATCAAGA 1150 

GCCCCCGCAC TTAGTGCTA 1169 



30 2) INFORMATION FOR SEQ ID NO: 1059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

ATTCTCCAGA ATACTCAGTG TCACTAGCCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTGACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

50 AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 300 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 4 50 

55 ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

60 AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

600 



SUBSTITUTE SHEET (RULE 26) 



10 



20 



WO 01/23604 PCT/CA00/01150 

TTGACGATTG GTGCTTGTGA TGCGATTTCT CTTGTCGACG GTTTTTTTGA 800 

TTTTGAAGAG AAATACCAAT TAATCAGCGC CACGATCACT GTCCCAGCAC 850 

CATTGCCTCT CGCGCTTGAA TCACAGATCA AGGAGCAGGC ACAGCTGCTT 900 

TATCGAAACT TGGGATTGAC GGGTCTGGCT CGAATCGATT TTTTCGTCAC 950 

CAATCAAGGA GCGATTTATT TAAACGAAAT CAACACCATG CCGGGATTTA 1000 

CTGGGCACTC CCGCTACCCA GCTATGATGG CGGAAGTCGG GTTATCCTAC 1050 

GAAATATTAG TAGAGCAATT GATTGCACTG GCAGAGGAGG ACAAACGATG 1100 

AACACATTAC AATTGATCAA TAAAAACCAT CCATTGAAAA AAAATCAAGA 1150 

GCCCCCGCAC TTAGTG 1166 



2) INFORMATION FOR SEQ ID NO: 1060 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
25 (B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060 

AACATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAATACACC 50 

30 GTTTCTTTAG CTTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CTATGACTAC GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTACTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 2 00 

TTGGATACGA AACATAAACA GAAAATACAG CCGCTATTCG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AGCAAACGTT GGTACCTGAT GTTTTATTTC 3 00 

35 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AGCTGCCTTA TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT 4 00 

ATGTATGAAC AAATGGCTGC TGCATCAAGC TGCAGCAGCC ATTGGCGTAC 4 50 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGAACAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTC CCAGTTTTCT TTAAGCCTAA 550 

40 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 7 00 

CTCTTTGACT GTCGGTGCTT GTGACGCCAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAAAT CACCGTCCCT 800 

45 GCGCCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTGGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGAGCG AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGG TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTT 1028 

50 



2) INFORMATION FOR SEQ ID NO: 1061 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

601 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 
5 (B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061 

AATATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAGTACGCC 50 

10 GTTTCTTTAG CCTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 
CGATGACTAT- GACCTCTCTT -TGATCGGGAT CGCCCCAGAT GCTATGGATT . 150 

GGTATTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 200 

TTGGATACGA AACATAAACA GAAAATCCAG CCGCTTTTTG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AACAAACGTT GGTTCCTGAT GTTTTATTTC 300 

1'5 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AACTACCTTA TGTAGGCTGC GGGGTGGCAG CCTCTGCCTT 4 00 

ATGTATGAAC AAATGGCTGC TGCATCAAGC AGCAGAAGCG ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTO TTGACAAATC AAGCCAACCA GCAAGATCAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTT CCGGTTTTTT TTAAGCCTAA 550 

20 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCAGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCTAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAGAT CACCGTTCCT 800 

25 GCACCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTTGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGATCA AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGA TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTTGT 1030 

30 



2) INFORMATION FOR SEQ ID NO: 1062 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS.: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
45 (B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

50 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 200 

TGTTGGATAC GAAACATAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGC TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

55 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTGCCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 4 00 

TTATGTATGA ACAAATGGCT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 4 50 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCAAGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TCCCAGTTTT CTTTAAGCCT 550 

60 AATGAAGCGG GCTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

602 
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AATCGCTTCT GCCTTAAAAG 
TACAAAAAAA TATTGCCGGT 
GACTCTTTGA CTGTCGGTGC 
TTTCGATTTT GAAGAAAAGT 
5 CTGCGCCATT GCCTGAAACG 
CTGCTCTATC GTAGTCTTGG 
TGTCACGGAG CGAGGAGAAC 
GCTTTACGAG TCACTCCCGC 
TCCTATCAAG AACTACTACA 



AAGCCTTTAC TTATTGTTCC 
GTTGAGATCG GTTGCGGTAT 
TTGTGACGCC ATTTCATTAG 
ACCAGCTGAT CAGCGCCAAA 
ATTGAAACCA AGGTCAAAGA 
TCTTAAAGGT CTTGCTCGCA 
TATACTTGAA TGAAATCAAT 
TATCCTGCCA TGATGGCAGC 
AAAACTGCTT G 



PCT/CA00/01150 



GCAGTGCTCC 


650 


TTTGGGCAAC 


700 


TAGACGGCTT 


750 


ATCACCGTCC 


800 


ACAAGCTCAG 


850 


TCGACTTTTT 


900 


ACTATGCCGG 


950 


GGTCGGCTTA 


1000 




1031 



2) INFORMATION FOR SEQ ID NO: 1063 

15 (i) SEQUENCE CHARACTERISTICS: < 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
25 (B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063 

AACATGAAAA AAATCGCCAT TATTTTTGGA GGCAATTCAC CGGAATACAC 50 

30 CGTTTCTTTA GCTTCAGCAA CTAGCGCAAT CGAAGCACTC CAATCATCTC 100 

CCTATGACTA CGACCTCTCT TTGATCGGGA TCGCCCCAGA TGCTATGGAT 150 

TGGTACTTGT ATACAGGAGA ACTGGAAAAC ATCCGACAAG ACACGTGGTT 2 00 

GTTGGATACG AAACATAAAC AGAAAATACA GCCGCTATTT GAAGGAAACG 250 

GCTTTTGGCT AAGTGAAGAG CAGCAAACGT TAGTACCTGA TATTTTATTT 300 

35 CCCATTATGC ATGGCAAATA CGGGGAAGAT GGCAGTATCC AAGGATTGTT 350 

TGAATTGATG AAACTACCTT ATGTAGGTTG CGGGGTGGCA GGTTCTGCCT 400 

TATGTATGAA CAAATGGCTG CTGCATCAAG CTGCAGCAGC CATTGGCGTA 450 

CAAAGTGCTC CTACGATTCT CTTGACAAAT CAAGCCAACC AGCAAGAACA 500 

AATCGAAGCT TTTATCCAGA CCCATGGCTT CCCAGTTTTC TTTAAGCCTA 550 

4 0 ATGAAGCGGG CTCTTCAAAA GGGATCACTA AAGTCACCTG CGTTGAAGAA 600 

ATCGCTTCTG CCTTAAAAAA AGCCTTTACT TATTGTTCCG CAGTGCTCCT 650 

ACAAAAAAAT ATTGCCGGTG TTGAGATCGG TTGCGGTATT TTGGGCAACG 700 

ACTCTTTGAC TGTCGGTGCT TGTGACGCCA TTTCATTAGT AGACGGCTTT 750 

TTCGATTTTG AAGAAAAGTA CCAGCTGATC AGCGCCAAAA TCACCGTCCC 800 

45 TGCGCCATTG CCTGAAACGA TTGAAACCAA GGTCAAAGAA CAAGCTCAGC 850 

TGCTCTATCG TAGTCTTGGT CTTAAAGGTC TTGCTCGCAT CGACTTTTTT 900 

GTCACGGATC AAGGAGAACT ATACTTGAAT GAAATCAATA CTATGCCGGG 950 

CTTTACGAGT CACTCCCGTT ATCCTGCCAT GATGGCAGCG GTCGGCTTAT 1000 

CCTATCAAGA ACTACTACAA AAACTGCTTG 1030 

50 



2) INFORMATION FOR SEQ ID NO: 1064 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

603 
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WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/0J150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

L0 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGG.G ATCGCCCCAG. ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT" 200 

TGTTGGATAC GAAACAGAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGT TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

L5 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTACCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 400 

TTATGTATGA ACAAATGGTT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTO TCTTGACAAA TCACGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TTCCAGTTTT CTTTAAGCCT 550 

20 AATGAAGCGG GTTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

AATCGCTTCT GCCTTAAAAG AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 650 

TACAAAAAAA TATTGCCGGT GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 700 

GACTCTTTGA CTGTCGGTGC TTGTGACGCC ATTTCATTAG TAGACGGCTT 750 

TTTCGATTTT GAAGAAAAGT ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 800 

25 CTGCGCCATT GCCTGAAACG ATTGAAACTA AGGTCAAAGA ACAAGCTCAG 850 

CTGCTCTATC GTAGTCTTGG ACTTAAAGGT CTTGCTCGCA TCGACTTTTT 900 

TGTCACGGAT CAAGGAGAAC TATACTTGAA TGAAATCAAT ACTATGCCGG 950 

GCTTTACGAG TCACTCCCGC TATCCTGCCA TGATGGCAGC GGTCGGGTTA 1000 

TCCTATCAAG AACTACTACA AAAACTACTT GT 1032 

30 



2) INFORMATION FOR SEQ ID NO: 1065 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
45 (B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065 

AAAAACATGA AAAAAATCGC CATTATTTTT GGAGGCAATT CACCGGAATA 50 

50 CACCGTTTCT TTAGCTTCAG CAACTAGCGC AATCGAAGCA CTCCAATCAT 100 

CTCCCTATGA CTACGACCTC TCTTTGATCG GGATCGCCCC AGATGCTATG 150 

GATTGGTACT TGTATACAGG AGAACTGGAA AACATCCGAC AAGACACGTG 200 

GTTGTTGGAT ACGAAACATA AACAGAAAAT ACAGCCGCTA TTCGAAGGAA 250 

ACGGCTTTTG GCTAAGTGAA GAGCAGCAAA CGTTGGTACC TGATGTTTTA 300 

55 TTTCCCATTA TGCATGGCAA ATACGGGGAA GATGGCAGTA TCCAAGGATT 350 

GTTTGAATTG ATGAAGCTGC CTTATGTAGG CTGCGGGGTG GCAAGTTCTG 4 00 

CCTTATGTAT GAACAAATGG CTGCTGCATC AAGCTGCAGC AGCCATTGGC 450 

GTACAAAGTG CTCCTACGAT TCTCTTGACA AATCAAGCCA ACCAGCAAGA 500 

ACAAATCGAA GCTTTTATCC AGACCCATGG CTTTCCAGTT TTCTTTAAGC 550 

60 CTAATGAAGC GGGCTCCTCA AAAGGGATCA CTAAAGTCAC CTGCGTTGAA 600 

604 
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GAAATCGCTT CTGCCTTAAA 
CCTACAAAAA AATATTGCCG 
ACGACTCTTT GACTGTCGGT 
TTTTTCGATT TTGAAGAAAA 
5 CCCTGCGCCA TTGCCTGAAA 
AGCTGCTCTA TCGTAGTCTT 
TTTGTCACGG ATCAAGGAGA 
GGGCTTTACG AGTCACTCCC 
TATCCTATCA AGAACTACTA 



AGAAGCCTTT ACTTATTGTT 
GTGTTGAGAT CGGTTGCGGT 
GCTTGTGACG CCATTTCATT 
GTACCAGCTG ATCAGCGCCA 
CGATTGAAAC CAAGGTCAAA 
GGTCTTAAAG GTCTTGCTCG 
ACTATACTTG AATGAAATCA 
GCTATCCTGC CATGATGGCA 
CAAAAACTGC TTGT 



PCT/CAOO/01150 

CCGCAGTGCT 650 

ATTTTGGGCA 700 

AGTAGACGGC 750 

AAATCACCGT 800 

GAACAAGCTC 850 

CATCGACTTT 900 

ATACTATGCC 950 

GCGGTCGGCT 100 0 
1034 



2) INFORMATION FOR SEQ ID NO: 1066 

15 (i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 1012 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
25 (B) STRAIN: R760 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066 

CATGAAAAAA ATCGCCATTA TTTTTGGAGG CAATTCACCG GAATACACCG 50 

30 TTTCTTTAGC CTCAGCAACT AGCGCAATCG AAGCACTCCA ATCATCTCCC 100 

TATGATTACG ACCTCTCTTT GATCGGGATC GCCCCAGATG CTATGGATTG 150 

GTACTTGTAT ACAGGAGAAC TGGAAAACAT CCGACAAGAC ACGTGGTTGT 200 

TGGATACGAA ACATACACAG AAAATCCAGC CACTTTTTGA AGGAAACGGC 250 

TTTTGGATAA GTGAAGCGCA GCAAACGTTG GTACCTGATG TTTTATTTCC 300 

35 CATTATGCAT GGTAAATACG GGGAAGATGG CAGTATCCAA GGATTGTTTG 350 

AATTGATGAA GCTGCCTTAT GTAGGCTGTG GGGTGGCAGC CTCTGCCTTA 400 

TGTATGAACA AATGGTTATT GCATCAAGCA GCAGCAGCGA TTGGCGTACA 4 50 

AAGCGCTCCT ACGATTCTCT TGACAAATCA AGCCAACCAG CAAAGACAAA 500 

TCGAAGCCTT TATCCAGACC CATGGCTTTC CAGTTTTCTT TAAGCCTAAT 550 

40 GAAGCGGGCT CCTCAAAAGG GATCACAAAA GTAACTTGTG TTGAAGAAAT 600 

CGCTCCTGCC TTGAAGGAAG CCTTCGCTTA TTGTTCCGCA GTGCTCTTAC 650 

AAAAAAATAT CGCTGGCGTT GAGATTGGTT GCGGTATCTT AGGCAACGAC 700 

TCTTTGACTG TCGGTGCTTG TGACGCTATT TCATTAGTAG ACGGCTTTTT 750 

CGATTTTGAA GAAAAGTACC AGTTGATCAG CGCCAAAATC ACCGTTCCTG 800 

45 CGCCATTGCC TGAAACGATT GAAACCAAAG TCAAAGAACA AGCTCAGCTG 850 

CTCTATCACA GTCTTGGTCT TAAAGGACTT GCTCGCATCG ACTTTTTTGT 900 

CACGGATCAA GGAGAACTAT ACTTGAATGA AATCAATACT ATGCCGGGCT 950 

TTACGAGTCA CTCCCGCTAT CCTGCCATGA TGGCAGCGGT CGGCTTATCC 1000 

TATCAAGAAT TA 1012 

50 



2) INFORMATION FOR SEQ ID NO: 1067 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

605 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067 

CTTACGCTTT ATCGATTAGA CACGGGNAGC TTGTCCAATG GGRAGCCGAT 50 

10 TTGATTTTAT GGATGAACGC TCTCATCATG CGGCAAATGG AATATCATGC 100 

AATGAAGCGC AAAATCGCAG ACGTTTGCGC TCCATCATGG AAAACAGTGG 150 
GTTTGAAGCA TATAGCCTCG "AATGGTGGCA CTATGTATTA AGAGACGAAC " 200 

CATACCCCAA TAGCTATTTT GATTTCCCCG TTAAATAAAC TTTTAACCGT 250 

TGCACGGACA AACTATATAA GCTAACTCTT TCGGCAGGAA ACCCGACGTA 300 

15 TGTAACTGGT TCTTAGGGAA TTTATATATA GTAGATAGTA TTGAAGATGT 350 

AAGGCAGAGC GATATTGCGG TCATTATCTG CGTGCGCTGC GGCAAGATAG 400 

CCTGATAATA AGACTGATCG CATAGAGGGG TGGTATTTCA CACCGCCCAT 450 

TGTCAACAGG CAGTTCAGCC TCGTTAAATT CAGCATGGGT ATCACTTATG 500 

AAAATTCATC TACATTGGTG ATAATAGTAA ATCCAGTAGG GCGAAATAAT 550 

20 TGACTGTAAT TTACGGGGCA AAACGGCACA ATCTCAAACG AGATTGTGCC 600 

GTTTAAGGGG AAGATTCTAG AAATATTTCA TACTTCCAAC TATATAGTTA 650 

AGGAGGAGAC TGAAAATGAA GAAGTTGTTT TTTTTATTGT TATTGTTATT 700 
CTTAATATAC TTAGGTTATT G .721 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1068 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 



ATTTTTAAGG ATGAACGCTC TTCATCATGC GGCAAATGGA ATATCATGCA 50 

ATGAAGCGCA AAATCGCAGA CGTTTGCGCT CCATCATGGA AAACAGTGGG 100 

45 TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC TATGTATTAA GAGACGAACC 150 

ATACCCCAAT AGCTATTTTG ATTTCCCCGT TAAATAAACT TTTAACCGTT 200 

GCACGGACAA ACTATATAAG CTAACTCTTT CGGCAGGAAA CCCGACGTAT 250 

GTAACTGGTT CTTAGGGAAT TTATATATAG TAGATAGTAT TGAAGATGTA 300 

AGGCAGAGCG ATATTGCGGT CATTATCTGC GTGCGCTGCG GCAAGATAGC 350 

50 CTGATAATAA GACTGATCGC ATAGAGGGGT GGTATTTCAC ACCGCCCATT 4 00 

GTCAACAGGC AGTTCAGCCT CGTTAAATTC AGCATGGGTA TCACTTATGA 450 

AAATTCATCT ACATTGGTGA TAATAGTAAA TCCAGTAGGG CGAAATAATT 500 

GACTGTAATT TACGGGGCAA AACGGCACAA TCTCAAACGA GATTGTGCCG 550 

TTTAAGGGGA AGATTCTAGA AATATTTCAT ACTTCCAACT ATATAGTTAA 600 

55 GGAGGAGACT GAAAATGAAG AAGTTGTTTT TTTTATTGTT ATTGTTATTC 650 

TTAATATACT TAGGTTAT 668 



60 2) INFORMATION FOR SEQ ID NO: 1069 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
- - - (A)- -ORGANISM:. Enterococcus faecium 
(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069 

CGGCAAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTRAGCTT 50 

GTACCAATGG GAAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 100 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 150 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 200 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 250 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 300 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 350 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 400 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 450 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 500 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 550 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 600 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 650 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 7 00 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 750 

GAAANCCTGA 7 60 



2) INFORMATION FOR SEQ ID NO: 1070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070 

AAAAGGAATA CGGGGCTTTC AAAAATCCAA GCCATAACCC GCGGGCAAGT 50 

GCCATTTGAT TCTTACGCTT TAATCGATTA GACACGGGTA AGCTTGTACC 100 

AATGGGGAAC CGATTTGATT TTAATGNATG AACGCTCTTC ATCATGCGGC 150 

AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 200 

TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGGCACTAT 250 

GTATTAAGAG ACGAACCATA CCCCAATAGC TATTTTGATT TCCCCGTTAA 300 

ATAAACTTTT AACCGTTGCA CGGACAAACT ATATAAGCTA ACTCTTTCGG 350 

CAGGAAACCC GACGTATGTA ACTGGTTCTT AGGGAATTTA TATATAGTAG 400 

ATAGTATTGA AGATGTAAGG CAGAGCGATA TTGCGGTCAT TATCTGCGTG 450 

CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 500 

607 
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ATTTCACACC GCCCATTGTC AACAGGCAGT TCAGCCTCGT TAAATTCAGC 
ATGGGTATCA CTTATGAAAA TTCATCTACA TTGGTGATAA TAGTAAATCC 
AGTAGGGCGA AATAATTGAC TGTAATTTAC GGGGCAAAAC GGCACAATCT 
CAAACGAGAT TGTGCCGTTT AAGGGGAAGA TTCTAGAAAT ATTTCATACT 
TCCAACTATA TAGTTAAGGA GGAGACTGAA AATGAAGAAG TTGTTTTTTT 
TTATTGTTAT TGTTATTCTT AATATACTTA GGTTATGACT ACGTTAATGA 
A 



550 
600 
650 
700 
750 
800 
801 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR -SEQ -ID NO : 1071 - . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071 

TTGTACCAAT GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT 50 

GCGGCAAATG GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG 100 

CTCCATCATG GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC 150 

ACTATGTATT AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC 200 

GTTAAATAAA CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT 250 

TTCGGCAGGA AACCCGACGT ATGTAACTGG TTCTTAGGGA ATTTATATAT 300 

AGTAGATAGT ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT 350 

GCGTGCGCTG CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG 400 

GTGGTATTTC ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT 450 

TCAGCATGGG TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA 500 

AATCCAGTAG GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC 550 

AATCTCAAAC GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC 600 

ATACTTCCAA CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT 650 

TTTTTTATTG TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA 700 

ATGAAGCACT G 711 



2) INFORMATION FOR SEQ ID NO: 1072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072 
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GCCATTGATC TTACGCTTTA TCGATTAGAC 
GGGGAGCCGA TTTGATTTTA TGGATGAACG 
GAATATCATG CAATGAAGCG CAAAATCGCA 
GAAAACAGTG GGTTTGAAGC ATATAGCCTC 
5 AAGAGACGAA CCATACCCCA ATAGCTATTT 
CTTTTAACCG TTGCACGGAC AAACTATATA 
AACCCGACGT ATGTAACTGG TTCTTAGGGA 
ATTGAAGATG TAAGGCAGAG CGATATTGCG 
CGGCAAGATA GCCTGATAAT AAGACTGATC 

10 ACACCGCCCA TTGTCAACAG GCAGTTCAGC 
- - TATCACTTAT- GAAAATTCAT CTACATTGGT 
GGCGAAATAA TTGACTGTAA TTTACGGGGC 
GAGATTGTGC CGTTTAAGGG GAAGATTCTA 
CTATATAGTT AAGGAGGAGA CTGAAAATGA 

15 TTATTGTTAT TCTTAATATA CTTAGGTTAT ■ 
G 



PCT/CA00/01150 

ACGGGTNAGC TTGTACCAAT 50 

CTCTCATCAT GCGGCAAATG 100 

GACGTTTGCG CTCCATCATG 150 

GAATGGTGGC ACTATGTATT 200 

TGATTTCCCC GTTAAATAAA 250 

AGCTAACTCT TTCGGCAGGA 300 

ATTTATATAT AGTAGATAGT 350 

GTCATTATCT GCGTGCGCTG 400 

GCATAGAGGG GTGGTATTTC 450 

CTCGTTAAAT TCAGCATGGG 500 

GATAATAGTA AATCCAGTAG 550 

AAAACGGCAC AATCTCAAAC "600 

GAAATATTTC ATACTTCCAA 650 

AGAAGTTGTT TTTTTTATTG 700 

GACTACGTTA ATGAAGCACT 750 

751 



20 2) INFORMATION FOR SEQ ID NO: 1073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



35 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1073 



ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT 50 

GCAATGAAGC GCAAAATCGC AGACGTTTCC GCTCCATCAT GGAAAACAGT 100 

GGGTTTGAAG CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA 150 

ACCATACCCC AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC 200 

40 GTTGCACGGA CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG 250 

TATGTAACTG GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT 300 

GTAAGGCAGA GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT 350 

AGCCTGATAA TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC 400 

ATTGTCAACA GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA 450 

45 TGAAAATTCA TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA 500 

ATTGACTGTA ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG 550 

CCGTTTAAGG GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT 600 

TAAGGAGGAG ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA 650 

TTCTTAATAT ACTTAGGTTA TGACTACGTT AATGA 685 

50 

2) INFORMATION FOR SEQ ID NO: 1074 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



609 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faeciuw 
5 (B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074 

• ATCGATTAGA CACGGGTGAG CTTGTACCAA TGGGGAGCCG ATTTGATTTT 50 

10 ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT GCAATGAAGC 100 

. .GCAAAATCGC.AGACGTTTGC GCTCCATCAT GGAAAACAGT GGGTTTGAAG 150 

CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA ACCATACCCC " 200 

AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC GTTGCACGGA 250 

CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG TATGTAACTG 300 

15 GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT GTAAGGCAGA 350 

GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT AGCCTGATAA 4 00 

TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC ATTGTCAACA 450 

GGCAGTTCAG CCTCGTTAAA* TTCAGCATGG GTATCACTTA TGAAAATTCA 500 

TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA ATTGACTGTA 550 

20 ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG CCGTTTAAGG 600 

GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT TAAGGAGGAG 650 

ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA TTCTTAATAT 700 

ACTTAGGTTA TGACTACGTT AATGAAGCAC TG 732 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1075 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075 



TCTCATCATG CGGCAAATGG AATATCATGC AATGAAGCGC AAAATCGCAG 50 

ACGTTTGCGC TCCATCATGG AAAACAGTGG GTTTGAAGCA TATAGCCTCG 100 

45 AATGGTGGCA CTATGTATTA AGAGACGAAC CATACCCCAA TAGCTATTTT 150 

GATTTCCCCG TTAAATAAAC TTTTAACCGT TGCACGGACA AACTATATAA 200 

GCTAACTCTT TCGGCAGGAA ACCCGACGTA TGTAACTGGT TCTTAGGGAA 2 50 

TTTATATATA GTAGATAGTA TTGAAGATGT AAGGCAGAGC GATATTGCGG 300 

TCATTATCTG CGTGCGCTGC GGCAAGATAG CCTGATAATA AGACTGATCG 350 

50 CATAGAGGGG TGGTATTTCA ■ CACCGCCCAT TGTCAACAGG CAGTTCAGCC 400 

TCGTTAAATT CAGCATGGGT ATCACTTATG AAAATTCATC TACATTGGTG 4 50 

ATAATAGTAA ATCCAGTAGG GCGAAATAAT TGACTGTAAT TTACGGGGCA 500 

AAACGGCACA ATCTCAAACG AGATTGTGCC GTTTAAGGGG AAGATTCTAG 550 

AAATATTTCA TACTTCCAAC TATATAGTTA AGGAGGAGAC TGAAAATGAA 600 

55 GAAGTTGTTT TTTTTATTGT TATTGTTATT CTTAATATAC TTAGGTTATG 650 

ACTACGTTAA TGAAGCACTG 670 



60 2) INFORMATION FOR SEQ ID NO: 1076 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) - ORGANISM:. Escherichia coli 

(B) STRAIN: DG131/3 serotype 0X3 :H8 

(C) ACCESSION NUMBER: Z36901 

15 (xi) SEQUENCE DESCRIPTION: SEQ< ID NO: 1076 

ATGAAAATAA TAATTTTTAG AGTGCTAACT TTTTTCTTTG TTATCTTTTC 50 

TGTTAATGTG GTTGCGAAGG AATTTACCTT AGATTTCTCG ACAGCAAAGA 100 

CGTATGTAGA TTCGCTGAAT GTCATTCGCT CTGCAATAGG TACTCCATTA 150 

20 CAGACTATTT CATCAGGAGG TACGTCTTTA CTGATGATTG ATAGTGGCAC 200 

AGGGGATAAT TTGTTTGCAG TTGATGTCAG AGGGATAGAT CCAGAGGAAG 250 

GGCGGTTTAA TAATCTACGG CTTATTGTTG AACGAAATAA TTTATATGTG 300 

ACAGGATTTG TTAACAGGAC AAATAATGTT TTTTATCGCT TTGCTGATTT 350 

TTCACATGTT ACCTTTCCTG GTACAACTGC GGTTACATTG TCTGGTGACA 400 

25 GTAGCTATAC CACGTTACAG CGTGTTGCGG GGATCAGTCG TACGGGGATG 450 

CAGATAAATC GCCATTCGTT GACTACTTCT TATCTGGATT TAATGTCGCA 500 

TAGCGGAACC TCACTGACGC AGTCTGTGGC AAGAGCGATG TTACGGTTTG 550 

TTACTGTGAC AGCTGAAGCT TTACGTTTTC GGCAAATTCA GAGGGGATTT 600 

CGTACAACAC TTGATGATCT CAGTGGGCGT TCTTATGTAA TGACTGCTGA 650 

30 AGATGTTGAT CTTACGTTGA ACTGGGGAAG GTTGAGTAGT GTCCTGCCTG 700 

ACTATCATGG ACAAGACTCT GTTCGTGTTG GAAGAATTTC TTTTGGAAGT 750 

GTTAATGCAA TTCTGGGTAG CGTGGCATTA ATACTGAATT GTCATCATCA 800 

TGCATCGCGA GTTGCCAGAA TTGTACCTAA TGAGTTTCCT TCTATGTGCC 850 

CGGTAGATGG AAGAGTGCGT GGGATTACGC ACAATAAAAT ATTGTGGGAC 900 

35 TCATCCACTC TGGGGGCAAT TTTGATACGC AGGGCTATTA GCAGTTGA 948 



2) INFORMATION FOR SEQ ID NO: 1077 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Escherichia coli 

(B) STRAIN: 94C serotype 048:H21 

(C) ACCESSION NUMBER: extracted from Z37725 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077 



CACCTGTATA TGAAGTGTAT ATTATTTAAA TGGGTACTGT GCCTGTTACT 50 

GGGTTTTTCT TCGGTATCCT ATTCCCGGGA GTTTACGATA GACTTTTCGA 100 

CCCAACAAAG TTATGTCTCT TCGTTAAATA GTATACGGAC AGAGATATCG 150 

ACCCCTCTTG AACATATATC TCAGGGGACC ACATCGGTGT CTGTTATTAA 200 

60 / CCACACCCCA CCGGGCAGTT ATTTTGCTGT GGATATACGA GGGCTTGATG 250 

611 
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TCTATCAGGC GCGTTTTGAC CATCTTCGGC TGATTATTGA GCAAAATAAT 300 

TTATATGTGG CCGGGTTCGT TAATACGGCA ACAAATACTT TCTACCGTTT 350 

TTCAGATTTT ACACATATAT CAGTGCCCGA TGTGACAACG GTTTCCATGA 400 

CAACGGACAG CAGTTATACC ACTCTGCAAC GTGTCGCAGC GCTGGAACGT 450 

5 TCCGGAATGC AAATCAGTCG TCACTCACTG GTTTCATCAT ATCTGGCGTT 500 

AATGGAGTTC AGTGGTAATA CAATGACCAG AGATGCATCC AGAGCAGTTC 550 

TGCGTTTTGT CACTGTCACA GCAGAAGCCT TACGCTTCAG GCAGATACAG 600 

AGAGAATTTC GTCAGGCACT GTCTGAAACT GCTCCTGTGT ATACCATGAC 650 

GCCGGGAGAC GTGGACCTCA CTCTGAACTG GGGGCGAATC AGCAATGTGC 700 

10 TTCCGGAGTA TCGGGGAGAG GATGGTGTCA GAGTGGGGAG AATATCCTTT 750 

AATAATATAT CAGCGATACT .GGGTACTGTG GCCGTTATAC TGAATTGCCA 800 

TCATCAGGGG GCGCGTTCTG TTCGCGCCGT GAATGAAGAG AGTCAACCAG "850 

AATGTCAGAT AACTGGCGAC AGGCCCGTTA TAAAAATAAA CAATACATTA 900 

TGGGAAAGTA ATACAGCTGC AGCGTTTCTG AACAGAAAGT CACAGTTTTT 950 

15 ATATACAACG GGTAAATAAA GGAGTTAAGT ATGAAGAAGA TGTTTATGGC 1000 

GGTTTTATTT GCATTAGTTT CTGTTAATGC AATGGCGGCG GATTGTGCTA 1050 

AAGGTAAAAT TGAGTTTTCC AAGTATAATG AGGATGACAC ATTTACAGTG 1100 

AAGGTTGACG GGAAAGAATA CTGGACCAGT CGCTGGAATC TGCAACCGTT 1150 

ACTGCAAAGT GCTCAGCTGA CAGGAATGAC TGTCACAATC AAATCCAGTA 1200 

20 CCTGTGAATC AGGCTCCGGA TTTGCTGAAG TGCAGTTTAA TAATGACTGA 1250 

GGCATAACC 1259 



25 2) INFORMATION FOR SEQ ID NO: 1078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078 

AGTTCTGCGT TTTGTCACTG TC 22 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1079 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079 
CGGAAGCACA TTGCTGATT 19 

2) INFORMATION FOR SEQ ID NO: 1080 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 25 bases 

612 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080 
TATAGCTACT GTCACCAGAC AATGT 25 



10 



2) INFORMATION FOR SEQ ID NO: 1081 

15 (i) SEQUENCE CHARACTERISTICS: > 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : S ingl e 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081 
25 ATGTCAGAGG GATAGATCCA 20 



2) INFORMATION FOR SEQ ID NO: 1082 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082 

40 

TTGARCRAAA TAATTTATAT GTG 23 



45 2 ) INFORMATION FOR SEQ ID NO: 1083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083 

TGATGATGRC AATTCAGTAT 20 



60 
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2) INFORMATION FOR SEQ ID NO: 1084 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084 
CCACGCCGCT TTGCTGATTT TTCACATGTT ACCGCGTGG 



2) INFORMATION FOR SEQ ID NO: 1085 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085 
CCACGCCACT GTCTGAAACT GCTCCTGTGC GTGG 



2) INFORMATION FOR SEQ ID NO: 1086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086 
CTACTCCCGC CTTTTGGGTT 



2) INFORMATION FOR SEQ ID NO: 1087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087 
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CTCACAGCCC GAAACAGCCT 



PCT/CAOO/01150 

20 



5 2) INFORMATION FOR SEQ ID NO: 1088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ - ID NO: 1088 

TGCCGTTTCC TGTATCCGTC 20 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1089 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089 
ATCCACACGG GCTAGACCTC 20 

2) INFORMATION FOR SEQ ID NO: 1090 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090 
AATAGCGCGG ACGAATTGGA C 21 

50 



2) INFORMATION FOR SEQ ID NO: 1091 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091 
5 AACGCGGCAC TGTTTCCCAA 20 



10 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1092 



. . (i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092 

20 

TCGGCAAGAC AATATGACAG C 21 



25 2) INFORMATION FOR SEQ ID NO: 1093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa-165 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 



TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50 

AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100 

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150 

CGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACCTGCTT 200 

45 TTTTAAAGTA TAGGTATAAA ATACAATTGA TTAAAATAGT AAAGGAAATG 250 

AATCATGAAA CAATTAACTA AGCCTTTATA CTTTTACCTA TTACTTTTTA 300 

TTACAACAAC ACTGATTGGC GCGTTACTAT TATATTTGCC AATCACAGGT 350 

AAACATCCTA TTGATTTTGT GGACGCCCGT T 381 



2) INFORMATION FOR SEQ ID NO: 1094 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 

616 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION.: SEQ ID NO: 1094 
GGTAAAACAG GTACTTCTAA CTA 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1095 

(i) SEQUENCE CHARACTERISTICS: 
" - - (A)- LENGTH:- 20 bases - 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095 
CGATAGAAGC AGCAGGACAA 



2) INFORMATION FOR SEQ ID NO: 1096 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096 
CTGATGGATG CGGAAGATAC 



2) INFORMATION FOR SEQ ID NO: 1097 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097 
TCYTCAAAAG GGATCACWAA AGTMAC 



2) INFORMATION FOR SEQ ID NO: 1098 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098 
TCTTCAAAAT CGAAAAAGCC GTC 23 



2) INFORMATION FOR SEQ ID NO: 1099 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY :■ Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099 
TCAAAAGGGA TCACWAAAGT MAC 23 

25 

2) INFORMATION FOR SEQ ID NO: 1100 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100 
40 GTAAAKCCCG GCATRGTRTT GATTTC 26 



45 



2 ) INFORMATION FOR SEQ ID NO: 1101 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101 

55 

GACGGYTTTT TYGATTTTGA AGA 23 



60 2) INFORMATION FOR SEQ ID NO: 1102 

618 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102 

AAAAARTCGA TKCGAGCMAG ACC ' " 



15 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 1103 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103 
ATCCCGCTAT GAAAACGATC 

2) INFORMATION FOR SEQ ID NO: 1104 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104 
GGATCAACAC AGTAGAACCG 

55 



2 ) INFORMATION FOR SEQ ID NO: 1105 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105 
50 CTCCTACGAT TCTCTTGAYA AATCA 
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2) INFORMATION FOR SEQ ID NO: 1106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION :• SEQ ID NO: 1106 
CAACCGATCT CAACACCGGC AAT 



2) INFORMATION FOR SEQ ID NO: 1107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107 
CTCATTTGAC TTCCTCCTTT GCT 



2) INFORMATION FOR SEQ ID NO: 1108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108 
GTAAGAATCG GAAAAGCGGA AGG 



2) INFORMATION FOR SEQ ID NO: 1109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear ' 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109 
ACATCGTGAT CGCTAAAAGG AGC 



PCT/CAOO/01150 



23 



2) INFORMATION FOR SEQ ID NO: 1110 

10 (i) SEQUENCE CHARACTERISTICS: 

"(A) LENGTH: 23 bases- . - . _ 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110 
20 ACGAGAAAGA CAACAGGAAG ACC 23 



25 



2) INFORMATION FOR SEQ ID NO: 1111 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111 

35 

CTTTTTCCGG CTCGWYTTCC TGATG 25 



10 2) INFORMATION FOR SEQ ID NO: 1112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112 

GGCTGYGATA TTCAAAGCTC 20 



55 

2) INFORMATION FOR SEQ ID NO: 1113 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
50 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113 
ACCGACCTCA CAGCCCGAAA 20 



2) INFORMATION FOR SEQ ID NO: 1114 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114 
TCWGAGCCTT TTTCCGGCTC G 21 

25 

2) INFORMATION FOR SEQ ID NO: 1115 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115 
40 TTTCGGGCTG TGAGGTCGGB TGHGCG 26 



45 



2) INFORMATION FOR SEQ ID NO: 1116 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116 

55 

TTTCGGGCTG TGAGGTCGGB TGHGCGG 27 



60 2) INFORMATION FOR SEQ ID NO: 1117 

622 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L0 (vi) ORIGINAL SOURCE: 

- - - - (A) ORGANISM: Enterococcus _f.ae.civm 

(B) STRAIN: 94 

(C) ACCESSION NUMBER: U94526 

L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117 

AAATTCGATC CGCACTACAT CGGAATTACA AAAAACGGTG TATGGAAGCT 50 

ATGCAAGAAG CCATGTACGG* AATGGGAAGC CGACAGTCTC CCCGCCATAC 100 

TCTCCCCGGA TAGGAAAACG CATGGGCTGC TTGTCATGAA AGAAAGCGAA 150 

20 TACGAAACAC GGCGTATTGA TGTGGCTTTC CCGGTTTTGC ATGGCAAATG 200 

CGGGGAGGAT GGTGCGATAC AGGGGCTGTT TGTATTGTCT GGTATCCCCT 250 

ATGTGGGCTG TGATATTCAA AGCTCCGCAG CTTGCATGGA CAAATCACTG 300 

GCCTACATTC TTACAAAAAA TGCGGGCATC GCCGTTCCCG AATTTCAAAT 350 

GATTGATAAA GGTGACAAGC CGGAGGCGGG TGCGCTTACC TACCCTGTCT 4 00 

25 TTGTGAAGCC GGCACGGTCA GGTTCGTCCT TTGGCGTAAC CAAAGTAAAC 450 

GGTACGGAAG AACTTAACGC TGCGATAGAA GCGGCAGGAC AATATGATGG 500 

AAAAATCTTA ATTGAGCAAG CGATTTCGGG CTGTGAGGTC GGGTGTGCGG 550 

TCATGGGGAA CGAGGATGAT TTGATTGTCG GCGAAGTGGA TCAAATCCGG 600 

CTGAGCCACG GTATCTTCCG CATCCATCAG GAAAACGAGC CGGAAAAAGG 650 

B0 CTCAGAAAAT GCGATGATTA CAGTTCCCGC AGACATTCCG GTCGAGGAAC 7 00 

GAAATCGGGT GCAGGAAACG GCAAAGAAAG TATATCGGGT GCTTGGATGC 750 

AGAGGGCTTG CCCGTGTTGA TCTTTTTTTG CAGGAGGATG GCGGCATCGT 800 

T 801 



35 



2 ) INFORMATION FOR SEQ ID NO: 1118 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118 
TTTTCWGAGC CTTTTTCCGG CTCG 24 

50 



2) INFORMATION FOR SEQ ID NO: 1119 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



SO 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119 
5 TTTCGGGCTG TGAGGTCGGB TGHGC 25 



10 



2) INFORMATION FOR SEQ ID NO: 1120 



(i) SEQUENCE CHARACTERISTICS: . ..... 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120 

20 

TTTCGGGCTG TGAGGTCGGB TGHG 24 



25 2) INFORMATION FOR SEQ ID NO: 1121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121 

TGTTTGWATT GTCYGGYATC CC 22 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1122 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122 
TGGTGCATTG CTACGTGG 18 

2) INFORMATION FOR SEQ ID NO: 1123 



(i) SEQUENCE CHARACTERISTICS 
60 (A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123 
TTTCGGGCTG TGAGGTCGGB TG 



2) INFORMATION FOR SEQ ID NO: 1124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124 
GATTTGRTCC ACYTCGCCRA CA 



2) INFORMATION FOR SEQ ID NO: 1125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1125 
ACTCACAACT GGGATGGATG 



2) INFORMATION FOR SEQ ID NO: 1126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE^ TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1126 
TTATGGTTGT GCTGGTTGAG G 
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2) INFORMATION FOR SEQ ID NO: 1127 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
- (xi) SEQUENCE .DESCRIPTION: SEQ ID NO: 1127 
KCAAAYGCCA TTTCAAGTAA 



2) INFORMATION FOR SEQ ID NO: 1128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128 
GACGACYTTA TKGATATACA 



2) INFORMATION FOR SEQ ID NO: 1129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 9 
ATGATGACHG AMATGATGAA AAC 



2) INFORMATION FOR SEQ ID NO: 1130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130 
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GACTATCCAA GCATGCATTA TG 
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22 



5 2) INFORMATION FOR SEQ ID NO: 1131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

■ " - - (D) - TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131 

CATCTGGAGC TACRTARCCA GT 22 



20 



30 



35 



60 



2 ) INFORMATION FOR SEQ ID NO: 1132 



(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132 
AGTGAAAARA TGGCTGCTGC 20 

2) INFORMATION FOR SEQ ID NO: 1133 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133 
CATCAAGAAC ACTGGCTAYG TAG 23 

50 

2) INFORMATION FOR SEQ ID NO: 1134 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134 
5 CTAGATAGAG CTAAAACCTT CCT 23 



2) INFORMATION FOR SEQ ID NO: 1135 

10 

"'" " " (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135 

20 

CATTATGCAA ACGCCATTTC AAG 23 



25 2) INFORMATION FOR SEQ ID NO: 1136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136 

ACTTGTCCAC GTTSGATRTC T 21 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1137 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137 
AATTAATGGC TGCWGTTGAY GAA 23 

2) INFORMATION FOR SEQ ID NO: 1138 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1032 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(C) ACCESSION NUMBER: M75132 

(xi) "SEQUENCE DESCRIPTION: SEQ ID NO: 1138 



ATGAAAAAAA TTGCCGTTTT ATTTGGAGGG AATTCTCCAG AATACTCAGT 50 

GTCACTAACC TCAGCAGCAA GTGTGATCCA AGCTATTGAC CCGCTGAAAT 100 

15 ATGAAGTAAT GACCATTGGC ATCGCACCAA CAATGGATTG GTATTGGTAT 150 

CAAGGAAACC TCGCGAATGT TCGCAATGAT ACTTGGCTAG AAGATCACAA 200 

AAACTGTCAC CAGCTGACTT TTTCTAGCCA AGGATTTATA TTAGGAGAAA 250 

AACGAATCGT CCCTGATGTC CTCTTTCCAG TCTTGCATGG GAAGTATGGC 300 

GAGGATGGCT GTATCCAAGG ACTGCTTGAA CTAATGAACC TGCCTTATGT 350 

20 TGGTTGCCAT GTCGCTGCCT CCGCATTATG TATGAACAAA TGGCTCTTGC 4 00 

ATCAACTTGC TGATACCATG GGAATCGCTA GTGCTCCCAC TTTGCTTTTA 450 

TCCCGCTATG AAAACGATCC TGCCACAATC GATCGTTTTA TTCAAGACCA 500 

TGGATTCCCG ATCTTTATCA AGCCGAATGA AGCCGGTTCT TCAAAAGGGA 550 

TCACAAAAGT AACTGACAAA ACAGCGCTCC AATCTGCATT AACGACTGCT 600 

25 TTTGCTTACG GTTCTACTGT GTTGATCCAA AAGGCGATAG CGGGTATTGA 650 

AATTGGCTGC GGCATCTTAG GAAATGAGCA ATTGACGATT GGTGCTTGTG 700 

ATGCGATTTC TCTTGTCGAC GGTTTTTTTG ATTTTGAAGA GAAATACCAA 750 

TTAATCAGCG CCACGATCAC TGTCCCAGCA CCATTGCCTC TCGCGCTTGA 800 

ATCACAGATC AAGGAGCAGG CACAGCTGCT TTATCGAAAC TTGGGATTGA 850 

30 CGGGTCTGGC TCGAATCGAT TTTTTCGTCA CCAATCAAGG AGCGATTTAT 900 

TTAAACGAAA TCAACACCAT GCCGGGATTT ACTGGGCACT CCCGCTACCC 950 

AGCTATGATG GCGGAAGTCG GGTTATCCTA CGAAATATTA GTAGAGCAAT 1000 

TGATTGCACT GGCAGAGGAG GACAAACGAT GA 1032 



2) INFORMATION FOR SEQ ID NO: 1139 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 1768 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

50 (C) ACCESSION NUMBER: X56895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 9 

GATATCGTTA CGCTTCATGT GCCGCTCAAT ACGGATACGC ACTATATTAT 50 

55 CAGCCACGAA CAAATACAGA GAATGAAGCA AGGAGCATTT CTTATCAATA 100 

CTGGGCGCGG TCCACTTGTA GATACCTATG AGTTGGTTAA AGCATTAGAA 150 

AACGGGAAAC TGGGCGGTGC CGCATTGGAT GTATTGGAAG GAGAGGAAGA 200 

GTTTTTCTAC TCTGATTGCA CCCAAAAACC AATTGATAAT CAATTTTTAC 250 

TTAAACTTCA AAGAATGCCT AACGTGATAA TCACACCGCA TACGGCCTAT 3 00 

60 TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 350 
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TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 4 00 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 450 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 500 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 550 

5 TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 600 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 650 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 700 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 750 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 800 

-LO- ATGC.TGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 850 

CCGGTGGCAG CTACGTTTAC "CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 900 

AGGCTCATCC TTCGGTGTGA AAAAAGTCAA TAGCGCGGAC GAATTGGACT 950 

ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 1000 

GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 1050 

L5 GTTAGTTGTT GGCGAGGTGG ACCAAATCAG ' GCTGCAGTAC GGAATCTTTC 1100 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 1150 

ACCGTTCCCG CAGACCTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 12 00 

GGCAAAAAAA ATATATAAAG CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 12 50 

ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCAATACT 13 0 0 

20 CTGCCCGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 1350 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 1400 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 1450 

ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 1500 

AAACCGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAGTT 1550 

2 5 GGCTGAATCG CTTTTGAAGG CAAAAGAACT GGCTGCTACC CAAGGGTACG 1600 

GATTGCTTCT ATGGGACGGT TACCGTCCTA AGCGTGCTGT AAACTGTTTT 1650 

ATGCAATGGG CTGCACAGCC GGAAAATAAC CTGACAAAGG AAAGTTATTA 1700 

TCCCAATATT GACCGAACTG AGATGATTTC AAAAGGATAC GTGGCTTCAA 1750 

AATCAAGCCA TAGCCGCG 1768 

30 



2) INFORMATION FOR SEQ ID NO: 1140 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
45 (C) ACCESSION NUMBER: L29638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140 

GTAAGAATCG GAAAAGCGGA AGGAAGAAAA ACATGAAAAA AATCGCCATT 50 

50 ATTTTTGGAG GCAATTCACC GGAATACACC GTTTCTTTAG CTTCAGCAAC 100 

TAGCGCAATC GAAGCACTCC AATCATCTCC CTATGACTAC GACCTCTCTT 150 

TGATCGGGAT CGCCCCAGAT GCTATGGATT GGTACTTGTA TACAGGAGAA 200 

CTGGAAAACA TCCGACAAGA CACGTGGTTG TTGGATACGA AACATAAACA 250 

GAAAATACAG CCGCTATTCG AAGGAAACGG CTTTTGGCTA AGTGAAGAGC 300 

55 AGCAAACGTT GGTACCTGAT GTTTTATTTC CCATTATGCA TGGCAAATAC 350 

GGGGAAGATG GCAGTATCCA AGGATTGTTT GAATTGATGA AGCTGCCTTA 4 00 

TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT ATGTATGAAC AAATGGCTGC 4 50 

TGCATCAAGC TGCAGCAGCC ATTGGCGTAC AAAGTGCTCC TACGATTCTC 500 

TTGACAAATC AAGCCAACCA GCAAGAACAA ATCGAAGCTT TTATCCAGAC 550 

60 CCATGGCTTC CCAGTTTTCT TTAAGCCTAA TGAAGCGGGC TCCTCAAAAG 600 
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GGATCACTAA AGTCACCTGC 
GCCTTTACTT ATTGTTCCGC 
TGAGATCGGT TGCGGTATTT 
GTGACGCCAT TTCATTAGTA 
5 CAGCTGATCA GCGCCAAAAT 
TGAAACCAAG GTCAAAGAAC 
TTAAAGGTCT TGCTCGCATC 
TACTTGAATG AAATCAATAC 
TCCTGCCATG ATGGCAGCGG 
LO AACTGCTTGT CTTAGCAAAG 



GTTGAAGAAA TCGCTTCTGC 
AGTGCTCCTA CAAAAAAATA 
TGGGCAACGA CTCTTTGACT 
GACGGCTTTT TCGATTTTGA 
CACCGTCCCT GCGCCATTGC 
AAGCTCAGCT GCTCTATCGT 
GACTTTTTTG TCACGGAGCG 
TATGCCGGGC TTTACGAGTC 
TCGGCTTATC CTATCAAGAA 
GAGGAAGTCA AATGAG 



CTTAAAAGAA 650 

TTGCCGGTGT 700 

GTCGGTGCTT 750 

AGAAAAGTAC 800 

CTGAAACGAT 850 

AGTCTTGGTC 900 

AGGAGAACTA 950 

ACTCCCGCTA 1000 

CTACTACAAA 1050 
1086 



2) INFORMATION FOR SEQ ID NO: 1141 

L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3946 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
25 (B) STRAIN: BM4147 

(C) ACCESSION NUMBER: extracted from M97297 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 



30 ATGAATAACA 
TGCATTCCAT 
ACGCCAACGT 
ATCAGTGTGG 
GAAGAGAGCC 

35 ATATAGATAC 
GCGTACTCGC 
GGCAGTACGC 
TCAGGTTGGA. 
GTGGTGGGAA 

40 ATTTGGATGT 
ACTATGTACC 
CATGTGCCGC 
ACAGAGAATG 
TTGTAGATAC 

45 GGTGCCGCAT 
TTGCACCCAA 
TGCCTAACGT 
TTGCGTGATA 
GAGACAGGAG 

50 TCAGAGGAGC 
CATTAATAAA 
GTGTATGGAA 
TGCTATTCAG 
TAAAAAGAAC 

55 CTTTGCATGG 
TTGTCCGGTA 
TATGGACAAA 
CTCCCGCCTT 
TTTACCTATC 

60 TGTGAAAAAA 



TCGGCATTAC 
GCTCTTTCGC 
GTCGGAATCC 
GACATAAATC 
GGTGTGAAAT 
AACTGCTGCT 
CGGATAGCGT 
AACGTAAAAT 
CAGCGACCGT 
CGGGCCAGAT 
AAAGTGTTGG 
GTTTGATGAG 
TCAATACGGA 
AAGCAAGGAG 
CTATGAGTTG 
TGGATGTATT 
AAACCAATTG 
GATAATCACA 
CCGTTGAAAA 
CATGAATAGA 
ATGACGTATC 
GAAAAATACG 
AATGTGCGAA 
CTGTACTCTC 
CATGAATATG 
CAAGTCAGGT 
TCCCTTTTGT 
TCGTTGACAT 
TTGGGTTATT 
CTGTTTTTGT 
GTCAATAGCG 



TGTTTATGGA 
CTCGCTTTGG 
AACGCCAAAT 
AGAGATTTCC 
ATATTTCTAC 
AAGAGAATGG 
TGCCGATTAT 
CGATTGTGCG 
GGCAAGGTAC 
AGGCAAAGCG 
CTTATAGTCG 
TTGCTGCAAA 
TACGCACTAT 
CATTTCTTAT 
GTTAAAGCAT 
GGAAGGAGAG 
ATAATCAATT 
CCGCATACGG 
AACCATTAAA 
ATAAAAGTTG 
GGTAAAATCT 
AGCCGTTATA 
AAACCTTGCG 
GCCGGATAAA 
AAATCAACCA 
GAAGATGGAT 
AGGCTGCGAT 
ACATCGTTGC 
AATAAAGATG 
TAAGCCGGCG 
CGGACGAATT 



ID NO: 1141 




TGTGAGCAGG 


ATGAGGCAGA 


50 


CGTTATGGCA 


ACGATAATTA 


100 


CCGCGCCTTT 


CAATCAATGT 


150 


GCCTCTATTC 


TTCTTGCGCT 


200 


CCGAAGCATC 


GGCTGCAATC 


250 


GCATCACTGT 


CGACAATGTG 


300 


ACTATGATGC 


TAATTCTTAT 


350 


CTCTGTGGAA 


AAACATGATT 


400 


TCAGCGACAT 


GACAGTTGGT 


450 


GTTATTGAGC 


GGCTGCGAGG 


500 


CAGCCGAAGT 


ATAGAGGTAA 


550 


ATAGCGATAT 


CGTTACGCTT 


600 


ATTATCAGCC 


ACGAACAAAT 


650 


CAATACTGGG 


CGCGGTCCAC 


700 


TAGAAAACGG 


GAAACTGGGC 


750 


GAAGAGTTTT 


TCTACTCTGA 


800 


TTTACTTAAA 


CTTCAAAGAA 


850 


CCTATTATAC 


CGAGCAAGCG 


900 


AACTGTTTGG 


ATTTTGAAAG 


950 


CAATACTGTT 


TGGGGGTTGC 


1000 


GCAATAGAGA 


TAGCCGCTAA 


1050 


CATTGGAATT 


ACGAAATCTG 


1100 


CGGAATGGGA 


AAACGACAAT 


1150 


AAAATGCACG 


GATTACTTGT 


1200 


TGTTGATGTA 


GCATTTTCAG 


1250 


CCATACAAGG 


TCTGTTTGAA 


1300 


ATTCAAAGCT 


CAGCAATTTG 


1350 


GAAAAATGCT 


GGGATAGCTA 


1400 


ATAGGCCGGT 


GGCAGCTACG 


1450 


CGTTCAGGCT 


CATCCTTCGG 


1500 


GGACTACGCA 


ATTGAATCGG 


1550 
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CAAGACAATA TGACAGCAAA ATCTTAATTG AGCAGGCTGT TTCGGGCTGT 1600 

GAGGTCGGTT GTGCGGTATT GGGAAACAGT GCCGCGTTAG TTGTTGGCGA 1650 

GGTGGACCAA ATCAGGCTGC AGTACGGAAT CTTTCGTATT CATCAGGAAG 1700 

TCGAGCCGGA AAAAGGCTCT GAAAACGCAG TTATAACCGT TCCCGCAGAC 1750 

5 CTTTCAGCAG AGGAGCGAGG ACGGATACAG GAAACGGCAA AAAAAATATA 1800 

TAAAGCGCTC GGCTGTAGAG GTCTAGCCCG TGTGGATATG TTTTTACAAG 1850 

ATAACGGCCG CATTGTACTG AACGAAGTCA ATACTCTGCC CGGTTTCACG 1900 

TCATACAGTC GTTATCCCCG TATGATGGCC GCTGCAGGTA TTGCACTTCC 1950 

CGAACTGATT GACCGCTTGA TCGTATTAGC GTTAAAGGGG TGATAAGCAT 2000 

10 GGAAATAGGA TTTACTTTTT TAGATGAAAT AGTACACGGT GTTCGTTGGG 2050 

~ ACGCTAAATA TGCCACTTGG GATAATTTCA CCGGAAAACC GGTTGACGGT 2100 

TATGAAGTAA ATCGCATTGT AGGGACATAC GAGTTGGCTG AATCGCTTTT 2150 

GAAGGCAAAA GAACTGGCTG CTACCCAAGG GTACGGATTG CTTCTATGGG 2200 

ACGGTTACCG TCCTAAGCGT GCTGTAAACT GTTTTATGCA ATGGGCTGCA 2250 

15 CAGCCGGAAA ATAACCTGAC AAAGGAAAGT TATTATCCCA ATATTGACCG 2300 

AACTGAGATG ATTTCAAAAG GATACGTGGC TTCAAAATCA AGCCATAGCC 2350 

GCGGCAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTGAGCTT 24 00 

GTACCAATGG GGAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 2450 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 2500 

2 0 CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 2550 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 2600 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 2650 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 2700 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 2750 

2 5 GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 2800 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 2850 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 2900 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 2950 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 3000 

30 ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 3050 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 3100 

GAAGCACTGT TTTCTCAGGA AAAAGTCGAA TTTCAAAATT ATGATCAAAA 3150 

TCCCAAAGAA CATTTAGAAA ATAGTGGGAC TTCTGAAAAT ACCCAAGAGA 3200 

AAACAATTAC AGAAGAACAG GTTTATCAAG GAAATCTGCT ATTAATCAAT 3250 

35 AGTAAATATC CTGTTCGCCA AGAAAGTGTG AAGTCAGATA TCGTGAATTT 33 00 

ATCTAAACAT GACGAATTAA TAAATGGATA CGGGTTGCTT GATAGTAATA 3350 

TTTATATGTC AAAAGAAATA GCACAAAAAT TTTCAGAGAT GGTCAATGAT 34 00 

GCTGTAAAGG GTGGCGTTAG TCATTTTATT ATTAATAGTG GCTATCGAGA 34 50 

CTTTGATGAG CAAAGTGTGC TTTACCAAGA AATGGGGGCT GAGTATGCCT 3500 

4 0 TACCAGCAGG TTATAGTGAG CATAATTCAG GTTTATCACT AGATGTAGGA 3550 

TCAAGCTTGA CGAAAATGGA ACGAGCCCCT GAAGGAAAGT GGATAGAAGA 3600 

AAA TGCTTGG AAATACGGGT TCATTTTACG TTATCCAGAG GACAAAACAG 3650 

AGTTAACAGG AATTCAATAT GAACCATGGC ATATTCGCTA TGTTGGTTTA 3700 

CCACATAGTG CGATTATGAA AGAAAAGAAT TTCGTTCTCG AGGAATATAT 3750 

45 GGATTACCTA AAAGAAGAAA AAACCATTTC TGTTAGTGTA AATGGGGAAA 3800 

AATATGAGAT CTTTTATTAT CCTGTTACTA AAAATACCAC CATTCATGTG 3850 

CCGACTAATC TTCGTTATGA GATATCAGGA AACAATATAG ACGGTGTAAT 3900 

TGTGACAGTG TTTCCCGGAT CAACACATAC TAATTCAAGG AGGTAA 3946 

50 

2) INFORMATION FOR SEQ ID NO: 1142 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142 
GATCCTCTAA ATGATTCTCA GGTGG 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1143 

(i) SEQUENCE CHARACTERISTICS: 
- - (A) LENGTH: - 25~ bases . ....... 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear- 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143 
CAATTAGCTT AGCAATAGGT GTTGG 



2) INFORMATION FOR SEQ ID NO: 1144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144 
TGTYTTCCAA GGTTCAGCTC 



2) INFORMATION FOR SEQ ID NO: 1145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145 
AACATATTKG GTTGATAGGT 



2) INFORMATION FOR SEQ ID NO: 1146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146 
GGGATTACCT ATGCCAATAT GAT 23 

2) INFORMATION FOR SEQ ID NO: 1147 



(i) SEQUENCE CHARACTERISTICS: 
15. (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147 
AGCTGTGTTA GCVCGAACAT CTTG 24 

25 

2) INFORMATION FOR SEQ ID NO: 1148 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148 
4 0 GACTTTGTTT GGCGTGATAT 2 0 



45 



2) INFORMATION FOR SEQ ID NO: 1149 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149 

55 

TCCYACWATT TCTTTTTGWG 20 



60 2) INFORMATION FOR SEQ ID NO: 1150 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150 
TGATAATCAC ACCGCATACG 



2) INFORMATION FOR SEQ ID NO: 1151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151 
TGCTGTCATA TTGTCTTGCC 



2) INFORMATION FOR SEQ ID NO: 1152 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152 
ATAAAGATGA TAGGCCGGTG 



2) INFORMATION FOR SEQ ID NO: 1153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153 
CTCGTATGTC CCTACAATGC 
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2) INFORMATION FOR SEQ ID NO: 1154 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE :~ DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154 

15 

GTTTGAAGCA TATAGCCTCG 20 



20 2) INFORMATION FOR SEQ ID NO: 1155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155 

CAGTGCTTCA TTAACGTAGT C 21 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1156 



(i) SEQUENCE CHARACTERISTICS: \ 
(A) LENGTH: 23 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156 
GTTGAAATGC ATCACGAACA ATT 23 

2) INFORMATION FOR SEQ ID NO: 1157 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157 
AAGAACGTTT CAGTTAAGGA AAT 
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2) INFORMATION FOR SEQ ID NO: 1158 

(i) SEQUENCE CHARACTERISTICS: 
(A)_ LENGTH: 19 bases 
"(B) TYPE : Nucleic" acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear. 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158 
AAGAGGTAAT GTCTGTGGT 



2) INFORMATION FOR SEQ ID NO: 1159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159 
TGAAGGTTTG CCAGGTGA 



2) INFORMATION FOR SEQ ID NO: 1160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160 
CGTTTCTGTT AAAGAAATTA GAAG 



2) INFORMATION FOR SEQ ID NO: 1161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161 
TCCAGGTGAT AACGTTGG 18 

2) INFORMATION FOR "SEQ "ID NO': 1162 ---- - - -- 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162 
CAAGTCCGTG GAAATGCA 18 

25 

2) INFORMATION FOR SEQ ID NO: 1163 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163 
40 GTTGGTTTCA ACGTTAAGAA C 21 



2) INFORMATION FOR SEQ ID NO: 1164 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164 

55 

GGTTTCAACG TCAAGAAC 18 



60 2) INFORMATION FOR SEQ ID NO: 1165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165 
GGTTTCAACG TGAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1166 
ACGTTAAGAA TGTTTCTGTC AA 



2) INFORMATION FOR SEQ ID NO: 1167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1167 
GTTGGTTTCA ACGT 



2) INFORMATION FOR SEQ ID NO: 1168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1168 
GAACAATTGG TTGAAGGTGT 

639 
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2) INFORMATION FOR SEQ ID NO: 1169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii)~ MOLECULE TYPE": Genomic "DNA " " 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: SP-665 

(C) ACCESSION NUMBER: AF139883 



(xi) SEQUENCE DESCRIPTION: SEQ 

20 

ATGAACAAAC CAACGATTCT GCGCCTAATC 
CTTAAGCTTG GTTATCGCAG CCATTGTCTT 
ACTACGTTAG CAAGGCTCCT AGCCTATCCG 
ACTTCTAGTA AAATCTACGA CAATAAAAAT 

25 TTCTGAACGC CGCGTCAATG CCCAAGCTAA 
TTAAGGCAAT CGTTTCTATC GAAGACCATC 
ATTGATTCCA TCCGTATCCT GGGAGCTTTC 
TTCCCTCCAA GGTGGATCAA CTCTCACCCA 
ACTTTTCAAC CTCGACTTCC GACCAGACTA 

30 GCTTGGTTAG CGATTCAGTT AGAACAAAAA 
GACCTACTAT ATAAATAAGG TCTACATGTC 
AGACAGCAGC TCAAAACTAC TATGGTAAAG 
CCTCAGTTAG CCTTGCTGGC TGGAATGCCT 
CCCCTATTCA CATCCAGAAG CAGCCCAAGA 

35 CTGAAATGAA AAATCAAGGT TACATCTCTG 
GTCAATACAC CAATTACTGA TGGACTACAA 
TTACCCTGCT TACATGGATA ATTACCTCAA 
AACAAGAAAC TGGCTATAAC CTTCTAACTA 
AATGTAGACC AAGAAGCTCA AAAACATCTG 

4 0 TCAATACGTC TCTTACCCTG ACGATGATTT 
TAGATGTTTC AAATGGTAAA GTCATCGCCC 
GCAAGTAACG TTTCATTTGG TACCAACCAA 
CTGGGGTTCT GCTATGAAAC CAATCACCGA 
ACGGTGTTTA TGATTCCACT GCAACTATGG 

45 TATCCGGGAA CAAGCACACC TGTCTACAAC 
TAATATTACT CTGCAATATG CTCTTCAACA 
TTGAGACTTT GAATAAGGTC GGTCTAGATA 
GGTCTTGGTA TCGACTATCC AAGCATGCAT 
TAATACAACA GAATCTAATA AACAATACGG 

50 CTGCTGCTTA TGCTGCCTTT GCAAATGGTG 
TATATCCATA AAGTCGTCTT CAGTGATGGA 
TGTCGGAACT CGTGCCATGA AGGAAACGAC 
TGATGAAAAC AGTCTTGACT TATGGAACTG 
TGGCTTCCTC AAGCTGGTAA AACAGGAACC 

55 AGTTGAAAAC CACATCAAGA ACACTGGCTA 
TTGTTGGTTA TACTCGTAAG TATTCTATGG 
AATCGTTTAA CTCCTATCGT TGGAGATGGT 
TTATCGCTCA ATGATAACGT ATCTATCAGA 
GGACGATGCC AGACGGACTT TTCAGAAACG 

60 GGAGCTCGCC CAATATGGAC TGAACCCTCT 



ID NO: 1169 






vjLAI lAbbl 1 


50 


A b b L ub A bb A 


bl 1 1 1 1 1 1 LI 


100 


APAPTA AAPT 

Ab Ab 1 AAAL 1 


A PTTPPA APA 

Ab 1 1 bL AALA 


150 


r-i t\ APTPATTP 
LAAb i LA 1 1 b 


L 1 bAL 1 1 bbb 


200 


TPATATTPPP 

IbAi Al Ibbb 


A /""•A/* 1 A mmmpp 
AL Ab A 111 bb 


250 


bb 1 1L1 1 LbA 


ppapa r*r*r*c*r* 
L L AC Abbbbb 


300 


1 IbLbLAAlL 


rpripiv TV A PTA A 

1 bb AAAb 1 AA 


350 


^papTTPaTT 

Ab Ab 1 1 bA 1 1 


A APTTPAPTT 
AAvj 1 1 bAL 1 1 


400 


1 1 lblbblAA 


PPPTPAPPAA 
bbb 1 bAbbAA 


450 


bLAALLAnnL 


ft PP ft ft ft TPTT 
ALjo AAA 1LX 1 


500 


1 AA 1 oLjL^AAL. 


T ft TP (1. ft ft TTiP 


550 


ft PPTP ft ft T ft ft 


± 1 IAAVjX 1 In 


600 


P ft dClC ft PP Zi ft 


nLLnft 1 A 1 oA 


650 


PPf^PPfJAAAP 


X 1 \3\J 1 L 1 Inl 


700 


PTftAAPAfTTA 


TOAOAAAfrPA 


750 


A^TPTPAAAT 


PAHPAAftTAA 


800 


pPAftriTPATP 


AATPAAfiTAO 


850 


CTGGGATGGA 


TGTTTACACA 


900 


TGGGATATCT 


ACAACTCCGA 


950 


GCAAGTCGCA 


TCTACGGTCG 


1000 


AACTTGGAGC 


TCGTCACCAA 


1050 


GCTGTGGAAA 


CCAATCGTGA 


1100 


TTATGCACCT 


GCCATAGAAT 


1150 


TTAATGATAT 


TCCTTATAAC 


1200 


TGGGATAGAG 


CATATTTCGG 


1250 


ATCACGAAAT 


GTCACAGCCG 


1300 


GAGCTAAAAC 


CTTCCTTAAT 


1350 


TATGCAAACG 


CCATTTCAAG 


1400 


AGCAAGTAGT 


GAAAAAATGG 


1450 


GCACTTACTA 


TAAACCAATG 


1500 


AGTAAAAAAG 


AGTTCTCTAA 


1550 


AGCCTATATG 


ATGACCGACA 


1600 


GGCGTGGAGC 


CTATCTTCCT 


1650 


TCTAACTATA 


CAGATGAGGA 


1700 


TGTAGCTCCA 


GATGAAATGT 


1750 


CTGTATGGAC 


AGGTTATTCG 


1800 


TTCCTAGTTG 


CAGCTAAAGT 


1850 


AGATACTCAT 


CCAGAAGACT 


1900 


GGGAATTTGT 


ATTCAAAAAT 


1950 


ACTCAACAAT 


CCTCAACAGC 


2000 
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TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 



PCT/CA00/01150 

AACTTCACAG TCTAGCTCAA 2050 
CCAATCCTAA CAATAATACG 2100 
AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 1170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) ~ TYPE: "Nucleic acid" 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear. 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170 
ACGAATTGGA CTACGCAATT 



2) INFORMATION FOR SEQ ID NO: 1171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171 
ACGAGGATGA TTTGATTGTC 



2) INFORMATION FOR SEQ ID NO: 1172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE. TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 64147 

(C) ACCESSION NUMBER: X13136 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172 

GATCCTCTAA ATGATTCTCA GGTGGCTGTT ATTGCCTCTA TTTCAAAGGA 50 

GATGCCTGGC ATTAGTATTT CTACTTCTTG GGATAGAAAG GTTTTGGAAA 100 

CTTCCCTTTC TTCTATAGTT GGGAGTGTAT CCAGTGAAAA AGCTGGTCTC 150 

CCAGCGGAAG AAGCAGAAGC CTATCTTAAA AAAGGCTATT CTCTAAATGA 200 

641 
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CCGTGTAGGA ACCTCCTATT TGGAAAAGCA ATATGAAGAG ACCTTACAAG 250 

GAAAACGCTC GGTAAAAGAA ATCCATCTGG ACAAATATGG CAACATGGAA 300 

AGCGTGGATA CAATTGAGGA AGGTAGTAAG GGAAACAATA TCAAGCTGAC 350 

CATTGATTTG GCCTTCCAAG ATAGCGTGGA TGCTTTGCTG AAAAGTTATT 400 

5 TCAATTCAGA GTTGGGAAAT GGTGGAGCCA AGTATTCTGA AGGTGTCTAT 450 

GCAGTCGCCC TTAACCCAAA AACAGGTGCT GTTTTGTCTA TGTCAGGAAT 500 

TAAACATGAC TTGAAAACAG GAGAGTTGAC GCCGGATTCC TTGGGAACGG 550 

TAACCAATGT CTTTGTCCCA GGTTCGGTTG TCAAAGCAGC GACCATCAGC 600 

TCTGGTTGGG AAAATGGAGT CTTGTCAGGA AATCAGACCT TGACAGACCA 650 

LO GTCCATTGTC TTTCAAGGTT CAGCTCCAAT TAATTCTTGG TATCCTGCCT 700 

TTTCTAGACC AATGCCGATT ACGGCGGTTC AGGCTCTAGA GTATTCATCC 750 

- AATGCTTATA- TGGTGGAAAC AGCGGTAGGT - CTTATGGGGC -AGACCTATCA " "800 

ACCCAATATG TTTGTCGGCA CCAGCAATCT AGAGTGTGCT ATGGGGAAAT 850 

TGCGTTCAAC CTTTGGTGAA TATGGTTTGG GTTCTGCGAC CGGAATTGAC 900 

15 CTACCAGATG AATCTACTGG ATTTGTTCCC . AAAGAGTATA GCTTTGCTAA 950 

TTTCATTACC AATGCCTTTG GGCAGTTTGA TAACTATACG CCGATGCAGT 1000 

TGGCTCAGTA TGTAGCAACT ATTGCAAATG ATGGTGTTCG TGTGGCTCCT 1050 

CGTATTGTTG AAGGCATTTA TGGTAATAAT GATAAGGGAG GACTGGGTGA 1100 

CTTGATTCAG CAACTGCAAC CGACAGAGAT GAATAAGGTC AATATATCCG 1150 

20 ACTCCGATAT GAGCATCTTG CACCAAGGTT TTTATCAGGT TGCCCATGGT 1200 

ACTAGTGAAT TGACAACTGG ACGTGCCTTT TCAAATGGCG CCTTGGTATC 1250 

CATTAGCGGA AAAACAGGTA CAGCCGAAAG CTATGTGGCA GATGGTCAGC 1300 

AAGCAACCAA TACCAATGCG GTGGCCTATG CCCCATCTGA TAATCCCCAA 1350 

ATCGCTGTAG CTGTTGTCTT CCCTCATAAC ACCAACCTTA CAAATGGTGT 1400 

25 CGGACCTTCC ATTGCGCGCG ATATTATCAA CCTCTATAAC CAACATCATC 1450 

CAATGAATTA GAAAGGAACA TATGCTTTAT CCAACACCTA TTGCTAAGCT 1500 

AATTGACAGT TATTCGAAGT TACCGGGTAT CGGGATTAAG ACGGCTACCC 1550 

GTTTGGCCTT 1560 



30 



2) INFORMATION FOR SEQ ID NO: 1173 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2007 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

45 (C) ACCESSION NUMBER: Z49096 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GAGTCCCGAT TGCTGAGGAT GCAACCTCCT 
50 GATGAGAACT ATAAGTCAGC AACGGGTAAG 

ACAATTTAAC AAGGTTGCAG AGGTCTTTCA 

AATCCTATGT AAGAGAGCAA CTCTCGCAAC 

TTTGGAGCAA AGGGAAATGG GATTACCTAT 

AAAAGAGTTG GAAACTGCAG AGGTCAAGGG 
55 CCAATCGTAG TTATCCAAAC GGACAATTTG 

GCTCAGCTCC ATGAAAATGA AGATGGCAGC 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT 

TTACCTATGA AAAGGATCGT CTGGGCAATA 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT 
60 TCCGCTACAA TCTTTCATGG AAACTCAGAT 

642 
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ATAATGTCTA 


TGCGGTCATT 


50 


ATTCTTTACG 


TAGAAAAAAC 


100 


TAAGTATCTG 


GACATGGAAG 


150 


CTAATCTCAA 


GCAAGTTTCC 


200 


GCCAATATGA 


TGACTATCAA 


250 


GATTGATTTT 


ACAACCAGTC 


300 


CTTCTAGTTT 


TATCGGTCTA 


350 


AAGAGCTTGC 


TGGGAACTTC 


400 


TGCAGGGACA 


GACGGCATTA 


450 


TTGTACCCGG 


AACAGAACTG 


500 


GTTTATACAA 


CATTGTCTAG 


550 


GGATGCCTTT 


CTAGAAAAAG 


600 
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TAAAAGGTAA 
ATTCTCGCTA 
AATCACTGAG 
AACCAGGATC 
AATACCTTCC 
GGATGCGACG 
GGATGATGAC 
AGTCTACTTG 
ACGCTTTAAA 
CTGGTCAACT 
GGGCAAGGAA 
TATTGCTAAT 
ATGATACTAA 
AATCCTGTTT 
AGTTGGGACG 
AGCCAATTAT 
GCTCAAATCG 
TTATATTTTC 
TCTTGTATGT 
GGAGAATTTG 
TCTCAATCTT 
CTTCTTATGC 
GAAGCCTTAC 
AAAGATTAAA 
AACAAGTTCT 
AGCTGGAAAA 
ACTGGAATTT 
CTAATACAGC 
TAATATG 
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GTATATGACC 
CCACCCAACG 
GACTTTGTTT 
AGCCTTTAAG 
CAAGTGGAGA 
ACTCGAGATT 
TTTCTCACAA 
AACAAAAAAT 
TTTGGGGTTC 
TCCAGCTGAT 
TTTCAGTGAC 
GATGGAGTTA 
CAATCAGTCT 
CCAAAGAGGC 
GACCCTCTAT 
AACAGTTCCT 
CTGATGAGAA 
TCAGTTGTGA 
AACGGTTCAA 
CCACCCCAAT 
CAATCTCCAG 
AATGCCTAGC 
GCCGAAATAT 
GAGACTTCTG 
CCTTTTATCG 
AAGAGACTGC 
GAAGGTTCAG 
TATCAAAAAC 



GCGACCTTGG 
ACCTACCTTT 
GGCGTGATAT 
GTCATGATGT 
ATACTTCAAT 
GGGATGTTAA 
GGTTTCGCTC 
GGGAGATGCT 
CAACTCGCTT 
AATATTGTTA 
ACAAACACAA 
TGCTGGAGCC 
GTACGTAAGT 
AGCAAGCACA 
ATGGAACTAT 
GGACAAAATG 
AAATGGAGGA 
CTATGAATCC 
CAGCCTGAGC 
CTTGGAGCGG 
CCAAAAATTT 
ATCAAGGATA 
TGTGCAACCA 
TAGAAGAAGG 
GATAAGGTAG 
CGAGACCTTT 
GTTCCGTTGT 
ATTAAAAAAA 



TCAGTGCAAA 
AATGCAGATA 
TCTTTATCAA 
TAGCTTCTTC 
AGCAGTGAAT 
TGAGGGTTTG 
ACTCCAGTAA 
ACTTGGTTGG 
TGGCTTGACA 
GTATTGCTCA 
ATGCTTCGTG 
AAAATTTATA 
CACAAAAAGA 
ACTCGAAATC 
GTATAATCAC 
TAGCAGTTAA 
TACTTGGTTG 
TGCTGAAAAT 
ATTATTCAGG 
GCTTCAGCTA 
AGATAAAGTT 
TTTCACCTGG 
ATCGTTGTAG 
GACCAATCTT 
AAGAAATTCC 
GCTAAATGGT 
TCAGAAGCAA 
TTAAATTAAC 



GACCGGTGAA 
CTAAAGAAGG 
AGTAACTATG 
TATTGATAAT 
TCAAAATAGC 
ACTACTGGTG 
TGTTGGAACG 
ATTATCTAAA 
GATGAATACG 
AAGCTCATTT 
CCTTTACAGC 
AGTGCTATTT 
AATAGTAGGA 
ACATGATCTT 
TACACAGGAA 
ATCCGGTACG 
GTTCTACCAA 
CCTGATTTTA 
TATCCAGTTG 
TGAAAGAATC 
ACGACAGAAT 
TGAGTTGGCG 
GTACTGGAAC 
GCACCAAACC 
AGACATGTAT 
TGGATATTGA 
GATGTTCGGA 
TTTAGGAGAC 



650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
"1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1700 
1750 
1800 
1850 
1900 
1950 
2000 
2007 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1174 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174 
GAACGTGGTG AAGTTCGC 



18 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 1175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175 

643 
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5 2) INFORMATION FOR SEQ ID NO: 1176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D-) TOPOLOGY: Linear _ _ 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ . ID NO: 1176 

TACTGGTGTA GAAATGTTC 19 



20 



2) INFORMATION FOR SEQ ID NO: 1177 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



30 



35 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177 
GCTCAACAAG TTCCAGATTA 20 

2) INFORMATION FOR SEQ ID NO: 1178 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 24 56 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: NCTC8325 

50 (C) ACCESSION NUMBER: X52593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1178 

ATGAACTGAT TATACTTAAC ATTAAAAAAG ATGATAACAC CTTCTACACC 50 

55 TCCATATCAC AAAAAATTAT AACATTATTT TGACATAAAT ACTACATTTG 100 

TAATATACTA CAAATGTAGT CTTATATAAG GAGGATATTG ATGAAAAAGA 150 

TAAAAATTGT TCCACTTATT TTAATAGTTG TAGTTGTCGG GTTTGGTATA 200 

TATTTTTATG CTTCAAAAGA TAAAGAAATT AATAATACTA TTGATGCAAT 250 

TGAAGATAAA AATTTCAAAC AAGTTTATAA AGATAGCAGT TATATTTCTA 300 

60 AAAGCGATAA TGGTGAAGTA GAAATGACTG AACGTCCGAT AAAAATATAT 350 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AATAGTTTAG 
AGTATCTAAA 
ACTACGGTAA 
GGTATGTGGA 
5 GAAAGACCAA 
TTTTAGACCG 
TTAGGCATCG 
TAAAGAACTA 
TTGGGTACAA 

LO GATGAATATT 
AACAGAAAGT 
GTTATGTTGG 
GGCTATAAAG 
CGATAAAAAG 

L5 ATAATAGCAA 
GGCAAAGATA 
TAACAACATG 
CAGGTGAATT 
TTTATGTATG 

20 AAAAGAACCT 
CTCAAAAAAT 
GATAAAACAA 
TTGGGGTGGT 
ACTTAAAACA 

25 GCACTCGAAT 
TGTTGGTGAA 
CAAACAAAAA 
CAAGGTGAAA 
ATTAGAAAAT 

30 AAAACAAAGT 
TTAAATGATG 
TTATAGATCT 
AAATGAAACA 
GATAAAGATA 

35 AGATAAAGGA 
ATGAGCTATA 
AACAGTGAAG 
TATTAATAAG 
TTGTATGTTG 

30 TTTCTTATTT 
AGTTTAATAA 
TCCGTATTTA 
GCATGC 



45 



50 



55 



60 



GCGTTAAAGA 
AATAAAAAAC 
CATTGATCGC 
AGTTAGATTG 
AGCATACATA 
AAACAATGTG 
TTCCAAAGAA 
AGTATTTCTG 
GATGATACCT 
TAAGTGATTT 
CGTAACTATC 
TCCCATTAAC 
ATGATGCAGT 
CTCCAACATG 
TACAATCGCA 
TTCAACTAAC 
AAAAATGATT 
ATTAGCACTT 
GCATGAGTAA 
CTGCTCAACA 
ATTAACAGCA 
GTTATAAAAT 
TACAACGTTA 
AGCAATAGAA 
TAGGCAGTAA 
GATATACCAA 
TTTAGATAAT 
TACTGATTAA 
AATGGCAATA 
TTGGAAGAAA 
GTATGCAACA 
TATGCAAACT 
AGGAGAAAGT 
ATCCAAACAT 
ATGGCTAGCT 
TGAGAACGGT 
CAATCCGTAA 
TGCTGTTACT 
AAAGTGACAC 
GTAATTTCAG 
ATTTAACGTT 
CCTTCTTCTA 



TATAAACATT 
GAGTAGATGC 
AACGTTCAAT 
GGATCATAGC 
TTGAAAATTT 
GAATTGGCCA 
TGTATCTAAA 
AAGACTATAT 
TCGTTCCACT 
CGCAAAAAAA 
CTCTAGAAAA 
TCTGAAGAAT 
TATTGGTAAA 
AAGATGGCTA 
CATACATTAA 
TATTGATGCT 
ATGGCTCAGG 
GTAAGCACAC 
CGAAGAATAT 
AGTTCCAGAT 
ATGATTGGGT 
CGATGGTAAA 
CAAGATATGA 
TCATCAGATA 
GAAATTTGAA 
GTGATTATCC 
GAAATATTAT 
CCCAGTACAG 
TTAACGCACC 
AATATTATTT 
AGTCGTAAAT 
TAATTGGCAA 
GGCAGACAAA 
GATGATGGCT 
ACAATGCCAA 
AATAAAAAAT 
CGATGGTTGC 
TCTCCCTTAA 
TGTAACGAGT 
CGATAACGTA 
ATTCATTTGT 
CCCATAATTT 



CAGGATCGTA 
TCAATATAAA 
TTAATTTTGT 
GTCATTATTC 
AAAATCAGAA 
ATACAGGAAC 
AAAGATTATA 
CAACAACAAA 
TTAAAACCGT 
TTTCATCTTA 
AGCGACTTCA 
TAAAACAAAA 
AAGGGACTCG 
TCGTGTCACA 
TAGAGAAAAA 
AAAGTTCAAA 
TACTGCTATC 
CTTCATATGA 
AATAAATTAA 
TACAACTTCA 
TAAATAACAA 
GGTTGGCAAA 
AGTGGTAAAT 
ACATTTTCTT 
AAAGGCATGA 
ATTTTATAAT 
TAGCTGATTC 
ATCCTTTCAA 
TCACTTATTA 
CCAAAGAAAA 
AAAACACATA 
ATCCGGTACT 
TTGGGTGGTT 
ATTAATGTTA 
AATCTCAGGT 
ACGATATAGA 
TTCACTGTTT 
ATACAATTTC 
CCATTTTCTT 
CAATGTATTA 
GTTCCTGCTA 
AAATGATATT 
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AAATAAAAAA 400 

ATTAAAACAA 450 

TAAAGAAGAT 500 

CAGGAATGCA 550 

CGTGGTAAAA 600 

ACATATGAGA 650 

AAGCAATCGC 700 

TGGATCAAAA 750 

TAAAAAAATG 800 

CAACTAATGA 850 

CATCTATTAG 900 
AGAATATAAA " 950 

AAAAACTTTA 1000 

ATCGTTGACG 1050 

GAAAAAAGAT 1100 

AGAGTATTTA 1150 

CACCCTCAAA 1200 

CGTCTATCCA 1250 

CCGAAGATAA 1300 

CCAGGTTCAA 1350 

AACATTAGAC 1400 

AAGATAAATC 1450 

GGTAATATCG 1500 

TGCTAGAGTA 1550 

AAAAACTAGG 1600 

GCTCAAATTT 1650 

AGGTTACGGA 1700 

TCTATAGCGC 1750 

AAAGACACGA 1800 

TATCAATCTA 1850 

AAGAAGATAT 1900 

GCAGAACTCA 1950 

TATATCATAT 2000 

AAGATGTACA 2050 

AAAGTGTATG 2100 

TGAATAACAA 2150 

TATTATGAAT 2200 

TTCATTTTCA 2250 

TTTTTATGGA 2300 

CCTGGTATAC 2350 

CAACTTCTTC 2400 

GAAAGTGTAT 2450 
2456 



2) INFORMATION FOR SEQ ID NO: 1179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179 
ATTTGGTGAC GGGTGACTTT 
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2) INFORMATION FOR SEQ ID NO: 1180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180 
15 TCCACCGTTG CCAATCGCA 19 



2 ) INFORMATION FOR SEQ ID NO: 1181 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181 

30 

AGCAGCTTAC TAGATGCCGT 20 



35 2) INFORMATION FOR SEQ ID NO: 1182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182 

AACTGCAAGA GATCCTTTGG 20 



2) INFORMATION FOR SEQ ID NO: 1183 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2535 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 175 

(C) ACCESSION NUMBER: M18729 



(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1183 

ATGGCGATAG AAAAGCTATC ACCCGGCATG CAACAGTATG TGGATATTAA 50 

10 AAAGCAATAT CCAGATGCTT TTTTGCTCTT TCGGATGGGT GATTTTTATG 100 

AATTATTTTA TGAGGATGCG GTCAATGCTG CGCAGATTCT GGAAATTTCC 150 

TTAACGAGTC GCAACAAGAA TGCCGACAAT CCGATCCCTA TGGCGGGTGT 200 

TCCCTATCAT TCTGCCCAAC AGTATATCGA TGTCTTGATT GAGCAGGGTT 250 

ATAAGGTGGC TATCGCAGAG CAGATGGAAG ATCCTAAACA AGCAGTTGGG 300 

15 GTTGTTAAAC GAGAGGTTGT TCAGGTCATT . ACGCCAGGGA CAGTGGTCGA 350 

TAGCAGTAAG CCGGACAGTC AGAATAATTT TTTGGTTTCC ATAGACCGCG 400 

AAGGCAATCA ATTTGGCCTA GCTTATATGG ATTTGGTGAC GGGTGACTTT 450 

TATGTGACAG GTCTTTTGGA TTTCACGCTG GTTTGTGGGG AAATCCGTAA 500 

CCTCAAGGCT CGAGAAGTGG TGTTGGGTTA TGACTTGTCT GAGGAAGAAG 550 

20 AACAAATCCT CAGCCGCCAG ATGAATCTGG TACTCTCTTA TGAAAAAGAA 600 

AGCTTTGAAG ACCTTCATTT ATTGGATTTG CGATTGGCAA CGGTGGAGCA 650 

AACGGCATCT AGTAAGCTGC TCCAGTATGT TCATCGGACT CAGATGAGGG 700 

AATTGAACCA CCTCAAACCT GTTATCCGCT ACGAAATTAA GGATTTCTTG 750 

CAGATGGATT ATGCGACCAA GGCTAGTCTG GATTTGGTTG AGAATGCTCG 800 

25 CTCAGGTAAG AAACAAGGCA GTCTTTTCTG GCTTTTGGAT GAAACCAAAA 850 

CGGCTATGGG GATGCGTCTC TTGCGTTCTT GGATTCATCG CCCCTTGATT 900 

GATAAGGAAC GAATCGTCCA ACGTCAAGAA GTAGTGCAGG TCTTTCTCGA 950 

CCATTTCTTT GAGCGTAGTG ACTTGACAGA CAGTCTCAAG GGTGTTTATG 1000 

ACATTGAGCG CTTGGCTAGT CGTGTTTCTT TTGGCAAAAC CAATCCAAAG 1050 

30 GATCTCTTGC AGTTGGCGAC TACCTTGTCT AGTGTGCCAC GGATTCGTGC 1100 

GATTTTAGAA GGGATGGAGC AACCTACTCT AGCCTATCTC ATCGCACAAC 1150 

TGGATGCAAT CCCTGAGTTG GAGAGTTTGA TTAGCGCAGC GATTGCTCCT 1200 

GAAGCTCCTC ATGTGATTAC AGATGGGGGA ATTATCCGGA CTGGATTTGA 1250 

TGAGACTTTA GACAAGTATC GTTGCGTTCT CAGAGAAGGG ACTAGCTGGA 1300 

35 TTGCTGAGAT TGAGGCTAAG GAGCGAGAAA ACTCTGGTAT CAGCACGCTC 1350 

AAGATTGACT ACAATAAAAA GGATGGCTAC TATTTTCATG TGACCAATTC 1400 

GCAACTGGGA AATGTGCCAG CCCACTTTTT CCGCAAGGCG ACGCTGAAAA 1450 

ACTCAGAACG CTTTGGAACC GAAGAATTAG CCCGTATCGA GGGAGATATG 1500 

CTTGAGGCGC GTGAGAAGTC AGCCAACCTC GAATACGAAA TATTTATGCG 1550 

4 0 CATTCGTGAA GAGGTCGGCA AGTACATCCA GCGTTTACAA GCTCTAGCCC 1600 

AAGGAATTGC GACGGTTGAT GTCTTACAGA GTCTGGCGGT TGTGGCTGAA 1650 

ACCCAGCATT TGATTCGACC TGAGTTTGGT GACGATTCAC AAATTGATAT 1700 

CCGGAAAGGG CGCCATGCTG TCGTTGAAAA GGTTATGGGG GCTCAGACCT 1750 

ATATTCCAAA TACGATTCAG ATGGCAGAAG ATACCAGTAT TCAATTGGTT 1800 

45 ACAGGGCCAA ACATGAGTGG GAAGTCTACC TATATGCGTC AGTTAGCCAT 1850 

GACGGCGGTT ATGGCCCAGC TGGGTTCCTA TGTTCCTGCT GAAAGCGCCC 1900 

ATTTACCGAT TTTTGATGCG ATTTTTACCC GTATCGGAGC AGCAGATGAC 1950 

TTGGTTTCGG GTCAGTCAAC CTTTATGGTG GAGATGATGG AGGCCAATAA 2000 

TGCCATTTCG CATGCGACCA AGAACTCTCT CATTCTCTTT GATGAATTGG 2050 

50 GACGTGGAAC TGCAACTTAT GACGGGATGG CTCTTGCTCA GTCCATCATC 2100 

GAATATATCC ATGAGCACAT CGGAGCTAAG ACCCTCTTTG CGACCCACTA 2150 

CCATGAGTTG ACTAGTCTGG AGTCTAGTTT ACAACACTTG GTCAATGTCC 2200 

ACGTGGCAAC TTTGGAGCAG GATGGGCAGG TCACCTTCCT TCACAAGATT 2250 

GAACCGGGAC CAGCTGATAA ATCCTACGGT ATCCATGTTG CCAAGATTGC 2300 

55 TGGCTTGCCA GCAGACCTTT TAGCAAGGGC GGATAAGATT TTGACTCAGC 2350 

TAGAGAATCA AGGAACAGAG AGTCCTCCTC CCATGAGACA AACTAGTGCT 2400 

GTCACTGAAC AGATTTCACT CTTTGATAGG GCAGAAGAGC ATCCTATCCT 2450 

AGCAGAATTA GCTAAACTGG ATGTGTATAA TATGACACCT ATGCAGGTTA 2500 

TGAATGTCTT AGTAGAGTTA AAACAGAAAC TATAA 2535 
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2) INFORMATION FOR SEQ ID NO: 1184 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:"" - -- - - 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1184 



15 



TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

2 0 GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

25 GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

30 AAACCAATCC AAAGGATCTC TTG 623 



2) INFORMATION FOR SEQ ID NO: 1185 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1185 

50 TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

55 GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

60 CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 
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CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 
AAACCAATCC AAAGGATCTC T 
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2) INFORMATION FOR SEQ ID NO: 1186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS.: .Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(XV) SEQUENCE DESCRIPTION: SEQ ID NO: 1186 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT .150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

AAACCAATCC AAAGGATCTC TT 622 



2) INFORMATION FOR SEQ ID NO: 1187 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1187 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCGGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 
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GTTGAGAATG CTCGCTCAGG 
GGATGAAACC AAAACGGCTA 
ATCGCCCCTT GATTGATAAG 
CAGGTCTTTC TCGACCATTT 
CAAGGGTGTT TATGACATTG 
AAACCAATCC AAAGGATCTC 



TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 
TGGGGATGCG TCTCTTGCGT TCTTGGATTC 
GAACGAATCG TCCAACGTCA AGAAGTAGTG 
CTTTGAGCGT AGTGACTTGA CAGACAGTCT 
AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 
TT 
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2 ) INFORMATION FOR SEQ ID NO: 1188 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1188 

GGGTGACTTT TATGTAACGG GGCTATTGGA TTTCACGTTG GTTTGTGGGG 50 

AAATTCGCAA TCTCAAGGCT AGAGAAGTGG TGCTGGGTTA TGACTTGTCT 100 

GAGGAAGAAG AACAAATCCT CAGTCGTCAG ATGAATCTGG TGCTTTCTTA 150 

TGAGAAGGAA GGCTTTGAGG ACCTTCATTT ACTGGATCCA CGACTGGCAG 200 

CTGTGGAGCA AGCGGCAGCT AGTAAGCTCC TCCAGTATGT TCACCGGACC 250 

CAGATGCGGG AATTGAACCA CCTCAAACCA GTTATCCGCT ATGAAATCAA 300 

AGATTTCTTA CAGATGGACT ATGCGACCAA GGCTAGTCTG GATTTGGTTG 350 

AGAATGCCCG TTCAGGCAAG AAGCAAGGCA GTCTTTTCTG GCTTTTAGAT 4 00 

GAAACCAAGA CGGCTATGGG AATGCGTCTC TTGCGTTCTT GGATTCATCG 450 

TCCTTTGATT GATAAGGAGC GAATCGTCCA GCGTCAAGAG GTGGTGCAGG 500 

TCTTTCTTGA CCACTTCTTT GAGCGTAGTG ATTTAACGGA CAGTCTTAAG 550 

GGTGTTTATG ATATCGAACG CTTGGCTAGT CGGGTTTCTT TTGGCAAGA 599 



2) INFORMATION FOR SEQ ID NO: 1189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 1189 

GGTGACGGGT GACTTTTATG TGACAGGTCT TTTGGATTTT ACGCTGGTTT 50 

GTGGGGAAAT CCGCAATCTC AAGGCTCGAG AAGTGGTGCT GGGTTATGAC 100 

TTGTCTGAGG AAGAAGAACA GATCCTTAGT CGTCAGATGA ATCTGGTACT 150 

TTCCTATGAA AAAGAAGGCT TTGAAGACCT TCATTTACTG GATTCACGAT 200 
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TGGCAGCTGT GGAGCAAGCG 
CGGACTCAGA TGAGGGAATT 
AATCAAAGAT TTTTTGCAGA 
TGGTTGAGAA TGCCCGTTCA 
TTGGATGAAA CCAAAACAGC 
TCATCGCCCC CTGATTGATA 
TGCAGGTCTT TCTCGACCAT 
CTCAAGGGTG TTTATGACAT 
CAAAACCAAT CCAAAGGATC 



GCATCTAGTA AACTGCTTCA GTATGTTCAT 
GAACCACCTC AAGCCTGTTA TCCGCTATGA 
TGGATTATGC GACCAAGGCT AGTCTGGATT 
GGCAAGAAGC AAGGTAGTCT TTTTTGGCTT 
TATGGGAATG CGTCTCTTGC GGTCTTGGAT 
AGGAACGAAT TGTCCAACGC CAAGAAGTTG 
TTCTTTGAGC GTAGTGATTT GACAGACAGT 
TGAGCGCTTG GCTAGTCGTG TTTCTTTTGG 
TCTT 
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2) INFORMATION FOR SEQ ID NO: 1190 

(i) SEQUENCE CHARACTERISTICS: , 
v (A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGT CTTGGTCTGC 50 

GGGGAAATCC GCAATTTGAA AGCTAGGGAA GTGGTGCTGG GCTATGCCTT 100 

GCCAGAAGCT GAGGAGCAGG TTTTGGCTGG ACAGATGAAC CTTTTACTGT 150 

CCTATGTGGA GAAGGTTTTG GAGGATGTTC AGCTGCTGGG CGAGGAGCTG 200 

TCTCCTATGG AGCGTCAGGC AGCAGGGAAA CTGCTGGAGT ATGTGCACCG 250 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCTCAG CATTATGAAA 300 

TCAAGGACTT CCTGCAAATG GACTATGCCA CCAAGGCGAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG CAAGAAGCAC GGCAGTCTTT ATTGGCTGAT 400 

GGACGAGACT AAGACGGCCA TGGGCGGCCG CATGCTGCGC TCTTGGATCC 450 

AGCGTCCGCT GATTGATGAA GCGCGAATTA GCCAGCGACA GAATGTCGTT 500 

GAGGTTTTTC TGGATCATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

CAAGGGGGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGC 599 



2) INFORMATION FOR SEQ ID NO: 1191 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1191 
TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGC CCTGGTCTGC 50 
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GCCAGAAGCT GAGGAGCAGG TCTTGGCTGG 
CCTATGTACA GACGGCCTTG GACGATGTCC 
TCTCCTATGG AGCGTCAGGC AGCGGGGAAA 
5 GACCCAGATG AGGGAGCTCA GCCATTTGAA 
TCAAGGACTT TCTGCAAATG GATTATGCTA 
ACAGAAAATG CTCGCTCGGG TAAGAAACAC 
GGACGAGACC AAGACGGCCA TGGGCGGCCG 
AGCGTCCGTT GATTGATGAA GTGCGAATTA 
10 GAGGTTTTTC TGGAACATTT CTTTGAGCGG 

CAAGGGAGTC TATGATATCG AGCGGCTGGC 

AGACCAATCC AAAGGATCTC TT 



PCT/CA00/01150 

GTGGTGCTGG GCTATGCTTT 100 

ACAGATGAAT CTTTTGCTGT 150 

AGCTGCTGGG CGAGGAACTG 200 

TTGCTAGAGT ATGTGCACCG 250 

GAAGGCCCAG CATTATGAAA 300 

CCAAGGCGAG TCTGGATTTG 350 

GGCAGTCTTT ATTGGCTGAT 400 

TATGCTGCGC TCTTGGATCC 450 

GCCAGCGGCA GAATGTCGTC 500 

AGTGATTTGA CGGAGAGCCT 550 

TAGTCGGGTG TCTTTTGGCA 600 

622 
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2) INFORMATION FOR SEQ ID NO: 1192 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192 
GGTAAAACAG GAACCTCTAA CT 22 

2 ) INFORMATION FOR SEQ ID NO: 1193 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193 
GGTAAGACAG GTACTTCTAA CT 22 

45 

2) INFORMATION FOR SEQ ID NO: 1194 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194 
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5 2) INFORMATION FOR SEQ ID NO: 1195 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

. CD). TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ 'ID .NO: 1195 

CATTTCAAGT AACACAACTG AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1196 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196 
GCCATTTCAA GTAATACAAC AGAA 24 

2) INFORMATION FOR SEQ ID NO: 1197 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197 
CAAACGCCAT TTCAAGTAAT ACAAC 25 

50 



2) INFORMATION FOR SEQ ID NO: 1198 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198 

AACGGGCGTC TCGATAGAAA AACACGTGAA AATCCCAATG ATTATAAACA 50 

ATCAATATAC GATTTTGCTG AAGCTGTAAC AAAAGGTATT AAGGAACAAA 100 

CAAATAAAAA TTAATAGGCA. ACTTAACCAG AATCGTTAAA ACTATATGAC 150 

GATTCTGGTT TTTTAAATTC AAAAAGTTTT CTAAAAAATT TACTTGCTTC 2 00 

TTTAAAGTAT AGGTATGAAA TACAATTGAT TAAAATAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300 

TACAACAACG CTGATTGGCG CGTTACTATT ATATTTGCCA ATCACAKGTA 3 50 

AACATCCTAT TGATTTTGTG GACGCCCGTT A 381 



2) INFORMATION FOR SEQ ID NO: 1199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199 
GTATTAAAGA AGATATCCAA AAAGC 



2) INFORMATION FOR SEQ ID NO: 1200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1200 
TCAAAGAAGA AACTAAAAAA GCTGT 



2) INFORMATION FOR SEQ ID NO: 1201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201 
AACGTAGGTG TCCTTCTTC 
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2 ) INFORMATION FOR SEQ ID NO: 1202 

(i) SEQUENCE CHARACTERISTICS: 
" ' (A) LENGTH: 22 bases - - - 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202 
GTGTTGAAAT GTTCCGTAAA CA 



2) INFORMATION FOR SEQ ID NO: 1203 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1203 
GGIGARMGIG GIAAYGARAT G 



2) INFORMATION FOR SEQ ID NO: 1204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1204 
GCIAAYAACI TCIWMYATGC C 



2) INFORMATION FOR SEQ ID NO: 1205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

655 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CA00/01150 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205 
AAYACITCIA WYATGCCIGT 20 



2) INFORMATION FOR SEQ" ID NO : 1206 " 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: > Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206 
CKISRIGTIG ARTCIGCCA 19 

25 

2) INFORMATION FOR SEQ ID NO: 1207 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 07 
4 0 CCITCITCWC CIGGCATYTC 20 



2) INFORMATION FOR SEQ ID NO: 1208 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208 

55 

TCAAAAAGTT TTCTAAAAAA TTTAC 25 



60 2) INFORMATION FOR SEQ ID NO: 1209 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209 

ACGGGCGTCC ACAAAATCAA TAGGA 25 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1210 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210 
ACCAGCTTGC CCAATACAAA GG 22 

2) INFORMATION FOR SEQ ID NO: 1211 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211 
ATTCTTGTAA CAGGCTTTGA TCCC 24 

45 

2) INFORMATION FOR SEQ ID NO: 1212 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212 

60 CCICCIRGIG GIGAIACIGC WCC 23 
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2) INFORMATION FOR SEQ ID NO: 1213 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
L0 (D) TOPOLOGY: Linear 

(ii)" MOLECULE "TYPE: - DNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213 

15 

AARGGIGGIA CIGCIGCIAT HCCIGG 26 



20 2) INFORMATION FOR SEQ ID NO: 1214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214 

GGTAAAACAG GTACCTCTAA CTA 23 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1215 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1337 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: D471 

(C) ACCESSION NUMBER: X65717 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1215 



AACAAAATAA AAGAACTTAC CTATTTTCCA TCCAAAATGT TTAGCAATCA 50 

TCATCTGCAA GGCAACGTAT TGCATGGCAT TGATGTGATG AGCAACTAAT 100 

55 ATGTCATTAG AACGTTGCGT CAAACTAGCA TCTAAATAAA GATCGAAATG 150 

CAGTTATCAA AAATGCAAGC TCCTATCGGC CCTTGTTTTA ATTATTACTC 2 00 

ACATTGCCTT AATGTATTTA CTTGCTTATT ATTAACTTTT TTGCTAAGTT 2 50 

AGTAGCGTCA GTTATTCATT GAAAGGACAT TATTATGAAA ATTCTTGTAA 3 00 

CAGGCTTTGA TCCCTTTGGC GGCGAAGCTA TTAATCCTGC CCTTGAAGCT 350 

60 ATCAAGAAAT TGCCAGCAAC CATTCATGGA GCAGAAATCA AATGTATTGA 4 00 
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AGTTCCAACG 
AAAGCTTTCA 
ACTGGACTAA 
TCCTGATAAC 
5 GTAAAGCAGC 
ATTCATCAGG 
TGTTTGCAAT 
GTCCAAATGC 
GTTGTTGATA 

10 AGGAATTGAG 
ATTTAAAACG 
TATTTTAAAA 
. AATGAAAGAC 
AATCTAATAC 

15 AAACGTTGTT 
GTAATGGGAC 
AGGTGGCTAC 
GTTTGCCAAA 
AAGCATTGAG 

20 
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GTTTTTCAAA 
ACCTGATGCA 
CGCCAGAACG 
GAAGGGAATC 
TTATTTTTCA 
CTGGGCTTCC 
CATTTGATGT 
CAAAGCTGGG 
AACCTAATAC 
GCTGCTATTT 
TGTAGGGGGC 
AAACAGAGAT 
CTAGGGAAAC 
TACTTCTAAC 
GATCTAAATG 
TTATCAAGAT 
CACCTTTCCA 
TATTGGCATA 
ATAGGGACAC 



AATCTGCCGA 
GTCCTTTGTA 
CGTTGCCATT 
AGCCTATTGA 
ACCTTGCCAA 
TGCTTCTGTT 
ATCAAGCCCT 
TTTATGCATA 
AGCTGCCATG 
TTGCCATTGT 
GCTACTCACT 
ATGAACTAAC 
TTTTCATCGG 
ATCAGCGGGT 
AGCAATCCCA 
TTAAGGCTTT 
GAATCAGGTT 
GAGATAACCA 
TTTCTATAGC 



TGTGCTCCAG 
TTGGGCAAGC 
AATCAAGACG 
TACACCTATT 
TCAAAGCGAT 
TCTAATACAG 
TTACTTAGTG 
TTCCCTTTAT 
AACCTCGATG 
CGATTTCAAA 
GACTGTGACG 
TCTGTTTTTT 
TCTTTCTCAA 
ATAGTTTGCC 
, TTCAAAACAT 
TAACAAAGGA 
GGAGTTTGAC 
TCTACATACT 
AACTAGT 



CAGCATATCG 
TGGTGGCCGG 
ATGCTCGCAT 
CGTGCAGATG 
GGTTGCTGCC 
CTGGTACCTT 
GATAAATATT 
GATGGAACAG 
ATATTACAAG 
GATCGTTCCG 
CTACTAAACC 
TTGTGCTAAA 
TTGTCATCTT 
AGTAATTAAG 
TAAGGTCAGG 
CTAATATCAT 
AATAGTATTG 
CTAATTCGTT 



450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1337 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1216 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216 
35 GGTAAGACTG GTACATCAAA CTA 



23 



2) INFORMATION FOR SEQ ID NO: 1217 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217 

50 

CAAATGCCAT TTCAAGTAAC ACAAC 25 



55 2) INFORMATION FOR SEQ ID NO: 1218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218 
CAAACGCCAT TTCAAGTAAC ACAAC 25 

2) INFORMATION FOR SEQ ID NO: 1219 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases - 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219 
CAAATGCTAT TTCAAGTAAT ACAAC 25 

2) INFORMATION FOR SEQ ID NO: 1220 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220 
CAAACGCCAT TTCAAGTAAT ACGAC 25 

40 

2) INFORMATION FOR SEQ ID NO: 1221 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221 
55 GAYACICCIG GICAYGTIGA YTT 23 



60 



2) INFORMATION FOR SEQ ID NO: 1222 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222 
ATYGAYACIC CIGGICAYGT IGAYTT 



2 ) INFORMATION FOR SEQ ID NO: 1223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223 
AYITCIARRT GIARYTCRCC CATICC 



2) INFORMATION FOR SEQ ID NO: 1224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224 
CCIGYIHTIY TIGARCCIAT IATG 



2) INFORMATION FOR SEQ ID NO: 1225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225 
TAICCRAACA TYTCISMIAR IGGIAC 
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20 



2) INFORMATION FOR SEQ ID NO: 1226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



- (ii) -MOLECULE .TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226 
15 GTIRMRTAIC CRAACATYTC 20 



2) INFORMATION FOR SEQ ID NO: 1227 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227 

30 

GTICCIYTIK CIGARATGTT YGGITA 26 



35 2) INFORMATION FOR SEQ ID NO: 1228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228 

GTICCIYTIK CIGARATGTT YGGITAYGC 29 

50 2) INFORMATION FOR SEQ ID NO: 1229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229 
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TCCATYTGIG CIGCICCIGT IATCAT 



26 



5 2) INFORMATION FOR SEQ ID NO: 1230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2145 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(D) ACCESSION NUMBER: X00415 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1230 

TGAACGCCTA AAAGATAAAC GAGGAAACAA ATGGCTCGTA CAACACCCAT 50 

CGCACGCTAC CGTAACATCG GTATCAGTGC GCACATCGAC GCCGGTAAAA 100 

CCACTACTAC CGAACGTATT CTGTTCTACA CCGGTGTAAA CCATAAAATC 150 

GGTGAAGTTC ATGACGGCGC TGCAACCATG GACTGGATGG AGCAGGAGCA 2 00 

GGAACGTGGT ATTACCATCA CTTCCGCTGC GACTACTGCA TTCTGGTCTG 250 

GTATGGCTAA GCAGTATGAG CCGCATCGCA TCAACATCAT CGACACCCCG 3 00 

GGGCACGTTG ACTTCACAAT CGAAGTAGAA CGTTCCATGC GTGTTCTCGA 3 50 

TGGTGCGGTA ATGGTTTACT GCGCAGTTGG TGGTGTTCAG CCGCAGTCTG 400 

AAACCGTATG GCGTCAGGCA AACAAATATA AAGTTCCGCG CATTGCGTTC 450 

GTTAACAAAA TGGACCGCAT GGGTGCGAAC TTCCTGAAAG TTGTTAACCA 500 

GATCAAAACC CGTCTGGGCG CGAACCCGGT TCCGCTGCAG CTGGCGATTG 550 

GTGCTGAAGA ACATTTCACC GGTGTTGTTG ACCTGGTGAA AATGAAAGCT 600 

ATCAACTGGA ACGACGCTGA CCAGGGCGTA ACCTTCGAAT ACGAAGATAT 650 

CCCGGCAGAC ATGGTTGAAC TGGCTAACGA ATGGCACCAG AACCTGATCG 700 

AATCCGCAGC TGAAGCTTCT GAAGAGCTGA TGGAAAAATA CCTGGGTGGT 750 

GAAGAACTGA CTGAAGCAGA AATCAAAGGT GCTCTGCGTC AGCGCGTTCT 800 

GAACAACGAA ATCATCCTGG TAACCTGTGG TTCTGCGTTC AAGAACAAAG 850 

GTGTTCAGGC GATGCTGGAT GCGGTAATTG ATTACCTGCC ATCCCCGGTT 900 

GACGTACCTG CGATCAACGG TATCCTGGAC GACGGTAAAG ACACTCCGGC 950 

TGAACGTCAC GCAAGTGATG ACGAGCCGTT CTCTGCACTG GCGTTCAAAA 1000 

TCGCTACCGA CCCGTTTGTT GGTAACCTGA CCTTCTTCCG TGTTTACTCC 1050 

GGTGTGGTTA ACTCTGGTGA TACCGTACTG AACTCCGTGA AAGCTGCACG 1100 

TGAGCGTTTC GGTCGTATCG TTCAGATGCA CGCTAACAAA CGTGAAGAGA 1150 

TCAAAGAAGT TCGCGCGGGC GACATCGCTG CTGCTATCGG TCTGAAAGAC 12 00 

GTAACCACTG GTGACACCCT GTGTGACCCG GATGCGCCGA TCATTCTGGA 1250 

ACGTATGGAA TTCCCTGAGC CGGTAATCTC CATCGCAGTT GAACCGAAAA 1300 

CCAAAGCTGA CCAGGAAAAA ATGGGTCTGG CTCTGGGCCG TCTGGCTAAA 1350 

GAAGACCCGT CTTTCCGTGT ATGGACTGAC GAAGAATCTA ACCAGACCAT 14 00 

CATCGCGGGT ATGGGCGAAC TGCACCTCGA CATCATCGTT GACCGTATGA 1450 

AGCGTGAATT CAACGTTGAA GCGAACGTAG GTAAACCGCA GGTTGCTTAC 1500 

CGTGAAACTA TCCGCCAGAA AGTTACCGAT GTTGAAGGTA AACACGCGAA 1550 

ACAGTCTGGT GGTCGTGGTC AGTATGGTCA TGTTGTTATC GACATGTACC 1600 

CGCTGGAGCC GGGTTCAAAC CCGAAAGGCT ACGAGTTCAT CAACGACATT 1650 

AAAGGTGGTG TAATCCCTGG CGAATACATC CCGGCCGTTG ATAAAGGTAT 1700 

CCAGGAACAG CTGAAAGCAG GTCCGCTGGC AGGCTACCCG GTAGTAGACA 1750 

TGGGTATTCG TCTGCACTTC GGTTCTTACC ATGACGTTGA CTCCTCTGAA 1800 

CTGGCGTTTA AACTGGCTGC TTCTATCGCC TTTAAAGAAG GCTTTAAGAA 1850 

AGCGAAACCA GTTCTGCTTG AGCCGATCAT GAAGGTTGAA GTAGAAACTC 1900 

CGGAAGAGAA CACCGGTGAC GTTATCGGTG ACTTGAGCCG TCGTCGTGGT 1950 

ATGCTCAAAG GTCAGGAATC TGAAGTTACT GGCGTTAAGA TCCACGCTGA 2000 
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AGTACCGCTG TCTGAAATGT TCGGATACGC AACTCAGCTG CGTTCTCTGA 2050 
CCAAAGGTCG TGCATCATAC ACTATGGAAT TCCTGAAGTA TGATGAAGCG 2100 
CCGAGTAACG TTGCTCAGGC CGTAATTGAA GCCCGTGGTA AATAA 214 5 



2) INFORMATION FOR SEQ ID NO: 1231 

(i) SEQUENCE CHARACTERISTICS: 
L0 (A) LENGTH: 37 bases 

- _(B). TYPE: Nucleic acid. 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231 
GCGAGCCCGA AGATAAAAAA GAACCTCTGC TGCTCGC 37 

20 

2) INFORMATION FOR SEQ ID NO: 1232 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232 
35 GGAGCCGCGC GATTTTATAA ATGAATGTTG ATAACCGGCT CC 42 



40 



2) INFORMATION FOR SEQ ID NO: 1233 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233 

50 

GCGAGCGTTA CTGGTGTAGA AATGTTCCGG CTCGC 35 



55 2) INFORMATION FOR SEQ ID NO: 1234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234 
ACTAAATAAA CGCTCATTCG 20 



2) INFORMATION FOR SEQ ID NO: 1235 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235 
GCGAGCCGAA GTTGAAGTTG TTGGTATTGC TGGCTCGC 38 

2) INFORMATION FOR SEQ ID NO: 1236 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1236 
GCGAGCCGTG GTGAAGTTCG CGTTGGTGGC TCGC 34 

40 

2) INFORMATION FOR SEQ ID NO: 1237 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237 
55 GCGAGCCGCG AAATCGAAGT TGCTGTATTA GGGCTCGC 38 



2) INFORMATION FOR SEQ ID NO: 1238 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238 
GCGAGCGGCG TTAATTTTGG CACCGAAGAA GAGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1239 
GCGAGCGCAG ACCTTTCAGC AGAGGAGGCT CGC 



2) INFORMATION FOR SEQ ID NO: 1240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1240 
GCGAGCCGGC AAGACAATAT GACAGCAAAA TCGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241 
GCGAGCGGGG AACGAGGATG ATTTGATTGG CTCGC 
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2) INFORMATION FOR SEQ ID NO: 1242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
15 (E) STRAIN: BM4147-1 

(F) ACCESSION NUMBER: U3 9790 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1242 

20 TTCTTAGAGA CATTGAATAT GCCTTATGTC GGCGCAGGCG TATTGACCAG 50 

TGCATGTGCC ATGGATAAAA TCATGACCAA GTATATTTTA CAAGCTGCTG 100 

GTGTGCCGCA AGTTCCTTAT GTACCAGTAC TTAAGAATCA ATGGAAAGAA 150 

AATCCTAAAA AAGTATTTGA TCAATGTGAA GGTTCTTTGC TTTATCCGAT 200 

GTTTGTCAAA CCTGCGAATA TGGGTTCTAG TGTCGGCATT ACAAAGGCAG 250 

25 AAAACCGAGA AGAGCTGCAA AATGCTTTAG CAACAGCCTA TCAGTATGAT 300 

TCTCGAGCAA TCGTTGAACA AGGAATTGAA GCGCGCGAAA TCGAAGTTGC 350 

TGTATTAGGA AATGAAGATG TTCGGACGAC TTTGCCTGGC GAAGTCGTAA 4 00 

AAGACGTAGC ATTCTATGAT TATGAAGCCA AATATATCAA TAATAAAATC 450 

GAAATGCAGA TTCCAGCCGA AGTGCCGGAA GAAGTTTATC AAAAAGCGCA 500 

30 AGAGTACGCG AAGTTAGCTT ACACGATGTT AGGTGGAAGC GGATTGAGCC 550 

GGTGCGATTT CTTTTTGACA AATAAAAATG AATTATTCCT GAATGAATTA 600 



35 2 ) INFORMATION FOR SEQ ID NO: 1243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2275 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(C) ACCESSION NUMBER: M38386 



50 



(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1243 



GGTACCAAAG AAAAAAACGA ACGCCACAAC CAACAGCCTC TAAAGCAACA 50 

CCTGCTTCTG AAATTGAGGG AGATTTAGCA AATGTCAATG AGATTCTTTT 100 

GGTTCACGAT GATCGTGTCG GGTCAGCAAC GATGGGAATG AAAGTCTTAG 150 

AAGAAATTTT AGATAAAGAG AAAATTTCAA TGCCGATTCG AAAAATTAAT 200 

55 ATTAATGAAT TAACTCAACA AACACAGGCT TTAATTGTCA CAAAAGCTGA 250 

ACTAACGGAA CAAGCACGTA AAAAAGCACC GAAAGCGACA CACTTATCAG 300 

TAAAAAGTTA TGGTTAATCC CCAAAAATAT GAAACAGTGG GTTTCGCTCT 350 

TAAAAGAAAG TGCCTAGAGA GGAAGAAAAC AATGGAAAAT CTTACGAATA 400 

TTTCAATTGA ATTAAATCAA CAGTTTAATA CAAAAGAAGA AGCTATTCGC 450 

60 TTTTCCGGCC AGAAACTAGT CGAGGCAGGC TGTGTTGAGC CCGCTTATAT 500 

667 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CGAAGCAATG ATTGAAAGAG ACCAATTGCT ATCTGCCCAT ATGGGGAATT 550 

TTATTGCCAT TCCTCATGGA ACAGAAGAAG CCAAAAAATT AGTGAAAAAA 600 

TCAGGAATCT GTGTAGTGCA AGTCCCAGAG GGCGTTAATT TTGGCACCGA 650 

AGAAGATGAA AAAATTGCTA CCGTATTATT TGGGATTGCC GGAGTCGGTG 700 

5 AAGAACATTT GCAATTAGTC CAACAAATTG CACTTTATTG TAGTGATATG 750 

GATAACGTGG TGCAACTTGC CGATGCATTA AGTAAAGAAG AAATAACAGA 800 

AAATTTAGCC ATTGCTTAAA GGAGAGAATA AGAATGAACG CAGTACATTT 850 

TGGAGCAGGA AATATTGGAC GCGGCTTTAT TGGCGAAATT TTAGCTAAAA 900 

CGGGTTTCAT ATTACCGTTT GTGGATGTTA ATGGAAACCA TCATCAAGCG 950 

10 TTAAAAGAAC GTAAAAGTTA TACAATTGAA TTGGCCGATG CCTCACATCA 1000 

ACAAATTAAC GTTGAAAATG TGACCGGGTT AAATAACATG ACAGAACCAG 1050 

"AAAAAGTAGT AGAAGCAATT GCGGAAGCCG ATTTAGTCAC GACGGCAATT 1100 

GGTCCTAATA TTTTACCAAG AATTGCTGAA TTAATTGCTC AAGGAATTGA 1150 

TGCACGTGCC GAAGCAAATT GTCAAAACGG CCCGCTGGAT ATTATCGCTT 1200 

15 GTGAAAATAT GATTGGTGGT TCAACCTTTT TAGCAGAAGA AGTGGCCATA 1250 

ATATTTGAAA AACCCAGCTT ATCTGAACAA TGGATTGGTT TTCCTGATGC 1300 

GGCAGTTGAT CGGATTGTTC CATTACAAAA ACATAAAGAT CCACTTTTTG 1350 

TTCAAGTTGA GCCTTTTTGT GAATGGGTCA TTGATGATAC CAACCGAAAA 1400 

GCCAAAGAGA TTCAGTTAGA AGGCGTCATT ACTTGTCGAT TAGAGCCGTA 1450 

20 TATTGAACGA AAATTATTTA GTGTAACCAG TGGCCATGCT ACAGTTGCCT 1500 

ATACAGGGGC GTTGTTAGGC TATCAAACCA TTGACGAAGC GATGCAGGAC 1550 

GCCTTAGTGG TAGCGCAACT CAAATCAGTT TTGCAGGAAA CCGGTAAACT 1600 

TTTAGTGGCC AAATGGAATT TTGATGAACA AGAACATGCA GCCTATATTG 1650 

AAAAAATTAT CAACCGTTTC CAAAATAAAT ATATTTCAGA TGCTATTACA 1700 

25 CGTGTAGCAC GGACACCAAT CAGAAAATTA GGTGCGCAAG AACGGTTTAT 1750 

TCGACCAATC CGTGAATTAC AGGAACGCAA TCTAGTGTCG GCCGCATTTA 1800 

TAGCAATGAT TGGTATTGTC TTTAATTATC ATGATCCAGA AGATGAACAA 1850 

AGCCGTCAAT TACAGGAAAT GCTTGACCAA GAAAGTGTTG ATACAGTGGA 1900 

TCGCTGAAGT AACGGGCATT GAAGATCCAG AAACGGTTAA AAATATTAAA 1950 

30 CAAAACGTAG AACTGCTATG CGCGACCACA AGTAGCATAA TTAACAAAAT 2000 

CCTTCTACCA AGATACTTCA CATTTCTTAA TTAAAGAAAA AACAACCGCG 2050 

CCTCACCTGA GCCGACCCCC AAAAGTTAGA CCTAGAAATC TAACTTTTGG 2100 

AGGTTTTTTT GTATGGCAAA ATACAGTTTT GAAATTTAAA CTTAAACTTG 2150 

TTCATGACTA CTTATATGGT CAAGGAGGTC TAAGGTTTCT CGCAAAGAAG 22 00 

35 TATGGGTTTA AAGATAGTCT CAAATAAGCA AATGGATAAA TGCCTATAAA 2250 

GAACTTGGTG AAGAAGGGGG GATCC 2275 



40 2) INFORMATION FOR SEQ ID NO: 1244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 25923 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244 



GATCAATCTT TGTCGGTACA CGATATTCTT CACGACTAAA TAAACGCTCA 50 

TTCGCGATTT TATAAATGAA TGTTGATAAC AATGTTGTAT TATCTACTGA 100 

AATCTCATTA CGTTGCATCG GAAACATTGT GTTCTGTATG TAAAAGCCGT 150 

CTTGATAATC TTTAGTAGTA CCGAAGCTGG TCATACGAGA GTTATATTTT 200 

60 CCAGCCAAAA CGATATTTTT ATAATCATTA CGTGAAAAAG GTTTCCCTTC 250 

668 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

ATTATCACAC AAATATTTTA GCTTTTCAGT TTCTATATCA ACTGTAGCTT 300 

CTTTATCCAT ACGTTGAATA ATTGTACGAT TCTGACGCAC CATCTTTTGC 350 

ACACCTTTAA TGTTATTTGT TTTAAAAGCA TGAATAAGTT TTTCAACACA 400 

ACGATGTGAA TCTTCTAAGA AGTCACCGTA AAATGAAGGA TC 442 



669 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



10 



2) INFORMATION FOR SEQ ID NO: 1245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

20 GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

25 TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

30 GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 

ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTC 845 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1246 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 11986 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246 

TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA ATGTCTTCCA 50 

670 
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CAGATGGACT 
TCTGTACCAG 
TGATGCAATT 
CAATTCACCG 
ATTCTTGAAA 
GGGTGGTAAG 
TAATTCAGGA 
GTATTCGCTG 
CGAAATGAGC 
AAATGAACGA 
ACAATGGCTG 
CATCGATAAC 
TTCTTG 



TGTTCGTGGC 
TTGGTGATGT 
GACTTAGATG 
TCAAGCGCCT 
CTGGTATTAA 
ATTGGTCTAT 
ATTAATTAAT 
GTGTAGGTGA 
GATTCTGGCG 
GCCACCTGGA 
AACATTTCCG 
ATCTTCCGTT 



ACAGAAGTAG 
AACACTTGGT 
GTGATGTTCC 
GCATTCGAAG 
AGTAGTAGAC 
TCGGTGGTGC 
AACATCGCAC 
GCGTACTCGT 
TAATTAAGAA 
GCACGTCAAC 
TGATGAGCAA 
TCACGCAAGC 



AAGATACTGG TAAAGCAATC 
CGTGTATTTA ACGTATTAGG 
TGCGGATGTA CGTCGTGATC 
AGTTATCTAC TAAAGTAGAA 
TTACTTGCTC CTTACATTAA 
CGGCGTAGGT AAAACAGTAT 
AAGAGCACGG TGGTATCTCT 
GAAGGTAACG ACTTATACCA 
AACTGCGATG GTATTCGGAC 
GTGTTGCATT AACAGGTTTA 
GGACAAGACG TACTATTGTT 
GGGTTCTGAA GTATCTGCCC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
656 



- - .10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1247 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247 

CGAAAACGGA AGTATTAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAACCA ATCTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGTCGCGTAT TTAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAGGT TCCTGCAGAT GTACATCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGCGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 450 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT A 791 



2) INFORMATION FOR SEQ ID NO: 1248 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
10 " (A) ORGANISM: Bacillus- thuringiensis 

(B) STRAIN: BGSC 4AC1 

(Xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1248 



15 ATCTACAATG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGCATGAA 50 

CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 

TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 

ACTGGTAAAG CAATCTCTGT ACCAGTTGGT GATGCAACAC TTGGACGTGT 200 

ATTCAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAA CTTCCTGCGG 250 

20 ATGTACACCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 300 

TCTACTAAAG TAGAAATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 350 

TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 400 

TAGGTAAAAC AGTATTAATT CAGGAGTTAA TCAATAACAT CGCACAAGAG 450 

CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 500 

25 TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 550 

CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 600 

GCATTAACAG GTTTAACAAT GGCTGAGCAT TTCCGTGATG AGCAAGGACA 650 

AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 700 

CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 750 

30 CCAACACTTG CAACAGAAAT GGGTCAATTA CAAGAGCGTA TTACATCTAC 800 

AAATAAAGGG TCTATCACGT CTATC 825 



35 2) INFORMATION FOR SEQ ID NO: 1249 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGTATTAA 50 
CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 
TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 

672 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/0115O 



ACTGGTAAAC CAATCTCTGT ACCAGTTGGT GATGTAACAC TTGGTCGCGT 200 

ATTTAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAG GTTCCTGCAG 250 

ATGTACATCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 300 

TCTACTAAAG TAGAAATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 350 

5 TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 400 

TAGGTAAAAC AGTATTAATT CAGGAATTAA TTAACAACAT CGCACAAGAG 450 

CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 500 

TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 550 

CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 600 

10 GCATTAACAG GTTTAACAAT GGCTGAGCAT TTCCGTGATG AGCAAGGACA. 650 

AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 700 

CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 750 

CCAACACTTG CAACAGAAAT GGGTC 775 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1250 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250 



CCAGCAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAGC 50 

GAGCATCAAC TTAACATTTG AAGTTGCACT TCATTTAGGT GATGACACAG 100 

35 TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG TGGCACAGAA 150 

GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG ATGTAACACT 200 

TGGTCGCGTA TTCAACGTAT TAGGTGATGC AATTGACTTA GATGGTGATG 250 

TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC ACCTGCATTC 300 

GAAGAACTAT CTACAAGAAT AGAAATTCTT GAAACTGGTA TTAAAGTAGT 350 

40 AGATTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACG GTATTAATTC AGGAATTAAT TAACAACATC 450 

GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG GTGAGCGTAC 500 

TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT GGCGTAATTA 550 

AGAAAACTGC GATGGTATTT GGACAAATGA ACGAGCCACC TGGAGCACGT 600 

45 CAACGTGTTG CATTAACAGG TTTAACAATG GCTGAACATT TCCGTGATGA 650 

GCAAGGACAA GACGTACTAT TGTTCATCGA TAACATCTTC CGTTTCACGC 700 

AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GTCGTATGCC ATCTGCGGTA 750 

GGTTACCAAC CAACACTTGC AACAGAAATG GGTCAATTAC AAGAGCGTAT 800 

TACATCTACA AATAAAGGGT CTATCACGTC TA 832 

50 



2) INFORMATION FOR SEQ ID NO: 1251 

673 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 - (vi) ORIGINAL SOURCE: - 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1251 



AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 

AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 

AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA GAAGTAGAAG 150 

ATACTGGTAA ACCAATCTCT GTACCAGTTG GTGATGTAAC ACTTGGTCGC 200 

20 GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG AGGTTCCTGC 250 

AGATGTACAT CGTGATCCAA TTCACCGTCA AGCACCTGCA TTCGAAGAAT 300 

TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT AGTAGACTTA 350 

CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG GTGGTGCCGG 400 

CGTAGGTAAA ACAGTATTAA TTCAGGAATT AATTAACAAC ATCGCACAAG 450 

25 AGCACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG TACTCGTGAG 500 

GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA TCAAGAAAAC 550 

TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA CGTCAACGTG 600 

TTGCATTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA TGAGCAAGGA 650 

CAAGACGTAC TTCTGTTCAT CGATAACATC TTCCGTTTCA CGCAAGCGGG 700 

30 TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT GCCATCTGCG GTAGGTTACC 750 

AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG TATTACATCT 800 

• AC 802 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1252 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1410 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252 

AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 
AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 

674 
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AGTTGCAATG 


TCTTCCACAG 


ATGGACTTGT 


TCGTGGCACA 


f*% TV TV ^% m TV TV TV 

GAAGTAGAAG 


150 




ATACTGGTAA 


ACCAATCTCT 


GTACCAGTTG 


tv m s> tv ta s** 

GTGATGTAAC 


ACTTGGTCGC 


o r\ r\ 

200 




GTATTTAACG 


TATTAGGTGA 


TGCAATTGAC 


TTAGATGGTG 


AGGTTCCTGC 


O C Pi 

250 




AGATGTACAT 


CGTGATCCAA 


TTCACCGTCA 


TV /**! TV /*N m TV 

AGCACCTGCA 


TTCGAAGAAT 


300 


5 


TATCTACTAA 


AGTAGAAATT 


CTTGAAACTG 


TV 17^ TV TV TV rn 

GTATTAAAGT 


AGTAGACTTA 


350 




CTTGCTCCTT 


ACATTAAGGG 


TGGTAAGATC 


GGCCTATTCG 


GTGGTGCCGG 


400 




CGTAGGTAAA 


ACAGTATTAA 


TTCAGGAATT 


AATTAACAAC 


ATCGCACAAG 


450 




AGCACGGTGG 


TATCTCTGTA 


TTCGCTGGTG 


ITT TV m TV >^ 

TAGGTGAGCG 


TACTCGTGAG 


500 




GGTAATGACT 


TATACCACGA 


tv tv m >^ tv y*^ ^ T\ m 

AATGAGCGAT 


TCTGGCGTAA 


m TV TV TV TV TV TV /"^ 

TCAAGAAAAC 


CCA 

550 


10 


TGCGATGGTA 


TTCGGACAAA- 


TGAACGAGCC. 


. ACCTGGAGCA 


CGTCAACGTG 


600 




TTGCATTAAC 


Tv v*^ /"N fT"L m T\ TV T\ 

AGGCTTAACA 


ATGGCTGAGC 


ATTTCCGTGA 


m /*"» tv /"» y> tv tv /"» t\ 

TGAGCAAGGA 


650 




CAAGACGTAC 


TTCTGTTCAT 


CGATAACATC 


TTCCGTTTCA 


CGCAAGCGGG 


700 




TTCTGAAGTA 


TCTGCCCTTC 


TTGGTCGTAT 


GCCATCTGCG 


GTAGGTTACC 


750 




AACCAACACT 


TGCAACAGAA 


ATGGGTCAAT 


TACAAGAGCG 


TATTACATCT 


800 


15 


ACAAATAAAG 


GGTCTATCAC 


GTC 






823 




2) INFORMATION FOR SEQ 


ID NO: 1253 






20 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 798 bases 










(B) 


TYPE: Nucleic acid 










(C) 


STRANDEDNESS: Double 








25 


(D) 


TOPOLOGY: 


Linear 










(ii) MOLECULE TYPE: 


Genomic DNA 








(vi) ORIGINAL SOURCE: 








30 


(A) 


ORGANISM: 


Bacillus cereus 








(B) 


STRAIN: ATCC 13472 










(xi) SEQUENCE DESCRIPTION: SEQ 


ID NO: 1253 




35 


CCAGAAATCT 


ACAATGCCCT 


TACGGTAAAA 


CAAAGCAACG 


AAAACGGAAG 


50 




CATGAACTTA 


ACATTTGAAG 


TTGCACTTCA 


TTTAGGTGAT 


GATACAGTTC 


100 




GTACAGTTGC 


GATGTCTTCC 


AC AG AT GG AC 


TTGTTCGTGG 


CACAGAAGTA 


150 




GAAGATACTG 


GTAAAGCAAT 


CTCTGTACCA 


GTTGGTGATG 


CAACACTTGG 


o r\ a 

200 




ACGTGTATTC 


AACGTATTAG 


GTGATGCAAT 


TGACTTAGAT 


GGTGAACTTC 


250 


40 


CTGCGGATGT 


ACACCGTGAT 


CCAATTCACC 


GTCAAGCACC 


TGCATTCGAA 


300 




GAATTATCTA 


CTAAAGTAGA 


AATTCTTGAA 


ACTGGTATTA 


AAGTAGTAGA 


n C A 

350 




CTTACTTGCT 


CCTTACATTA 


AGGGTGGTAA 


GATCGGCCTA 


TTCGGTGGTG 


400 




CCGGCGTAGG 


TAAAACAGTA 


TTAATTCAGG 


AATTAATCAA 


TAACATCGCA 


450 




CAAGAGCATG 


GTGGTATCTC 


TGTATTCGCT 


GGTGTAGGTG 


AGCGTACTCG 


500 


45 


TGAGGGTAAT 


GACTTATACC 


ACGAAATGAG 


CGATTCTGGC 


GTAATCAAGA 


550 




AAACTGCGAT 


GGTATTCGGA 


CAAATGAACG 


AGCCACCTGG 


AGCACGTCAA 


600 




CGTGTTGCAT 


TAACAGGTTT 


AACAATGGCT 


GAGCATTTCC 


GTGATGAGCA 


650 




AGGACAAGAC 


GTACTTCTGT 


TCATCGATAA 


CATCTTCCGT 


TTCACGCAAG 


700 




CGGGTTCTGA 


AGTATCTGCC 


CTTCTTGGTC 


GTATGCCATC 


TGCGGTAGGT 


750 


50 


TACCAACCGA 


CACTTGCAAC 


AGAAATGGGT 


CAATTACAAG 


AGCGTATT 


798 



675 
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2) INFORMATION FOR SEQ ID NO: 1254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1254 

CATTTGAAGT TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA 50 

ATGTCTTCCA CAGATGGACT TGTTCGTGGC ACAGAAGTAG AAGATACTGG 100 

TAAAGCAATC TCTGTACCAG TTGGTGATGC AACACTTGGT CGTGTATTTA 150 

20 ACGTATTAGG TGATGCAATT GACTTAGATG GTGAGGTTCC TGCGGATGTA 200 

CGTCGTGATC CAATTCACCG TCAAGCACCT GCATTCGAAG AATTATCTAC 250 

TAAAGTAGAA ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC 300 

CTTACATTAA GGGTGGTAAG ATCGGTCTAT TCGGTGGTGC CGGTGTAGGT 350 

AAAACGGTAT TAATTCAGGA ATTAATCAAT AACATCGCAC AAGAACACGG 4 00 

25 TGGTATCTCT GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAGGGTAATG 450 

ACTTATACCA CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG 500 

GTATTCGGAC AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCGTT 550 

AACAGGTTTA ACAATGGCTG AGCATTTCCG TGATGAGCAA GGACAAGACG 600 

TACTTCTGTT CATCGATAAT ATCTTCCGTT TCACGCAAGC AGGTTCTGAA 650 

30 GTATCTGCCC TTCTTGGCCG TATGCCATCT GCGGTAGGTT ACCAACCAAC 700 

ACTTGCAACA GAAATGGGTC AATTACAAGA GCGTATTACA TCTACAAATA 750 

AAGGGTCTAT CACGTCT 767 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1255 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: . 1174 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: C-14 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255 

GAAATGCGTG AATCATTTTT AGATTATGCG ATGAGTGTTA TCGTTGCTCG 50 
TGCATTGCCA GATGTTCGTG ACGGTTTAAA ACCAGTACAT CGTCGTATAC 100 

676 
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TATATGGATT AAATGAACAA GGTATGACAC CGGATAAATC ATATAAAAAA 150 

TCAGCACGTA TCGTTGGTGA CGTAATGGGT AAATATCACC CTCATGGTGA 200 

CTTATCTATT TATGAAGCAA TGGTACGTAT GGCTCAAGAT TTCAGTTATC 250 

GTTATCCGCT TGTTGATGGC CAAGGTAACT TTGGTTCAAT GGATGGAGAT 300 

5 GGCGCAGCAG CAATGCGTTA TACTGAAGCG CGTATGACTA AAATCACACT 350 

TGAACTGTTA CGTGATATTA ATAAAGATAC AATAGATTTT ATCGATAACT 400 

ATGATGGTAA TGAAAGAGAG CCGTCAGTCT TACCTGCTCG ATTCCCTAAC 450 

TTATTAGCCA ATGGTGCATC AGGTATCGCG GTAGGTATGG CAACGAATAT 500 

TCCACCACAT AACTTAACAG AATTAATCAA TGGTGTACTT AGCTTAAGTA 550 

10 AGAACCCTGA TATTTCAATT GCTGAGTTAA TGGAGGATAT TGAAGGTC.CT 600 

GATTTCCCAA CTGCTGGACT TATTTTAGGT AAGAGTGGTA TTAGACGTGC 650 

ATATGAAACA GGTCGTGGTT CAATTCAAAT GCGTTCTCGT GCAGTTATTG 700 

AAGAACGTGG AGNCGGACGT CAACGTATTG TTGTCACTGA AATTCCTTTC 750 

CAAGTGAATA AGGCTCGTAT GATTGAAAAA ATTGCAGAGC TCGTTCGTGA 800 

15 CAAGAAAATT GACGGTATCA CTGATTTACG TGATGAAACA AGTTTACGTA 850 

CTGGTGTGCG TGTCGTTATT GATGTGCGTA AGGATGCAAA TGCTAGTGTC 900 

ATTTTAAATA ACTTATACAA ACAAACACCT CTTCAAACAT CATTTGGTGT 950 

GAATATGATT GCACTTGTAA ATGGTAGACC GAAGCTTATT AATTTAAAAG 1000 

AAGCGTTGGT ACATTATTTA GAGCATCAAA AGACAGTTGT TAGAAGACGT 1050 

20 ACGCAATACA ACTTACGTAA AGCTAAAGAT CGTGCCCACA TTTTAGAAGG 1100 

ATTACGTATC GCACTTGACC ATATCGATGA AATTATTTCA ACGATTCGTG 1150 

AGTCAGATAC AGATAAAGTT GCAA 1174 



25 



2) INFORMATION FOR SEQ ID NO: 1256 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10209 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256 



ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

45 GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGTAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGA GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCGCCTGC ATTCGAAGAG 300 

TTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 400 

50 GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCACG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGAGC ACGTCAACGT 600 
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GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 
GCAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 
GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 
CAGCCAACAC TTGCAACAGA AATGGGTCAA 



650 
700 
750 
780 



- 10. 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1257 

. (i) SEQUENCE CHARACTERISTICS: ...... 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE :• Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 7700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257 

CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA ACGAAAACGG 50 

AACAAGCATT AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGACA 100 

CAGTTCGTAC AGTTGCAATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

ACTTGGTCGT GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AGGTTCCTGC GGATGTACGT CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGTCTATTCG 400 

GTGGTGCCGG TGTAGGTAAA ACGGTATTAA TTCAGGAATT AATCAATAAC 450 

ATCGCACAAG AACACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG 500 

TACTCGTGAG GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA 550 

TTAAGAAAAC TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA 600 

CGTCAACGTG TTGCGTTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA 650 

TGAGCAAGGA CAAGATGTAC TTCTGTTCAT CGATAATATC TTCCGTTTCA 700 

CGCAAGCAGG TTCTGAAGTA TCTGCCCTTC TTGGCCGTAT GCCATCTGCG 750 

GTAGGTTACC AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG 800 

TATTACATCT ACAAATA 817 



2) INFORMATION FOR SEQ ID NO: 1258 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

.(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1404 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258 

CCAGAAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

TATTAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

10 GAAGATACTG GTAAACCAAT CTCTGTACGA GTTGGTGATG- TAACACTTGG . 200 

TCGCGTATTT AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAGGTTC 250 

CTGCAGATGT ACATCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA, ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400 

15 CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATTAA CAACATCGCA 4 50 

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

20 AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT AAAGGGTCTA TCACGTCTA 829 



25 



2) INFORMATION FOR SEQ ID NO: 1259 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 
40 (C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1259 

TGGCGGAAAG CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA 50 

4 5 ACGAAAACGG TGAACTTAAC TTAACATTTG AAGTTGCACT TCATTTAGGT 100 

GATGATACAG TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG 150 

TGGCACAGAA GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG 200 

ATGCAACACT TGGTCGCGTA TTTAACGTAT TAGGTGATGC TATTGACTTA 250 

GATGGTGAGG TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC 300 

50 ACCTGCATTC GAAGAATTAT CTACTAAAGT AGAAATTCTT GAAACTGGTA 350 

TTAAAGTAGT AGACTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGC 400 

CTATTCGGTG GTGCCGGTGT AGGTAAAACA GTATTAATTC AGGAGTTAAT 450 

CAACAACATC GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG 500 
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GTGAGCGTAC TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT 
GGCGTAATTA AGAAAACTGC GATGGTATTC GGACAAATGA ACGAGCCACC 
TGGAGCACGT CAACGTGTTG CATTAACAGG CTTAACAATG GCTGAATATT 
TCCGTGATGA GCAAGGACAA GACGTACTTC TGTTCATCGA TAATATCTTC 
CGTTTCACGC AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GCCGTATGCC 
ATCTGCGGTA GGTTACCAAC CAACACTTGC AACAGAAATG GGTCAATTAC 
AAGAGCGTAT TACATCTACA AATAAAGGGT CTATCACGTC TATC 



550 
600 
650 
700 
750 
800 
844 



-10- 
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2) INFORMATION FOR SEQ ID NO: 1260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1260 

AAGCTACCAG AAATCTACAA CGCCCTTACG GTAAAACAGA GCAACGAAAA 50 

CGGAACAAGC ATTAACTTAA CATTTGAAGT TGCACTTCAT TTAGGTGATG 100 

ACACAGTTCG TACAGTTGCA ATGTCTTCCA CAGATGGACT TGTTCGTGGC 150 

ACAGAAGTAG AAGATACTGG TAAAGCAATC TCTGTACCAG TTGGTGATGC 200 

AACACTTGGT CGTGTATTTA ACGTATTAGG TGATGCAATT GACTTAGATG 250 

GTGAGGTTCC TGCGGATGTA CGCCGTGATC CAATTCACCG TCAAGCACCT 300 

GCATTCGAAG AATTATCTAC TAAAGTAGAA ATTCTTGAAA CTGGTATTAA 350 

AGTAGTAGAC TTACTTGCTC CTTACATTAA GGGTGGTAAG ATCGGTCTAT 400 

TCGGTGGTGC CGGTGTAGGT AAAACAGTAT TAATTCAGGA ATTAATCAAC 4 50 

AACATCGCAC AAGAACACGG TGGTATCTCT GTATTCGCTG GTGTAGGTGA 500 

GCGTACTCGT GAGGGTAATG ACTTATACCA CGAAATGAGC GATTCAGGCG 550 

TAATTAAGAA AACTGCGATG GTATTCGGAC AAATGAACGA GCCACCTGGA 600 

GCGCGTCAAC GTGTTGCGTT AACAGGTTTA ACAATGGCTG AGCATTTCCG 650 

TGATGAGCAA GGACAAGACG TTCTTCTGTT CATCGATAAT ATCTTCCGTT 700 

TCACGCAAGC AGGTTCTGAA GTATCTGCCC TTCTTGGTCG TATGCCATCT 750 

GCGGTAGGTT ACCAACCAAC ACTTGCAACA GAAATGGGTC AATTACAAGA 800 

GCGTATTACA TCTACAAATA AAGGGTCTAT CACGTCTATC 840 



2) INFORMATION FOR SEQ ID NO: 1261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AZ1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261 



10" GCGGAAAGCT ACCAGAAATC TACAATGCCC TTACGGTAAA ACAAAGCAAC . _ 50 

GAAAACGGAA GCATGAACTT AACATTTGAA GTTGCACTTC ATTTAGGTGA 100 

TGATACAGTT CGTACAGTTG CGATGTCTTC CACAGATGGA CTTGTTCGTG 150 

GCACAGAAGT AGAAGATACT GGTAAAGCAA TCTCTGTACC AGTTGGTGAT 200 

GCAACACTTG GACGTGTATT CAACGTATTA' GGTGATGCAA TTGACTTAGA 250 

15 TGGTGAACTT CCTGCGGATG TACACCGTGA TCCAATTCAC CGTCAAGCAC 300 

CTGCATTCGA AGAATTATCT ACTAAAGTAG AAATTCTTGA AACTGGTATT 350 

AAAGTAGTAG ACTTACTTGC TCCTTACATT AAGGGTGGTA AGATCGGCCT 400 

ATTCGGTGGT GCCGGCGTAG GTAAAACAGT ATTAATTCAG GAGTTAATCA 450 

ATAACATCGC ACAAGAGCAC GGTGGTATCT CTGTATTCGC TGGTGTAGGT 500 

20 GAGCGTACTC GTGAGGGTAA TGACTTATAC CACGAAATGA GCGATTCTGG 550 

CGTAATCAAG AAAACTGCGA TGGTATTCGG ACAAATGAAC GAGCCACCTG 600 

GAGCACGTCA ACGTGTTGCA TTAACAGGTT TAACAATGGC TGAGCATTTC 650 

CGTGATGAGC AAGGACAAGA CGTACTTCTG. TTCATCGATA ACATCTTCCG 700 

TTTCACGCAA GCGGGTTCTG AAGTATCTGC CCTTCTTGGT CGTATGCCAT 750 

25 CTGCGGTAGG TTACCAACCA ACACTTGCAA CAGAAATGGG TCAATTACAA 800 

GAGCGTATTA CATCTACAAA TAAAGGGTCT ATCACGTCT 839 



30 2) INFORMATION FOR SEQ ID NO: 1262 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

3 5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4H2 



(xi) SEQUENCE DESCRIPTION: SEQ 

45 

AAGCTACCAG AAATCTACAA TGCCCTTACG 
CGGAAGCATG AACTTAACAT TTGAAGTTGC 
CAGTTCGTAC AGTTGCGATG TCTTCCACAG 
GAAGTAGAAG ATACTGGTAA AGCAATCTCT 
50 ACTTGGACGT GTATTCAACG TATTAGGTGA 
AACTTCCTGC GGATGTACAC CGTGATCCAA 
TTCGAAGAAT TATCTACTAA AGTAGAAATT 
AGTAGACTTA CTTGCTCCTT ACATTAAGGG 

68 



ID NO: 1262 

GTAAAACAAA GCAACGAAAA 50 

ACTTCATTTA GGTGATGATA 100 

ATGGACTTGT TCGTGGCACA 150 

GTACCAGTTG GTGATGCAAC 200 

TGCAATTGAC TTAGATGGTG 250 

TTCACCGTCA AGCACCTGCA 300 

CTTGAAACTG GTATTAAAGT 350 

TGGTAAGATC GGCCTATTCG 400 
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GTGGTGCCGG CGTAGGTAAA ACAGTATTAA 
ATCGCACAAG AGCACGGTGG TATCTCTGTA 
TACTCGTGAG GGTAATGACT TATACCACGA 
TCAAGAAAAC TGCGATGGTA TTCGGACAAA 
5 CGTCAACGTG TTGCATTAAC AGGTTTAACA 
TGAGCAAGGA CAAGACGTAC TTCTGTTCAT 
CGCAAGCGGG TTCTGAAGTA TCTGCCCTTC 
GTAGGTTACC AACCAACACT TGCAACAGAA 
TATTACATCT ACAAATAAAG GGTCTATCAC 
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TTCAGGAATT AATCAATAAC 450 

TTCGCTGGTG TAGGTGAGCG 500 

AATGAGCGAT TCTGGCGTAA 550 

TGAACGAGCC ACCTGGAGCA 600 

ATGGCTGAGC ATTTCCGTGA 650 

CGATAACATC TTCCGTTTCA 700 

TTGGTCGTAT GCCATCTGCG 750 

ATGGGTCAAT TACAAGAGCG 800 

GTC 833 



2 ) INFORMATION FOR SEQ ID NO: 1263 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ISO bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
25 (B) STRAIN: BGSC 4Q1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263 



CGAAAACGGA AGCATGAACT TAACATTTGA 
0 ATGATACAGT TCGTACAGTT GCGATGTCTT 
GGCACAGAAG TAGAAGATAC TGGTAAAGCA 
TGTAACACTT GGACGTGTAT TCAACGTATT 
ATGGTGAACT TCCTGCGGAT GTACACCGTG 
CCTGCATTCG AAGAATTATC TACTAAAGTA 
5 TAAAGTAGTA GACTTACTTG CTCCTTACAT 
TATTCGGTGG TGCCGGTGTA GGTAAAACAG 
AACAACATCG CACAAGAGCA CGGTGGTATC 
TGAGCGTACT CGTGAGGGTA ATGACTTATA 
GCGTAATCAA GAAAACTGCG ATGGTATTCG 
0 GGAGCACGTC AACGTGTTGC ATTAACAGGT 
CCGTGATGAG CAAGGACAAG ACGTACTTCT 
GTTTCACGCA AGCGGGTTCT GAAGTATCTG 
TCTGCGGTAG GTTACCAACC AACACTTGCA 
AGAGCGTATT ACATCTACAA ATAAAGGGTC 



AGTTGCACTT 


CATTTAGGTG 


50 


CCACAGATGG 


ACTTGTTCGT 


100 


ATTTCTGTAC 


CAGTTGGTGA 


150 


AGGTGATGCA 


ATTGACTTAG 


200 


ATCCAATTCA 


CCGTCAAGCA 


250 


GAAATTCTTG 


AAACTGGTAT 


300 


TAAGGGTGGT 


AAGATCGGCC 


350 


TATTAATTCA 


GGAATTAATT 


400 


TCTGTATTCG 


CTGGTGTAGG 


450 


CCACGAAATG 


AGCGATTCTG 


500 


GACAAATGAA 


CGAGCCACCT 


550 


TTAACAATGG 


CTGAGCATTT 


600 


GTTCATCGAT 


AACATCTTCC 


650 


CCCTTCTTGG 


TCGTATGCCA 


700 


ACAGAAATGG 


GTCAATTACA 


750 


TATCACGTCT 




790 



2) INFORMATION FOR SEQ ID NO: 1264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

682 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1264 



10 - - . _ 

AGTTGCACTT CATTTAGGTG ATGATACAGT TCGTACAGTT GCGATGTCTT . 50 

CCACAGATGG ACTTGTTCGT GGCACAGAAG TAGAAGATAC TGGTAAACCA 100 

ATCTCTGTAC CAGTTGGTGA TGTAACACTT, GGTCGCGTAT TTAACGTATT 150 

AGGTGATGCA ATTGACTTAG ATGGTGAGGT TCCTGCAGAT GTACATCGTG 200 

15 ATCCAATTCA CCGTCAAGCA CCTGCATTCG AAGAATTATC TACTAAAGTA 250 

GAAATTCTTG AAACTGGTAT TAAAGTAGTA GACTTACTTG CTCCTTACAT 300 

TAAGGGTGGT AAGATCGGCC TATTCGGTGG TGCCGGCGTA GGTAAAACAG 350 

TATTAATTCA GGAATTAATT AACAACATCG CACAAGAGCA CGGTGGTATC 400 

TCTGTATTCG CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ATGACTTATA 450 

20 CCACGAAATG AGCGATTCTG GCGTAATCAA GAAAACTGCG ATGGTATTCG 500 

GACAAATGAA CGAGCCACCT GGAGCACGTC AACGTGTTGC ATTAACAGGT 550 

TTAACAATGG CTGAGCATTT CCGTGATGAG CAAGGACAAG ACGTACTTCT 600 

GTTCATCGAT AACATCTTCC GTTTCACGCA AGCGGGTTCT GAAG 644 

25 

2) INFORMATION FOR SEQ ID NO: 1265 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1265 



GGCGGAAAGC TACCAGAAAT CTACAACGCC CTTACGGTAA AACAGAGCAA 50 

CGAAAACGGA ACAAGCATTA ACTTAACATT TGAAGTTGCA CTTCATTTAG 100 

45 GTGATGACAC AGTTCGTACA GTTGCAATGT CTTCCACAGA TGGACTTGTT 150 

CGTGGCACAG AAGTAGAAGA TACTGGTAAA GCAATCTCTG TACCAGTTGG 200 

TGATGCAACA CTTGGTCGTG TATTTAACGT ATTAGGTGAT GCAATTGACT 250 

TAGATGGTGA GGTTCCTGCG GATGTACGTC GTGATCCAAT TCACCGTCAA 300 

GCACCTGCAT TCGAAGAATT ATCTACTAAA GTAGAAATTC TTGAAACTGG 350 

50 TATTAAAGTA GTAGACTTAC TTGCTCCTTA CATTAAGGGT GGTAAGATCG 400 

GTCTATTCGG TGGTGCCGGT GTAGGTAAAA CGGTATTAAT TCAGGAATTA 450 

ATCAATAACA TCGCACAAGA ACACGGTGGT ATCTCTGTAT TCGCTGGTGT 500 

AGGTGAGCGT ACTCGTGAGG GTAATGACTT ATACCACGAA ATGAGCGATT 550 
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CTGGCGTAAT TAAGAAAACT GCGATGGTAT TCGGACAAAT GAACGAGCCA 
CCTGGAGCAC GTCAACGTGT TGCGTTAACA GGTTTAACAA TGGCTGAGCA 
TTTCCGTGAT GAGCAAGGAC AAGATGTACT TCTGTTCATC GATAATATCT 
TCCGTTTCAC GCAAGCAGGT TCTGAAGTAT CTGCCCTTCT TGGCCGTATG 
CCATCTGCGG TAGGTTACCA ACCAACACTT GCAACAGAAA TGGGTCAATT 
ACAAGAGCGT ATTACATCTA CAA 



600 
650 
700 
750 
800 
823 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION "FOR "SEQ ID NO: 1266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: 'Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266 

TGGTCCGAGR CCCGATTCMA TGAAATTATC AAGGAAACCT CCAAYTTCAT 50 

TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT CCCATTTCTG 100 

GTTTCCAGGG TGACAACATG ATCGATSCCT CTGCCAACTG CCCATGGTAC 150 

AAGGGCTGGT ACMAKGAGAC TGCCGACAGG CAAGYACTCT GGCAAGACCC 200 

TTCTTGAGGC CATTGACGSC ATTGAGCCCC CCAMSCGTCC TWCCGATAAA 250 

CCTCTCCGTC TTCCTCTCCA GGATGTCTAC AAGATCTCCG GTATTGGMAC 300 

TGTTCCTGTC GGACGTRTTG AGACTGGAGT CATCAAGCCC GGTATGGTCG 350 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAAATG 400 

CACCACCAGC AGCTTTCCGA CGGTAWCCCC GGTGACAACG TCGGCTTCAA 450 

CGTCAAGAAT GTTTCCGTCA AAGAAGTCCG CCGTGGTAAC GTTGCCTGGT 500 

GACTCTAAGA ATGATCCCGC MAWGGGCTGC GATTCCTTCA ATGCYCAGGT 550 

CATCGTCCTC AACCACCCTG GTCAGGTTGG CGCTGGTTAT GCCCCAGTCC 600 

TCGAYTGCCA TACTGCCCAC ATTGCYTGCA ARTTCGCTGA GMTCMAKGAG 650 

AAGATTGAYC GCCGAACCGG MAAGTCTGTT GAGAACGCCC CCAAGTTCAT 700 

CAAGTCCGGT GATGC 715 



2) INFORMATION FOR SEQ ID NO: 1267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 56220 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1267 

GAGTCCTCTT ATTTACTTTT GTCATGACTA CCTTACTAAT CTGTCATAGA 50 

TCGTTACAAC GAAATCGTCA AGGAGACTTC CAACTTCATC AAGAAGGTCG 100 

GATACAACCC CAAGAACGTT CCTTTCGTTC CTATCTCCGG TTTCAACGGC 150 

10 "GACAACATGC TTGAGCCCTC- CCCCAACTGC CCCTGGTACA AGGGTTGGGA 200 

GAAGGAGACC AAGGCCGGTA AGGTCACTGG TAAGACCCTC CTCGAGGCCA 250 

TCGACGCCAT TGAGCCCCCT ACCCGTCCCG CCAACAAGGT CAGTACTACC 300 

TCAATTACTT GAACTCTCTT CATACGTTCC, GATTACTGAC TGCTTCACAG 350 

CCCCTCCGTC TTCCCCTCCA GGACGTTTAC AAGATCGGTG GTATTGGAAC 400 

15 GGTGCCCGTC GGTCGTGTTG AGACCGGTAC CATCTCCCCT GGTATGGTCG 450 

TTACCTTGTA TGTATCCTGA CCATCCCCCT TGGCAATCAT TACGTACTAA 500 

CTCACTCTTC AGCGCTCCCG CCAACGTCAC CACTGAAGTC AAGAGTGTTG 550 

AAATGCACCA CCAGCAGCTC GCTGCCGGTC AGCCCGGTGA CAACGTTGGT 600 

TTCAACGTGA AGAACGTCTC CGTCAAGGAA ATCCGTCGTG GTAACGTTGC 650 

20 TGGTGATAGC AAGAACGACC CCCCTGCCGG TGCTGCTTCC TTCAACGCCC 700 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGTGCTGG TTACGCCCCA 750 

GTCCTTGACT GCCACACTGC CCACATTGCT TGCAAGTTCT CTGAACTCCT 800 

TGAGAAGATT GACCGTCGTA CCGGAAAGTC TGTTGAGGAC CACCCCAAGT 850 

TCATCAAGTC CGGTGACGCT GCCAT 875 

25 



2) INFORMATION FOR SEQ ID NO: 1268 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 
40 (B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268 



GTGAGCGTGG 
45 AAGTACAGTG 
ATTTCAACTT 
GCCATCGTGA 
TGCGCTATCC 
CTCCAAGGAT 
50 GTGTGAGGCA 
TCCGAGTCCC 
GAAGGTCGGA 
TCGAGGGTGA 



TATCACCATC 
TCACTGTCAT 
CCAGAGTATC 
CTTCATCAAG 
TCATCATTGC 
GGCCAGACTC 
ACTCATCGTT 
GTTTCAACGA 
TATAACCCCA 
CAACATGATT 



GATATTGCCC 
TGGTGAGTGC 
TACTCTAACA 
AACATGATCA 
TGCCGGTACT 
GTGAGCACGC 
GCCATCAACA 
AATCATCAAG 
AGGCTGTTCC 
GAACCCTCCC 



TCTGGAAATT 
TTTTTACCCC 
TATCCGCTTA 
CTGGTACCTC 
GGTGAGTTCG 
TCTGCTTGCT 
AGATGGACAC 
GAGGTTTCCA 
CTTCGTGCCA 
CCAACTGCAC 



CGAGACCCCG 
TCTTAAGCAG 
GATGCTCCCG 
CCAGGCTGAC 
AGGCTGGTAT 
TTCACCCTTG 
CACCAAGTGG 
ACTTCATCAA 
ATCTCTGGTT 
CTGGTACAAG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
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GGCTGGAACA AGGAGACTGC CTCTGGCAAG TCTTCTGGTA AAACCCTTCT 550 

CGATGCCATT GACGCCATTG AACCCCCAAC CCGTCCTACC GATAAGCCTC 600' 

TCCGTCTTCC CCTCCAGGAT GTTTACAAAA TCTCTGGTAT TGGCACTGTT 650 

CCCGTCGGAC GTGTTGAGAC TGGTGTCATC AAGCCCGGTA TGGTCGTGAC 700 

5 TTTCGCTCCC TCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACCAACAACT CCAGGCTGGT TACCCTGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTTT CAGTCAAGGA AGTCCGCCGT GGCAACGTTG CTGGCGACTC 850 

CAAAAATGAT CCCCCCAAGG GCTGCGAATC CTTCAATGCC CAGGTCATCG 900 

TCCTTAACCA CCCCGGCCAG GTTGGCGCTG GTTATGCCCC AGTCCTCGAC 950 

10 - TGCCACACTG CCCACATTGC -TTGCAAGT.TC .TCTGAACTCA TTGAGAAGAT 1000 

CGACCGCCGT ACTGGAAAGT CTGTTGAGAA CAACCCCAAG TTCATCAAGT 1050 

CTGGTGATGC TGCTATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTG 1100 

GAGCCCTTCA CTGACTATCC CCCT ' 1124 



15 



30 



50 



2) INFORMATION FOR SEQ ID NO: 1269 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1043 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton rubrum 

(B) STRAIN: WSA-224 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1269 



GTGAGCGTGG TATCACCATC GATATCGCCC 
AAGTACAATG TCACCGTCAT TGGTATGTTT 

35 TGGTTGTACC ATATCTAACG AGAGTAGACG 
ATCAAGAACA TGATCACTGG TACCTCCCAG 
CATTGCTGCC GGTACTGGTG AGTTCGAGGC 
AGACCCGTGA GCACGCTCTG CTCGCCTTCA 
ATCGTTGCCA TCAACAAGAT GGACACCACC 

40 CAAGGAAATT ATCAAGGAAG TCACCAACTT 
ACCCCAAGGG TGTTCCATTC GTTCCAATCT 
ATGATTGAGG CCTCCACCAA CTGCCCATGG 
GACCAAGGCC GGTGGTGCCA AGTCCGGCAA 
ATGCCATCGA CATGCCAACC CGTCCTACCG 

45 CTCCAGGATG TCTACAAGAT CTCTGGTATC 
TGTTGAGACC GGTATCATCA AGCCCGGTAT 
CCAACGTCAC CACTGAAGTC AAGTCCGTYK 
CAGCAGGGTG TCCCCGGTGA CAACGTCGGC 
CGTCAAGGAA GTCCGCCGTG GTAACGTTGC 



CCCGGCCAGA TCGGTGCTGG 
TCACATTGCT TGCAAGTTCG 



TCTGGAAGTT 


CGAGACCCCC 


50 


CTTTGCCTTG 


TTCCCTCATG 


100 


CCCCCGGTCA 


CCGTGACTTC 


150 


GCTGACTGCG 


CTATTCTCAT 


200 


TGGTATCTCC 


AAGGATGGCC 


250 


CCCTCGGTGT 


CAAGCAGCTC 


300 


GGCTGGTCCG 


AGGATCGTTT 


350 


CATCAAGAAG 


GTTGGCTACG 


400 


CTGGTTTCAA 


CGGTGACAAC 


450 


TACAAGGGAT 


GGAACAAGGA 


500 


GACCCTCCTC 


GAGGCCATCG 


550 


ACAAGCCCCT 


CCGTCTCCCA 


600 


GGAACTGTGC 


CAGTCGGTCG 


650 


GGTCGTCACC 


TTNGCCCCCG 


700 


AAATGCACCA 


CCAGCAGCTT 


750 


TTCAATGTCA 


AGAACGTTTC 


800 


CGGTGACTCC 


AAGAACGACC 


850 


AGGTCATYGT 


CCTCAACCAC 


900 


GTCCTCGACT 


GCCACACTGS 


950 


CGAGAAGATT 


GACCGCCGTA 


1000 


TCGTCAAGTC 


TGG 


1043 
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2) INFORMATION FOR SEQ ID NO: 1270 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum canis 

(B) STRAIN: WSA-217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270 



GCTGAGCGTG AGCGTGGTAT 
GACCCCCAAG TACATGGTCA 
TTTATAGTTG CGACCAGTAA 
TGACTTCATC AAGAACATGA 
TTCTCATCAT TGCTGCCGGT 
GATGGCCAGA CTCGTGAGCA 
GCAGCTCATC GTTGCCATCA 
CCCGTTTCGG TGAAATCATC 
GGCTACGACC CCAAGGGTGT 
TGACAACATG ATTGAGCCCT 
ACAAGGAGAC CAAGGCCGGT 
GCCATCGATG CCATTGACAT 
TCTCCCACTC CAGGATGTCT 
TCGGTCGTGT TGAGACTGGT 
GCCCCCGCCA ACGTCACCAC 
GCAGCTYGTC CAGGGTGTTC 
ACGTYTCTGT CAAGGAAGTC 
AACGACCCAC CAGCTGGCTG 
CAACCACCCC GGCCAGATCG 
ACACTGCCCA CATTGCTTGC 
CGCCGTACTG GTAAATCCGT 
TGATGCCGCT ATTGCCACCA 
CTTTC 



CACCATTGAT ATCGCCCTCT 
CCGTCATCGG TATGCTTTAT 
CTAACAAAAA GTAGATGCCC 
TTACTGGTAC CTCCCAGGCC 
ACTGGTGAGT TCGAGGCTGG 
CGCCCTGCTC GCTTTCACCC 
ACAAGATGGA CACCACCAAC 
AAGGAAGTCA CCAACTTCAT 
CCCATTCGTC CCAATCTCTG 
CCACCAACTG CCCATGGTAC 
GGCAAATCCT CTGGTAAGAC 
GCCCACTCGT CCCACCGACA 
ACAAGATCTC TGGTATCGGA 
ATCATCAAGC CTGGTATGGT 
TGAAGTCAAG TCCGTCGAAA 
CCGGTGACAA CGTTGGCTTC 
CGCCGTGGTA ACGTTGCCGG 
CGCCTCTTTC AAGGCCCAGG 
GTGCTGGTTA CGCCCCAGTC 
AAGTTCTCTG AGCTTCTTGA 
CGAAACCAGC CCTAAGTTCG 
TGGTTCCATC CAAGCCCATG 



GGAAGTTCGA 


50 


CTGTTTCCCA 


100 


CCGGGCACCG 


150 


GACTGCGCTA 


200 


TATCTCCAAG 


250 


TCGGTGTCAA 


300 


TGGTCTGAGT 


350 


CAAGAAGGTC 


400 


GCTTCAACGG 


450 


AAGGGATGGA 


500 


CCTCCTTGAG 


550 


AGCCTCTCCG 


600 


ACAGTACCAG 


650 


TGTCACTTTY 


700 


TGCACCACCA 


750 


AACGTCAAGA 


800 


TGATTCCAAG 


850 


TCATCGTCCT 


900 


CTTGACTGCC 


950 


GAAGATTGAC 


1000 


TCAAGTCTGG 


1050 


TGCGTTGAGG 


1100 




1105 



2) INFORMATION FOR SEQ ID NO: 1271 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271 

5 

GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA AGTTCCAGAC 50 

CCCTAAGTAT GAGGTCACCG TCATTGGTAT GTTGTCCTTC TTGTGTTACC 100 

ATCGAAACAT ATCTAACCTA CAACTGCAGA CGCCCCCGGT CACCGTGACT 150 

TCATCAAGAA CATGATCACT GGTACCTCCC AGGCCGACTG CGCTATTCTC 200 

10 " ATCATTGCTT CCGGTACTGG TGAATTCGAG GCTGGTATCT CCAAGGATGG 250 

CCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTCGGT GTCCGTCAGC 300 

TCATCGTTGC CCTCAACAAG ATGGACACTG CTGGCTGGGC TGAGGCTCGT 350 

TACAACGAAA TCGTCAAGGA AACTTCCGGT TTCATCAAGA AGGTCGGCTA 400 

CAACCCCAAG TCGGTTCCCT TCGTCCCCAT CTCCGGTTTC AACGGTGACA 450 

15 ACATGCTTGA GCCCTCCTCC AACTGCCCCT GGTACAAGGG TTGGGAGAAG 500 

GAGACCAAGG CTGGTAAGGC CACTGGTAAG ACCCTCCTCG AGGCCATCGA 550 

CGCCATTGAG CCTCCCGTCC GTCCCTCCAA CAAGCCTCTC CGTCTTCCCC 600 

TCCAGGATGT CTACAAGATC TCTGGTATTG GAACTGTCCC CGTCGGCCGT 650 

GTCGAGACCG GTACCATCGT CCCCGGTATG GTCGTCACCT TCGCTCCCGC 700 

20 CAACGTCACC ACTGAAGTCA AGTCCGTTGA GATGCACCAC CAGCAGCTCA 750 

AGGAGGGTGT TCCCGGTKAC AACGTTGGTT TCAACGTGAA GAACGTTTCC 800 

GTCAAGGAAG TCCGCCGTGG TAACGTCGCT GGTGACTCCA AGAACGACCC 850 

CCCTGCCGGT GCTGCCTCTT TCACCGCCCA GGTCATCGTC CTCAACCACC 900 

CCGGTCAGGT CGGCGCTGGT TACGCTCCCG TCCTCGACTG CCACACCGCT 950 

25 CACATTGCCT GCAAGTTCGC TGAGCTCCAG GAGAAGATCG ACCGCCGTAC 1000 

CGGAAAGTCT GTCGAATYTG CCCCCAAGTT CATCAAGTCT GGTGACGCCG 1050 

CTATCGTCAA GATGATTCCC TCCAAGCCCA TGTGTGTCGA GTCTTTCACT 1100 

GACTACCCTC CTYTCGGCCG TTTCGCCGTC CGTGACGTAA GTTCTTTCCC 1150 

CAGCTTTTCG ATGCTACCCT TCTMTGAATC ACGTGTCATG TCTTGGCACC 1200 

3 0 CGCCCATCAC ATGACCACGC AACCCTATAC CCCGCCACAC CCTT 1244 



2) INFORMATION FOR SEQ ID NO: 1272 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Exophiala moniliae 

(B) STRAIN: WSA-219 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272 

50 GCTGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACTATGTCA CCGTCATCGA CGCCCCCGGT CATCGTGACT 100 

TCATCAAGAA CATGATCACT GGTACTTCCC AAGCTGACTG CGCCATTCTC 150 

ATCATTGCTG CCGGTACTGG TGAATTCGAA GCCGGTATCT CCAAGGATGG 200 
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TCAGACCCGT GAGCACGCTC 

TCATTGTCGC CATCAACAAG 

TTCAACGAAA TCATCAAGGA 

CAACCCCAAG TCCGTTCCTT 

5 ACATGATCGA CGTCTCCACC 

GAGACCAAGG CTGGCAAGGC 

CGCCATTGAC CCCCCCTCTC 

TCCAGGATGT GTACAAGATC 

. GTCGAGACTG GTATCATCAA 

10 CAACGTCACC ACTGAAGTCA 

CCGAGGGTGT TCCAGGTGAC 

GTCAAGGAGG TTCGTCGTGG 

ACCCAAGGGC GCTGATTCCT 

CTGGTCAAGT TGGTGCTGGC 

15 CACATTGCTT GCAAGTTCTC 

CGGAAAGTCG ATCGAAAACA 

CCATCGTCAA GATGGTTCCC 



TGCTTGCCTA CACCCTGGGT 
ATGGACACTA CCAAGTGGTC 
GACTTCCAGC TTCATCAAGA 
TCGTCCCCAT CTCCGGCTTC 
AACTGCCCCT GGTACAAGGG 
CTCTGGCAAG ACTCTCCTTG 
GTCCCACCGA CAAGCCTYTC 
TCTGGTATCG GAACGGTGCC 
GGCCGGTATG GTCGTTACCT 
AGTCCGTCGA AATGCACCAC 
AACGTCGGTT TCAACGTCAA 
AAACGTTTGC GGTGACTCCA 
TCAACGCCCA, GGTCATCGTC 
TACGCCCCAG TGTTGGATTG 
TGAGCTTCTC GAGAAGATCG 
ACCCCAAGTT CATCAAGTCT 
AGCAAGCCCA TG 



GTCAAGCAGC 250 

TGAGGACCGT 300 

AGGTCGGCTA 350 

AACGGTGACA 400 

CTGGGAGAAG 450 

AGGCCATCGA 500 

CGTCTCCCTC 550 

CGTCGGTCGT 600 

TCGCTCCTGC 650 

GAACAGGTGG 700 

GAACGTYTCC 750 

AGAACGACCC 800 

TTGAACCACC 850 

CCACACTGCC 900 

ATCGTCGKAC 950 

GGTGACGCTG 1000 
1032 



20 



2) INFORMATION FOR SEQ ID NO: 1273 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hortaea werneckii 

(B) STRAIN: ATCC 34944 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1273 

TAACAGCCCG TGCGACCGTC TCGTCCAATA ACTAATGATT CGCAGGTACT 50 

ATGTCGTGAG TATCCGGTCC TTTTTTGTTA ATTTACCAGA AATGACGAAA 100 

TTTTTGACTA ATTAACACAC TCAGACCGTC ATTGACGCCC CGGGTCACCG 150 

40 TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCC GACTGCGCTG 200 

TCCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA 300 

GCAGCTCATC GTCGCCATCA ATAAGATGGA. CACCACCAAG TGGTCCGAGG 350 

AGCGTTACGG CGAGATCATC AAGGAGACCT CTGCCTTCAT CAAGAAGGTC 400 

45 GGTTTCAACC CGAAGCACGT CCCGTTCGTC CCGATCTCCG GTTTCAACGG 450 

TGACAACATG ATCGAGGCCT YCACCAACTG CCCGTGGTAC AAGGGCTGGG 500 

AGAAGGAGAC CAAGGCCAAG GTCACCGGCA AGACCCTYCT TGAGGCCATT 550 

GACAACATCG ACCCGCCGAG CCGTCCTTCC GACAAGCCGC TCCGTCTTCC 600 

CCTCCAGGAT GTCTACAAGA TCGGTGGTAT TGGGACAGTC CCAGTCGGCC 650 

50 GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTTAC CTTCGCTCCG 700 

GCTGGTGTCA CCACTGAAGT GAAGTCCGTT GARATGCACC ACGAGCAGCT 750 

CGCTGAGGGT YTGCCGGGTG ACAACGTCGG CTTCAACGTC AAGAACGTTT 800 

CCGTCAAGGA GATCCGTCGT GGCAACGTTG CTGGTGACAG CAAGGCTGAC 850 
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CCGCCGAAGG GCTGCGACAG CTTCAACGCC 
CCCTGGCCAG GTCGGTGCTG GTTACGCTCC 
CCCACATTGC CTGCAAGTTC GGCGAGCTCC 
TYTGGCAAGT CCATTGAAGC CTYGCCTAAG 
5 TGCCATYGTC AAGATGATTC CGTCCAAGCC 
CTGAGT 



PCT/CA00/01150 

CAGGTCATCG TCCTGAACCA 900 
AGTCCTGGAC TGCCACACTG 950 
TCGAGAAGAT CGACCGTCGC 1000 
TACATCAAGT CTGGTGACGC 1050 
GATGTGCGTT GAGCCATTCA 1100 

1106 



10 2) INFORMATION FOR SEQ "ID NO: 1274 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY:' Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 



25 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274 



CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCCGCTACT ATGTCACCGT CATTGGTATG TTGCTGTCAC 100 

CTCTCTCACA CATGTCTCAC CACTAACAAT CAACAGACGC CCCCGGCCAC 150 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGACTGCGC 200 

30 CATTCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 250 

AGGATGGCCA GACCCGTGAG CACGCCCTGC TCGCCTACAC CCTCGGTGTC 300 

AAGAACCTCA TTGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 350 

GTCCCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 4 00 

TCGGCTACAA CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC 450 

35 GGCGACAACA TGCTTACTCC CTCCACCAAC TGCCCCTGGT ACAAGGGCTG 500 

GGAGCGTGAG ATCAAGTCCG GCAAGCTCAC TGGCAAGACC CTCCTCGAGG 550 

CCATTGACTC CATCGAGCCC CCCAAGCGTC CCGTCGACAA GCCCCTYCGA 600 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATTGGCA CGGTTCCCGT 650 

CGGCCGTATC GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTMG 700 

40 CCCCCTCCAA CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG 750 

CAGCTCTYTG AGGGTCTTCC CGGTGACAAC GTCGGCTTCA ACGTGAAGAA 800 

CGTYTCCGTC AAGGAGATCC GACGTGGCAA CGTCGCTGGT GACTCCAAGA 850 

ACGACCCCCC TYTGGGTGCC GCCTCTTTCA CCGCCCAGGT CATTGTCCTC 900 

AACCACCCTG GCCAGGTCGG TGCCGGTTAC GCCCCCGTTY TGGACTGCCA 950 

45 CACTGCCCAC ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC 1000 

GCCGAACTGG TAAGGCTGTT GAGTCCGCCC CCAAGTTCAT CAAGTCTGGT 1050 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1100 

TTTCACTGAC TACCCCCCT 1119 



2) INFORMATION FOR SEQ ID NO: 1275 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

--■ - -(v.i). ORIGINAL SOURCE: 

10 (A) ORGANISM: Auf ebbasidium pullvlans - - -■ 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1275 

15 CTGAAGTCTG AGCGTGAGCG TGGTATCACT ATCGATATCG CCCTCTGGAA 50 

GTTCGAGACC CCCAAGTACA TGGTCACCGT CATCGATGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC TGACTGCGCC 150 

ATTCTCATCA TTGCTGCCGG TACTGGTGAG TTCGAGGCTG GTATCTCCAA 200 

GGATGGCCAG ACTCGTGAGC ACGCTCTCCT CGCCTACACC CTTGGTGTCA 250 

20 AGCAGCTCAT CGTTGCCATC AACAAGATGG ACACCACCAA GTGGTCTGAG 300 

GCCCGTTACC AGGAGATCAT CAAGGAGACC TCCGGTTTCA TCAAGAAGGT 350 

CGGCTACAAC CCCAAGCACG TTCCCTTCGT CCCCATCTCC GGCTTCAACG 400 

GAGACAACAT GATCGAGGTC AGCTCCAACT GCCCTTGGTA CAAGGGTTGG 450 

GAGAAGGAGA CCAAGGCCAA GGCCACTGGC AAGACCCTCC TCGAGGCCAT 500 

25 TGACGCCATC GACCCTCCTT CCCGTCCTAC CGACAAGCCC CTCCGTCTTC 550 

CCCTCCAGGA TGTCTACAAG ATCGGTGGTA TTGGCACGGT GCCCGTCGGT 600 

CGTGTTGAGA CCGGTAAGAT CATGGGTGGT ATGGTTGTCA CCTTCGCCCC 650 

CGCTGGTGTC ACCACCGAGG TCAAGTCCGT CGAGATGCAC CACGAGCAGC 700 

TCACCGAGGG TCTTCCCGGT GACAACGTCG GCTTCAACGT CAAGAACGTY 750 

30 TCCGTCAAGG AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA 800 

CCCCCCCAAG GGTTGTGACT CCTTCAACGC CCAGGTCATC GTCCTGAACC 850 

ACCCTGGTCA GGTCGGTGCT GGTTACGCAC CCGTCCTCGA CTGCCACACT 900 

GCCCACATTG CTTGCAAGTT YTCCGAGCTT GTTGAGAAGA TTGACCGCAG 950 

AACCGGCAAG TCCGTTGAGG CTGCCCCCAA GTTCATCAAG TCTGGTGACG 1000 

35 CCGCCATCGT CAAGATGGTT CCCTCCAAGC CCATGTGTGT TGAGGCTTTC 1050 

ACCGACTACC CTCCTYTCGG TCGTTTCGCC GTCCGGTATG TTTTTCTTCT 1100 

TCAATCATTC TTC 1113 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1276 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 731 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 14112 

691 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1276 

GGTCTGAGAC CCGTTTCAAC GAAATTATCA AGGAAGTCAG CAACTTCATC 50 

5 AAGAAGGTCG GATACAACCC CAAGTCTGTT CCCTTCGTGC CAATCTCCGG 100 

TTTCGAGGGT GACAACATGA TTGAGCCTTC CCCCAACTGC CCTTGGTACA 150 

AGGGCTGGAA CAAGGAGACT GCTGCTGGCA AGGCCGCCGG TAAGACTCTT 200 

CTCGATGCCA TTGACGCCAT CGACCCCCCC GTCCGTCCTA CCGAGAAGCC 250 

TCTCCGTCTT CCCCTCCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 300 

10 TTCCCGTTGG ACGTGTCGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTG 350 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TTGAAATGCA 400 

CCACCAGCAG CTCCAGGCTG GTTACCCTGG TGACAACGTC GGCTTCAACG 450 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCGGTGAC 500 

TCCAAGAACG ACCCCCCCAA GGGCTGCGAG TCCTTCAACG CCCAGGTCAT 550 

15 CGTCCTCAAC CACCCCGGCC AGGTTGGCGC TGGTTATGCC CCAGTCCTCG 600 

ACTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT CATCGAGAAG 650 

ATTGACCGCC GTACCGGAAA GTCTGTTGAG GACAACCCCA AGTTCATCAA 700 

GTCCGGTGAT GCTGCTATCG TCAAGA'TGAT T 731 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1277 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1046 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala dermatitidis 

(B) STRAIN: ATCC 76088 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1277 



CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCCCTCTG GAAGTTCGAG 50 

ACCCCCAAGT ACTATGTCAC CGTCATCGAC GCCCCGGGTC ATCGTGACTT 100 

4 0 TATCAAGAAC ATGATCACTG GTACCTCGCA GGCCGACTGC GCCATCTTGA 150 

TCATTGCCGC CGGTACCGGT GAATTCGAAG CCGGTATCTC CAAGGATGGT 200 

CAGACCCGTG AGCACGCTCT GCTCGCCTAC ACCTTGGGTG TCAAGCAGCT 250 

CATCGTCGCC ATCAACAAGA TGGACACCAC CAAGTGGTCC GAGGAGCGTT 300 

TCAACGAAAT CATCAAGGAG ACTTCCAACT TCATCAAGAA GGTCGGCTAC 350 

4 5 AACCCCAAGG CCGTTCCTTT CGTCCCCATC TCCGGCTTCA ACGGTGACAA 400 

CATGATTGAG GTCTCCACCA ACTGCCCGTG GTACAAGGGA TGGGAGAAGG 450 

AGTCCAAGGC TGGCAAGGCC ACCGGCAAGA CCCTCCTCGA GGCCATTGAC 500 

GCCATCGACC CACCCACCCG TCCCACCGAC AAGCCTCTCC GTCTCCCTCT 550 

CCAGGATGTC TACAAGATCT CTGGTATCGG AACGGTTCCT GTCGGTCGTG 600 

50 TCGAGACCGG TACCATCAAG GCCGGTATGG TCGTCACCTT CGCTCCGGCC 650 

AACGTCACCA CTGAAGTCAA GTCCGTCGAA ATGCACCACG AGCAGCTCGC 700 

CGAGGGTYTG CCAGGTGACA ACGTTGGCTT CAACGTCAAG AACGTYTCCG 750 

TCAAGGAGGT TCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCG 800 

692 
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CCCAAGGGTG CCGAGTCCTT CAACGCCCAG 

TGGTCAGATC GGTGCCGGCT ACGCTCCAGT 

ACATTGCTTG CAAGTTCGCC GAGTTGCTCG 

GGAAAGTCCA TCGAGAACAA CCCCAAGTTC 

5 CATCGTCAAG ATGATTCCCA GCAAGCCCAT 



GTCATTGTCC TCAACCACCC 850 

CTTGGATTGC CACACTGCCC 900 

AGAAGATCGA CCGTCGTACC 950 

ATCAAGTCTG GTGATGCTGC 1000 

GTGTGTCGAG GCTTTC 104 6 



2) INFORMATION FOR SEQ ID NO: 1278 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278 



25 CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT TCGAGACTCC 50 

TCGCTACTAT GTCACCGTCA TTGGTATGTT GTCGCTCATA CCTCATCCTA 100 

CTTCCTCATA CTAACACATC ATTCAGACGC TCCCGGTCAC CGTGATTTCA 150 

TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGATTGCGC CATTCTCATC 200 

ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA AGGATGGCCA 250 

30 GACCCGTGAG CACGCTCTTC TTGCCTACAC CCTTGGTGTC AAGAACCTCA 300 

TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCTGA GGCCCGTTAC 350 

CAGGAGATCA TCAAGGAGAC CTCCTCTTTC ATCAAGAAGG TCGGCTACAA 400 

CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC GGTGACAACA 450 

TGCTTACCCC CTCCACCAAC TGCCCCTGGT ACAAGGGTTG GGAGCGTGAG 500 

35 ATCAAGTCCG GCAAGCTCTC CGGCAAGACC CTCCTCGAGG CCATTGACTC 550 

CATCGAGCCT CCCAAGCGTC CCGTTGACAA GCCCCTCCGT CTTCCCCTCC 600 

AGGATGTCTA CAAGATCGGT GGTATTGGAA CGGTTCCCGT CGGCCGTATT 650 

GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTCG CTCCCTCCAA 700 

CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG CAGYTCAGTK 750 

40 AGGGCCAGCC CGGTGACAAC GTTGGTTTCA ACGTGAAGAA CGTTTCCGTC 800 

AAGGACATCC GACGTGGTAA CGTCGCTGGT GACTCCAAGA ACGACCCCCC 850 

CCAGGGTGCC GCTTCTTTCA CCGCCCAGGT CATCGTCCTC AACCACCCCG 900 

GCCAGGTCGG TGCTGGTTAC GCTCCTGTCC TCGATTGCCA CACTGCCCAC 950 

ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC GCCGAACCGG 1000 

45 TAAGGCTACT GAGGCCGCTC CCAAGTTCAT CAAGTCTGGT GACTCCGCCA 1050 

TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GTGTCGAGGC TTTCACTGAC 1100 

TACCCTCCT 1109 



50 

2) INFORMATION FOR SEQ ID NO: 1279 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 7 65 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

- - (A) ^ORGANISM: Aspergillus terreus 

10 (B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: , SEQ ID NO: 1279 



CCTGCAAGTG GTCTGAGGAC CGTTACAACG AAATCGTGAA GGAGACCTCC 50 

15 AACTTCATCA AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC 100 

CATCTCCGGT TTCAACGGTG ACAACATGCT TGAGCCTTCC CCCAACTGCC 150 

CCTGGTACAA GGGTTGGGAG AAGGAGGGCA AGTCCGGCAA GGTCACCGGT 200 

AAGACTCTCC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCTGC 250 

CAACAAGCCT CTCCGTCTTC CCCTCCAGGA TGTGTACAAG ATCTCTGGTA 300 

20 TCGGAACTGT CCCCGTCGGC CGTGTCGAGA CTGGTGTCAT CACCCCCGGC 350 

ATGGTTGTTA CCTTCGCTCC TTCCAACGTC ACCACTGAAG TGAAGTCCGT 400 

TGAGATGCAC CACCAGCAGC TCAAGGAGGG TCTCCCCGGT GACAACGTTG 4 50 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AGGTCCGTCG TGGTAACGTC 500 

GCTGGTGACT CCAAGAACGA CCCCCCTGCT GGCGCTGCCT CCTTCACCGC 550 

25 CCAGGTCATC GTTCTCAACC ACCCCGGTCA GGTCGGCGCT GGCTACGCCC 600 

CCGTCCTCGA CTGCCACACT GCCCACATTG CCTGCAAGTT CGCTGAGCTC 650 

CAGGAGAAGA TTGACCGCCG TACCGGAAAG TCTGTTGAGT CTTCTCCCAA 700 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATC CCTYCAAAGC 750 

CCATGTGCGT CGAAG 7 65 

30 



2) INFORMATION FOR SEQ ID NO: 1280 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
45 (B) STRAIN: ATCC 64746 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280 

GCGTGGTATC ACCATCGACA TTGCCCTCTG GAAGTTCCAG ACTCCCAAGT 50 

50 ATGAGGTCAC TGTCATCGGT AAGCTCGACT CGCCCCGATA TGTTTTGGTG 100 

CTGTAGCTAA CACGATCTGA AGATGCCCCC GGTCACCGTG ACTTCATCAA 150 

GAACATGATC ACTGGTACCT CCCAGGCTGA CTGCGCTATC CTCATCATTG 200 

CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACC 250 

694 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CGTGAGCACG CTCTGCTGGC TTTCACCCTC GGTGTCAAGC AGCTCATCGT 300 

CGCCCTCAAC AAGATGGACA CCTGCAAGTG GTCCGAGGAT CGTTACAACG 350 

AAATTGTCAA GGAAACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 400 

AAGGCCGTTC CCTTCGTMCC CATCKCTGGC TTCAACGGTG ACAACATGCT 450 

5 TGAGCCCTCC TCCAACTGCC CCTGGTACAA GGGATGGGAG AAGGAGACCA 500 

AGGCCGGCAA GGTCACTGGT AAGACCCTCA TCGAGGCCAT YGACGCCATT 550 

GAGCCCCCTG TCCGTCCCTC CAACAAGCCC CTYCGTCTTC CCCTCCAGGA 600 

TGTYTACAAG ATCTCTGGTA TCGGAACGGT CCCTGTCGGC CGTGTCGAGA 650 

. CCGGTATCAT CAAGCCCGGC ATGGTCGTCA CCTTTGCCCC CGCCAACGTC 700 

10 ACCACTGAAG TCAAGTCCGT CGAAATGCAC CACCAGGAGC- TCCAGGAGGG . . 750 

TGTCCCCGGT GACAACGTCG GTTTCAACGT CAAGAACGTT TCCGTCAAGG 800 

AAGTCCSCCG TGGTAACGTT TGCGGTGACT CCAAGAACGA TCCCCCTCAG 850 

GGTGCTGCCT CCTTCAACGC CCAGGTCATC, GTCCTCAACC ACCCCGGTCA 900 

GGTCGGCGCT GGTTACGCCC CCGTCCTCGA CTGCCACACT GCCCACATTG 950 

15 CTTGCAAGTT CTYTGAGCTG CTTGAGAAGA TTGACCGCCG TACCGGCAAG 1000 

TCTGTTGAGA ACAACCCCAA GTTCATCAAG TCCGGTGAWG CCGCCATCGT 1050 

GAAGATGGTT CCTTCCAAGC CCATGTGTGT CGAGTCCTTC ACTGACTACC 1100 

CCCCT 1105 

20 

2) INFORMATION FOR SEQ ID NO: 1281 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus laurentii 

(B) STRAIN: ATCC 44096 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281 

u 

GCCGAGCGAG AGCGAGGTAT CACCATCGAC ATTGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAACGTCA CCGTCATTGA CGCCCCCGGA CACCGAGACT 100 

40 TCATCAAGAA CATGATCACC GGTACCTCCC AGGCCGACTG CGCCATCCTT 150 

ATCATTGCCA CCGGTATCGG AGAGTTCGAG GCCGGTATCT CTAAGGACGG 200 

TCAGACCCGA GAGCACGCTT TGCTCGCCTT CACCCTCGGT GTCAGGCAGC 250 

TCATCGTTGC TTGCAACAAG ATGGACACCT GCAAGTGGTC CGAGGACCGA 300 

TTCAACGAGA TCGTCAAGGA GACCAACGGT TTCATCAAGA AGGTCGGATA 350 

45 CAACCCCAAG GCTGTTCCCT TCGTCCCCAT CTCTGGATGG CACGGAGACA 400 

AGTGAGTGCC GTTCTTTGCG TTGAGCCCTC TTTGTCGCTC CCCCTCCCTC 450 

TCAAGTGGCG GCGGCGGTCT CCACCCACAA ATCGGGTGGC GAATCCGCCA 500 

CACCCACCAC TTCTCGCCAC CGAGTGTGGC ACTTCTTCCA ACTCCTCTTT 550 

CCACTCCTCC TCGTCTCGTC TCTTTTTTTC TCCGTTGTCT TTGACAAGGG 600 

50 GAGTGTGCTG ATAGTAAAGC ATGCTTGAGG AGACCACCAA CATGCCGTGG 650 

TACAAGGGAT GGACCAAGGA GACCAAGTCC GGTGTCGTTA AGGGTAAGAC 700 

CCTCCTCGAC GCCATCGACG CCATCGAGCC TCCTCAACGA CCCACCGACA 750 

AGCCCCTCCG ACTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATCGGT 800 

695 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ACGGTGCCCG TCGGCCGAGT CGAGACCGGT GTCATCAAGG CCGGGTAAGT 850 

CACGGGAGCC TGACGGCTGT TGTTGCCACA CCCAACTTAT ATCCAGTATG 900 

GTCGTCACCT TCGCTCCTAC CAACGTCACC ACTGAGGTTA AGTCCGTTGA 950 

GATGCACCAC GAGCAGATCC CTGAGGGTCT TCCCGGAGAC AACGTTGGTT 1000 

5 TCAACGTGAA GAACGTTTCC ATCAAGGACA TCCGACGAGG AAACGTYTGC 1050 

TCCGACTCCA AGAACGACCC CGCTAAGGAG GCCGCTTCTT TCAACGCCCA 1100 

GGTCATTGTC CTCAACCACC CTGGACAGAT TGGTGCCGGT TACACCCCCG 1150 

TCCTCGACTG CCACACCGCC CACATTGCCT GCAAGTTCGC CGAGCTCATC 1200 

GAGAAGATCG ACCGACGAAC TGGTAAGACC ATGGAGGCCG CCCCCAAGTT 1250 

10 ' CGTCAAGTCC GGAGACGCCG CCATTGTCAA GCTCGTTGCC - CAGAAGCCCA .1300 

TCTGTGTCGA GTCTTACTCT GACTACCCTC CCCTTGGACG ATT 1343 



15 2) INFORMATION FOR SEQ ID NO: 1282 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsia parva 

(B) STRAIN: ATCC 10784 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282 



TGGTCCGAGG CTCGTTTCAA CGAAATCATC AAGGAAGTCA CCAACTTCAT 50 

CAAGAAGGTC GGATACAACC CCAAGTCTGT TCCCTTCGTG CCAATCTCTG 100 

GTTTCGAGGG TGATAACATG ATTGAGCCCT CCACCAACTG CCCCTGGTAC 150 

AAGGGCTGGA CCAAGGAGAC CGCTGCCGGC AAGTCAACCG GTAAGACTCT 200 

35 TCTTGACGCC ATTGATGCCA TCGATCAACC CTCCCGTCCT ACCGACAAGC 250 

CCCTCCGTCT TCCCCTCCAG GATGTGTACA AGATCTCCGG TATTGGCACT 300 

GTTCCCGTTG GACGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTCGT 350 

GACCTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 400 

ACCACCAACA GCTCTTGGCT GGTAACCCCG GTGACAACGT CGGTTTCAAC 450 

40 GTCAAGAACG TTTCCGTCAA GGAAGTCCGC CGTGGCAACG TTGCTGGTGA 500 

CTCAAAGAAC GACCCCCCCA AGGGCTGCGA CTCCTTCAAC GCCCAGGTCA 550 

TCGTCCTCAA TCACCCCGGT CAAGTTGGCG CTGGTTATGC CCCAGTCCTC 600 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TCCTCGAGAA 650 

GATTGACCGC CGTACCGGAA AGTCCACTGA GAACAACCCC AAGTTCATCA 700 

45 AGTCTGGTGA CGCCGCTATC GTCAAGATGG TTCC 734 



2) INFORMATION FOR SEQ ID NO: 1283 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

696 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 62877 

(xi) SEQUENCE DESCRIPTION :~ SEQ ID NO: 1283 - - 



CGTGAGCGTG GTATCACCAT 
CCGCTACTAT GTCACCGTCA 
TGTCTCACCA CTAACAATCA 
AAGAACATGA TCACTGGTAC 
TGCCGCTGGT ACTGGTGAGT 
CCCGTGAGCA CGCCCTGCTC 
GTCGCCATCA ACAAGATGGA 
GGAGATCATC AAGGAGACCT 
CCAAGGCTGT CGCTTTCGTC 
CTTACTCCCT CCACCAACTG 
CAAGTCCGGC AAGCTCACTG 
TCGAGCCCCC CAAGCGTCCC 
GATGTCTACA AGATCGGTGG 
GACTGGTGTC ATCAAGCCCG 
TCACCACTGA AGTCAAGTCC 
GGTCTTCCCG GTGACAACGT 
GGAGATCCGA CGTGGCAACG 
TGGGTGCCGC CTCTTTCACC 
CAGGTCGGTG CCGGTTACGC 
TGCCTGCAAG TTCGCCGAGA 
AGGCTGTTGA GTCCGCCCCC 
GTCAAGATGG TTCCCTCCAA 
CCCCCCT 



CGACATTGCC CTCTGGAAGT 
TTGGTATGTT GCTGTCACCT 
ACAGACGCCC CCGGCCACCG 
TTCCCAGGCC GACTGCGCCA 
TCGAGGCTGG TATCTCCAAG 
GCCTACACCC TCGGTGTCAA 
CACCACCAAG TGGTCCGAGT 
CCAACTTCAT CAAGAAGGTC 
CCCATCTCCG GTTTCAACGG 
CCCCTGGTAC AAGGGCTGGG 
GCAAGACCCT CCTCGAGGCC 
GTCGACAAGC CCCTCCGACT 
TATTGGCACG GTTCCCGTCG 
GTATGGTCGT TACCTTCGCC 
GTCGAGATGC ACCACGAGCA 
CGGCTTCAAC GTKAAGAACG 
TCGCTGGTGA CTCCAAGAAC 
GCCCAGGTCA TTGTCCTCAA 
CCCCGTTYTG GACTGCCACA 
TCCAGGAGAA GATCGACCGC 
AAGTTCATCA AGTCTGGTGA 
GCCCATGTGC GTTGAGGCTT 



TCGAGACTCC 


c a 

50 


CTCTCACACA 


100 


TGACTTCATC 


t c r\ 

150 


TTCTCATCAT 


o a n 
200 


GATGGCCAGA 


OCA 

250 


GAACCTCATT 


O A A 

300 


CCCGTTACCA 


OCA 

JoO 


GGCTACAACC 


400 


CGACAACATG 


450 


AGCGTGAGAT 


500 


ATTGACTCCA 


550 


TCCCCTCCAG 


600 


GCCGTATCGA 


650 


CCCTCCAACG 


700 


GCTCTYTGAG 


750 


TYTCCGTCAA 


800 


GACCCCCCTY 


850 


CCACCCTGGC 


900 


CTGCCCACAT 


950 


CGAACTGGTA 


1000 


CTCCGCCATC 


1050 


TCACTGACTA 


1100 




1107 



2) INFORMATION FOR SEQ ID NO: 1284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284 
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GCTCAAGGCC GAGCGTGAGC GCGGTATCAC CATCGATATT GCTCTGTGGA 50 

AGTTCGAGAC CCCCAAGTAC TACGTCACCG TCATTGACGC CCCCGGTCAT 100 

CGCGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

5 AGGATGGCCA GACTCGTGAG CACGCTCTGC TCGCCTACAC CCTGGGTGTG 250 

CGGCAGCTGA TCGTCGCCAT CAACAAGATG GACACGGCCA AGTGGGCTGA 300 

GGCTCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGACT GTTGCCTTCG TCCCCATCTC GGGCTTCCAC 400 

■ GGCGACAACA TGCTTACTCC CTCGACCAAC TGCCCCTGGT ACAAGGGCTG 450 

10 GGAGAAGGAG GGCAAGAGCG GCAAGGTTAC" CGGTAAGACT CTGCTGGACG 500 

CCATTGACGC CGTCGAGCCC CCCAAGCGCC CCACGGACAA GCCCCTGCGT 550 

CTGCCCCTCC AGGATGTCTA CAAGATCGGC GGTATCGGCA CTGTCCCTGT 600 

CGGCCGTATC GAGACTGGTG TCCTGAAGCC, CGGCATGGTC GTCACCTTTG 650 

CCCCGTCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

15 CAGCTTGTTG AGGGTGTTCC CGGCGACAAC GTCGGCTTCA ACGTCAAGAA 750 

CGTYTCCGTC AAGGAGATCC GTYGTGGCAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCCCC CTCGGGCGCC GCCACCTTCA ACGCCCAGGT CATTGTCCTG 850 

AACCACCCCG GCCAGGTCGG CAACGGCTAC GCCCCGGTTY TGGACTGCCA 900 

CACCGCCCAC ATTGCCTGCA AGTTCACCGA GATCCTTGAG AAGATCGACC 950 

20 GCCGTACCGG CAAGTCGGTT GAGAACAACC CCAAGTTCAT CAAGTCGGGT 1000 

GACGCCGCCA TTGTCAAGCT GACGCCCTYG AAGCCCATGT GCGTT 1045 



25 2) INFORMATION FOR SEQ ID NO: 1285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergiiius nidulans 

(B) STRAIN: WSA-176 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285 



TTGCCAGTGG TCCGAGGCCC GTTACAACGA AATCGTCAAG GAGACTTCCG 50 

GTTTCATCAA GAAGGTCGGA TACAACCCCA AGTCCGTTGC CTTCGTCCCC 100 

ATCTCCGGTT TCAACGGTGA CAACATGCTC GAGGCCTCTA CCAACTGCCC 150 

CTGGTACAAG GGTTGGGAGA AGGAGACCAA GGCCGGTAAG GCCACTGGTA 200 

45 AGACCCTCCT TGAGGCCATC GACGCCATTG AGCCCCCCAC CCGTCCCTCC 250 

AACAAGCCCC TCCGTCTTCC CCTCCAGGAT GTCTACAAGA TCTCCGGTAT 300 

TGGAACTGTC CCCGTCGGCC GTGTCGAGAC TGGTGTTATC ACCCCCGGCA 350 

TGGTCGTCAC CTTCGCTCCT GCCAACGTCA CCACTGAAGT CAAGTCCGTT 400 

GAGATGCACC ACCAGCAGCT CAAGGAGGGT GTCCCCGGTG ACAACGTCGG 450 

50 TTTCAACGTC AAGAACGTTT CCGTCAAGGA AATCCGTCGT GGTAACGTTG 500 

CCTCCGACTC CAAGAACGAC CCCGCCTCCG GCGCTGCCTC TTTCAACGCC 550 

CAGGTCATCG TTCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC 600 

CGTCCTCGAC TGCCACACCG CCCACATTGC TTGCAAGTTC TCTGAGCTTC 650 

698 
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TTGAGAAGAT TGACCGCCGT ACCGGAAAGG CTGTTGAAAC CAGCCCCAAG 
TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC CTTCCAAGCC 
CATGTGCGKT CCGA 



700 
750 
764 



10 
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2) INFORMATION FOR SEQ ID NO: 1286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 16264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286 

ACCATCGATA TCGCGCTCTG GAAGTTCGAG ACTCCCAAGT ACTTCGTCAC 50 

CGTCATCGAT GCCCCTGGTC ATCGTGACTT CATCAAGAAC ATGATCACTG 100 

GTACCTCCCA GGCTGATTGT GCTATTCTCA TCATTGCCGC TGGTACTGGT 150 

GAGTTCGAGG CCGGTATCTC CAAGGATGGC CAGACCCGTG AGCATGCTCT 200 

GCTCGCCTAC ACCCTGGGCG TGAAGCAGCT TATCGTCGCC ATCAACAAGA 250 

TGGACACCAC CAAATGGTCT GAGGATCGTT TCAACGAAAT CATCAAGGAG 300 

ACTTCCAACT TCATCAAGAA GGTCGGATAC AACCCCAAGT CCGTTCCATT 350 

CGTGCCCATC TCCGGCTTCA ACGGTGACAA CATGATCGAC GTCTCCACCA 400 

ATGCCCCCTG GTACAAGGGC TGGGAAAAGG AGTCCAAGGC TGGCAAGGCC 450 

ACCGGCAAGA CCCTCCTTGA GGCTATCGAC TCCATCGACC CTCCTGCTCG 500 

TCCCACCGAC AAGCCTCTCC GTCTCCCACT CCAGGATGTC TACAAGATTT 550 

CTGGTATCGG CACGGTGCCC GTCGGTCGTG TTGAGACTGG TACCATCAAG 600 

GCCGGTATGG TCGTCACCTT TGCCCCCGCC AACGTCACCA CTGAAGTCAA 650 

GTCCGTCGAA ATGCACCACG AACAGCTYGC CGAGGGCGTT CCGGGTGACA 700 

ACGTCGGCTT CAACGTCAAG AACGTYTCCG TGAAGGAGGT TCGTCGTGGA 750 

AACGTTGCTG GTGACTCCAA GAACGACCCC CCCAAGGGTG CCGACTCCTT 800 

CAACGCCCAG GTCATCGTCC TCAACCACCC TGGTCAGGTC GGTGCTGGCT 850 

ACGCCCCGGT CTTGGATTGC CACACTGCCC ACATTGCCTG CAAGTTCTYT 900 

GAGCTCCTCG AGAAGATCGA TCGTCGKACC GGCAAGTCCA TGGAAAACAA 950 

CCCCAAGTTC ATCAAGTCTG G 971 



2) INFORMATION FOR SEQ ID NO: 1287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 
5 (B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287 

GGTCTGAGGA CCGTTACCAG GAG AT CATC A AGGAGACCTC CAACTTCATC 50 

10 1 " AAGAAGGTCG" GCTACAACCC" CAAGCACGTT CCCTTCGTCC CCATCTCCGG 100 
TTTCAACGGA GACAACATGA TCGAGGCCTC CAGCAACTGC CCCTGGTACA " 150 

AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACTGGTAA GACCCTCCTT 200 

GAGGCCATTG ACGCCATCGA CCCTCCCAGC CGTCCTACCG ACAAGCCCCT 250 

CCGTCTTCCC CTCCAGGATG TCTACAAGAT TGGTGGTATT GGCACGGTTC 300 

15 CCGTCGGTCG TGTCGAGACC GGTATCATCA AGGCCGGTAT GGTCGTCACC 350 

TTCGCCCCCG CTGGTGTCAC CACTGAAGTC AAGTCCGTCG AGATGCACCA 400 

CGAGCAGCTT ACCGAGGGTG TCCCCGGTGA CAACGTCGGC TTCAACGTCA 450 

AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GTAACGTTGC CGGTGACTCC 500 

AAGAACGACC CCCCCAAGGG CTGCGAGTCT TTCAACGCTC AGGTCATTGT 550 

20 CCTCAACCAC CCTGGTCAGG TCGGTGCCGG TTACGCGCCA GTCCTCGACT 600 

GCCACACCGC CCACATTGCC TGCAAGTTCT CTGAGCTCCT CGAGAAGATT 650 

GACCGCCGTA CCGGAAAGTC TGTCGAAGCC TCTCCCAAGT TCATCAAGTC 700 

TGGTGACGCG GCCATCGTCA AGATGGTTCC CT 732 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1288 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) ' LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288 



AATGGATCCT GTATACGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

45 CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARRARCAAT WGCAATCTTT ATTAACCAAA TTCGWGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCCA 337 

50 



2) INFORMATION FOR SEQ ID NO: 1289 
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(i ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 336 bases 
iq\ TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(l t) MOLECULE TYPE: Genomic DNA 

10 X^ORGAS-'Bacillus t^iensis 

(B) STRAIN: HER 1232 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1289 
15 AATGGATCCT GTATATGCAC CG™CATA ^AATTRC 

TATTATCACA GCCTGATACA ^GGAGCAAG GA CTGTAGCAGC 
CTTGTACGAA GTGGTGCGGT TGACATTATC 6 Q TCACACGTAG 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA ™ TTCAGGAGCA 
20 GTTTACAAGC ACGTTTAATG TgCMGCaC ^ ^ TTCGTGAAAA 

AGTTGGGGTT SS aSaAAC AACTCC 

25 2 ) INFORMATION FOR SEQ ID NO: 1290 

(i) SEQUENCE CHARACTERISTICS: 
U (A) LENGTH: 23 bases 
(B) TYPE: Nucleic acid 
30 c) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
35 (xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 1290 
GAYTAYGCIA TGIS1GTIAT HGT 

40 

2 ) INFORMATION FOR SEQ ID NO: 1291 

(t, SEQUENCE CHARACTERISTICS: 
K (A) LENGTH: 23 bases 
45 B) TYPE: Nucleic acid 

C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1291 
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GCIYTICCIG AYGTIMGIGA YGG 



5 2) INFORMATION FOR SEQ ID NO: 1292 
M, SEQUENCE CHARACTERISTICS: 
15 {J S LENGTH: 20 bases 

- - - o r type: -Nucleic acid - - - - - 

C STRANDEDNESS: Single 
10 (D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ 

ARISCYTCIA RIATRTGIGC 

20 2) INFORMATION FOR SEQ ID NO: 1293 
(i) SEQUENCE CHARACTERISTICS:. 
1 ' ( A ) LENGTH: 20 bases 

B TYPE: Nucleic acid 
2 5 c STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293 
ATGGCTGAAT TACCTCAATC 

35 

2) INFORMATION FOR SEQ ID NO: 1294 
M, SEQUENCE CHARACTERISTICS: 
11 (A) LENGTH: 25 bases 
40 * TYPE: Nucleic acid 

C STRANDEDNESS: Single 
( D) TOPOLOGY: Linear 

45 (ii ) MOLECULE TYPE: DNA 

(xi , SEQUENCE DESCRIPTION: SEQ ID NO: 1294 
ATGATTGTTG TATATCTTCT TCAAC 

50 

2) INFORMATION FOR SEQ ID NO: 1295 
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m SEQUENCE CHARACTERISTICS: 
( (A ) LENGTH: 20 bases 

B) TYPE: Nucleic acid 

c STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

I Li) MOLECULE TYPE: DNA 
" (xi) _ SEQUENCE DESCRIPTION: SEQ ID NO: 1295 
CAGAAAGTTT GAAGCGTTGT 



15 2) INFORMATION FOR SEQ ID NO: 1296 
(i , SEQUENCE CHARACTERISTICS: 
( (A) LENGTH: 20 bases 

B1 T YPE: Nucleic acid 
20 c STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii ) MOLECULE TYPE: DNA 
25 (X L) SEQUENCE DESCRIPTION: SEQ ID NO: 1296 
AACGATTCGT GAGTCAGATA 

30 

2 ) INFORMATION FOR SEQ ID NO: 1297 

(i) SEQUENCE CHARACTERISTICS: 
( ' (A) LENGTH: 22 bases 
35 on TYPE: Nucleic acid 

C STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1297 
CGGTCAACAT TGAGGAAGAG CT 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1298 

(i , SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
)n) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 ' (xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 1298 
ACGAAATCGA CCGTCTCTTT TTC 

10 2 ) INFORMATION FOR SEQ ID NO: 1299 

(i ) SEQUENCE CHARACTERISTICS : , 
( (A) LENGTH: 2711 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii ) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: 601055 ■ 

(C) ACCESSION NUMBER: X71437 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1299 ^ ^ 

ATGGCTGAAT TACCTCAATC AAGAATAAAT G AACGAAATA TTAD^ 

AATGCGTGAA TCATTTTTAG ATTAT^CGAT <* TCG TATACTA 150 

30 CATTGCCAGA TGTTCGTGAC GGTTTAAAAC ^ ATAAA AAATC 200 

TATGGATTAA ATGAACAAGG TATGACACtb CT CATGG TGACT 250 

AGCACGTATC GTTGGTGACG TAATGGGTAA f f CAGTTATCGT 300 

CATCTATTTA TGAAGCAATG GTACGTATGG C ATGGAGATGG 350 

TATCCGCTTG TTGATGGCCA AGGTAACTll ATCACACTTG 400 

35 CGCAGCAGCA ATGCGTTATA CTGMGCGCG TATG* 450 

3 AACTGTTACG TGATATTAAT ^ATACAA T TCCCTAATTT 500 

GATGGTAATG AAAGAGAGCC GTCAGTCTTA A CGAATATTC 550 

ATTAGCCAAT GGTGCATCAG GTATCGCGGT £ CTTAAGTAAG 600 

CACCACATAA CTTAACAGAA TTAATCAATG G AAGGTCCTGA 650 

40 AACCCTGATA TTTCAATTGC TGAGTTAATG G T AGACGTGCAT 700 

TTTCCCAACT GCTGGACTTA TTTTAGGTAA TGC AGTTAT TGAA 750 

ATGAAACAGG TCGTGGTTCA ^AAATGL TTCCTTTCCA 800 

GAACGTGGAG GCGGACGTCA ACGTATTGTT GT GTTCGTGAC A 850 

AGTGAATAAG GCTCGTATGA TTGMAAAAT 1 TTTACGTACT 900 

45 AGAAAATTGA CGGTATCACT GATTTACGTG ATG CTAGTGTCAT 950 

GGTGTGCGTG TCGTTATTGA TGTGCGTAAG GA TTTGGTGTGA 1000 

TTTAAATAAC TTATACAAAC AAACACCTCT * TTT AAAAGAA 1050 

ATATGATTGC ACTTGTAAAT GGTAGACCGA * GAAGACGTAC 1100 

GCGTTGGTAC ATTATTTAGA GCATCAAAA TTAGAAGGGT 1150 

50 GCAATATAAC TTACGTAAAG CTAAAGAl^ ^ GATTCGTGAG 1200 

TACGTATCGC ACTTGACCAT ATCGATGAAA T c GCT TCAAACT 1250 

TCAGATACAG ATAAAGTTGC ^TGGAAAGC m AGAG GTCTAA 1300 
TTCTGAAAAA CAAGCTCAAG CTATT1 1A« 
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CAGGTTTAGA GAGAAACAAA AT~ ? ~ 1400 

TATATTAGTG AATTAGAAGC CATCTTAO^i GGTGATGAGC 1450 

Sttaga gatgaattga ctga^ttag jg^gtt^ ggac 

GTCGTACAGA AATTCAATTA GGTGGATTTG ta AT AACTACAT 1550 

TTAATTCCAG AAGAACAAAT AGT ^AC1 : cAAAA CCGT GGTGGTCGTG 1600 

Sgtttg ccggtatcta catatcgtgc tcaaaacl gta 16 50 

gKJgg tatgaataca t« tttactraca aaggtcgtgt 1700 

ACTTTAAGTA CACATGACCA ^TATTGTTC CAGTCTAAAG 1750 

"atacaaacta aaaggttatg .^jg|ctga g atga -agtcattagt „ . 1800 

GTATTCCTGT AGTGAATGCT ATTGAACTTG ^ TC TTAGTGTT 1850 

araATfJATTG CTGTTAAAGA CCTTbAArtbi AACTTCTCAA 19UU 
ffiSS CGTGGTGTTG TTJ^gTC AGCATmGT 

GAATAAATAG AAATGGTAAG ATTGCGATTT T -pGATTGGTAC 2000 

TTAATTGCAG TTCGTTTAAC AAGTGGTCAA W£ CGTCCTTTAG 2050 

™cItgca tcattaattc gjttocctg* ^CATT ft aggt 2 oo 

rrrrTACAGC AACGGGTGTG AAAGGTATTA U*, AAGTATTAGT 2150 

StSagggc ttgatgtagc tcatgcaaac agtgttgatg atcgtt 2200 

AGTTACTGAA AATGGTTATG GTA^CGTAC GCCAG^ ^ 

TATCAAATCG TGGTGGTAAA GGTATTAAAA AAGATTTAAT 2300 

AATGGTAATG TTGTATGTAT CACTACAGTA ACTGfal GCAGATATTT 2350 

rATTGTTACT AATGCCGGTG TCATTATTCG ^TAGATGTl ^ 2400 

CTCAAAATGG TCGTGCAGCA CM^ ^AAGAGGATG CAGATGAAGT 24 0 

GATCAATTTG TTTCAACGGT JGCTAAAGTA ™ ^ GAACAACAAC 2500 

AAATGAAGAT GAACAATCTA C^ATCTG qaAATGCAAT TCATACTGAA 2550 

i GTGAAGCGGT TGTAAATGAT GAAACACCAG ^ Q MGTAAG ACA 2600 

ATGAAGAATA A 

0 

2) INFORMATION FOR SEQ ID NO: 1300 

(i ) SEQUENCE CHARACTERISTICS: 
1 (A) LENGTH: 2628 bases 
}b (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

^ ^SS^ 1 ^^ coli 

( ( B c! ACCESSION — X57174 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1300 

ATGAGCGACC TTGCGAGAGA M = G gggrjG 00 

50 GAAGAGCTCC TATCTGGATT ATGCSMGTC GG CGTACTTTAC 150 

TGCCAGATGT CCGAGATGGC CTGAAGCCGG ^ AAAAATCTGC 200 

SEES SSSS "a cfATCCCCAT GGTGACTCGG 250 
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CGGTCTATGA GAGGATCGTC CGCATGG- ? ™C 

GGCGGCAATG £5S£ ^ SSSSS «• 

TGATGGCCGA TCTCGAMAR GAGACG TCG CTMCCTGCT MD 

5 GGCACGGAAA ^"CCGGA CGTCRTGC CMCC MCATCCCGC » 

GGTGAACGGT TCTTCCGGTA TCGCCGT^ ^ TATTGMGAT 6 

SSSS5S SSS oc = c SSS g =Sc 

■ " CCCGACG6CG ^^^T^ g ^q TRTATCCGCG CTCGCGCAGA'RGTGGAAGTT ...W 

AACGCGTGGA AGGCATCAGC GCGCTGCbi QTG MG TTGTGCT J 

^rrrrATCG TGATTGAAGT GAAACGCGAT ^ c GGTAT CAACA 10OU 

.n^rrrrrr-P TTGTTCGTCA CCGCCGTGAA ^ CCTT GA R G CATTAG 

— SS3S & gsgg 



20 



25 



45 



SSSS CGTAAAGCTC GCOATCGTGC T = . _ 0 
CCGTGGCGCT GGCGAACATC GTTGCTMTC 0B8WW 1250 

GTGATCCGTG AAGAGCTGGA GCTb cATCAACCTG ^AAGATCTGA 

TCfTACTGAA ATCACCGCCA _ rTCACCAGGG CTACGTTAAU 

TCACCCAGGA AGATGTGGTC GTGACGCTCT CTCA GGAAAGGT AA 1100 

ScAGCCGC TTTCTGAATA CGAAGCGCAG CG CTGCTGGTGG 1750 

IfcTGCCGCA CGTATTAAAG AAGMG^ TA^^ TCGCGTCTAT 00 

SS cSgS Logtcctc ggg SSSSc 20*. 

ScCGGT AAAG = GAAGACGAAG TAATGCTGTT CTC™ 2100 
rrrTTGACCT GACCAGCGGC GAA^ r^TCCGTGCGA TGGGCTGCAH 

SlGG TGCGCTTB* AGAGT « GTCC 200 

SS S ilS SS S2SS 



50 



2) INFORMATION FOR SEQ ID NO: 1301 
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m SEQUENCE CHARACTERISTICS : 
U) A ) LENGTH: 23 bases 

B TYPE: Nucleic acid 

C STRANDEDNESS: Single 

D ) TOPOLOGY: Linear 

5 

MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1301 

10 GTIMGIAWIM GICCIGSIAT GTA 

15 2) INFORMATION FOR SEQ ID NO: 1302 
m SEQUENCE CHARACTERISTICS: 
U) (M LENGTH: 23 bases 

B TYPE: Nucleic acid 
C STRANDEDNESS: Single 
20 ( D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1302 
TAIAD1GGIG GIKKIGCIAT RTA 

30 2) INFORMATION FOR SEQ ID NO: 1303 
m SEQUENCE CHARACTERISTICS: 
( ' (A) LENGTH: 20 bases 
B TYPE: Nucleic acid 
35 c STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1303 
GGIGAIGAID YIMGIGARGG 

45 

2) INFORMATION FOR SEQ ID NO: 1304 

{i , SEQUENCE CHARACTERISTICS: 
( ' {A ) LENGTH: 19 bases 
50 o TYPE: Nucleic acid 

C STRANDEDNESS: Single 
(D ) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: !304 
CIARYTTIKY ITTIGTYTG 



10 2 ) INFORMATION FOR SEQ ID HO: 1305 

(i ) SEQUENCE CHARACTERISTICS : 

1 ' (A) LENGTH: 22 bases , 

{B) TYPE: Nucleic acid 

C) STRANDEDNESS: Single 

15 (D) TOPOLOGY: Linear 

(ii ) MOLECULE TYPE: DNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1305 

ATGGTGACTG CATTGTCAGA TG 

25 2 ) INFORMATION FOR SEQ ID NO: 1306 

H) SEQUENCE CHARACTERISTICS : 
1 (A) LENGTH: 22 bases 
{ a\ TYPE: Nucleic acid 
30 {C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1306 
GTCTACGGTT TTCTACAACG TC 

40 

2) INFORMATION FOR SEQ ID NO: 1307 

(i) SEQUENCE CHARACTERISTICS: 
U ' (A) LENGTH: 1923 bases 
45 (B) type: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(Vl) "J 6 ^oRGANfsSfstaphylococcu, aureus 
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(C) ACCESSION NUMBER: M86227 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307 

5 A TGGTGACTG CttTgCJ* 100 

GCAAATACAA GTATTAGAAG GTTTAGAAGC J GGAAATTGTC 150 

TGTATATAGG ATCGACTCAG ^GAGTTGC ^ M^TTGWjCT 200 

GATAATAGTA TCGATGAAGC ^TAGCTGGl MC GGACGTGGTA 250 

~ " TGTTATTGAA AAAGATAACT GGATTAAA^i -^™ GCTGT C GAAGTTATT 300. . 
10 TCCCAGTTGA TATTCAAGAA AAAATGGGAC GTCC^ 

TTAACTGTTT TACATGCTGG TGGTAAATTC ^ GCATTG TCAC 400 

ATCTGGTGGT TTACATGGTG TTGGTTCAlc «w R TCATCAAGCA 450 

Saga agtatatgta cacagaaatg aetata 

TATAAAAAAG GTGTACCTCA ^TTGACTTA ^ ATCT TCACAG 550 

15 TAAGACAGGT ACTGTCATTC GTTTTAAAGC A T TAGAGAGC TT 600 

15 AGACAACTGT ATACAACTAT GAAACATTAC * ^CGTGATGA 650 

GCTTTCTTAA ACAAAGGAAT JCAAATCACA QT AT TAAATCGT 700 

ArAAAACGTT AGAGAAGACT CCTATCAUiH qa TGAG CCAATT 750 

ACGTTGAGTT ATTGAACGAA AATAAAGAAC CTATTtt™ lh 800 

20 Stcatc aatctaaaga tgatattgaa gtagaaattG t ^ 

TAACTCAGGA TATGCCACAA ATCTTTTAAC l QC ATTAAC GCGT 900 

CGTACGAAGG TGGTACGCAT ^ATGAAGAAG AAAAGATAGC 950 

rTCTTAAATA GTTATGGTTT AAGTAGCAbA ini XCTATCAAAC 1000 

SSgS gatacacgag aaggtatgac agcaattata t 1060 

25 ATGGTGATCC TCAATTCGAA ^TCAAACGA TTGAA CGATT 1100 

2 GAAGTGCGTC AAGTTGTAGA TAAATTATTC * AAAGGTATTA 1150 
TTTATATGAA AATCCACAAG TCGCACGTAC ^ AG TAACACGT 1200 
TGGCGGCACG TGCACGTGTT GCTGCGAAAA Aft TAGCCGATTG 1250 
CGTAAATCAG CGTTAGATGT AGCAAGTCTT C GAAGGGGACT 1300 

30 CTCTAGTCAA AGTCCTGAAG AATGTGAGAT ^ GCAGGCGATT 1350 

3 CTGCCGGAGG GTCTACAAAA TCTGGTCGTG ACTCl^ GATTAGATAG 1400 
ttappATTAC GAGGTAAGAT ATTAAATGTT TTTGGTAC AG 1450 
HxmS AACAATGAAA TTCGTCAAAT GATCACAGCA T 1500 
GAATCGGTGG CGACTTTGAT CTAGCGAAAG CAAGATATCA ^ 1550 

35 Sgactg atgccgatgt ggatggagcg ^ttagaa ggctatgtgt 1600 

AATATTCTTC TATCGATTTA TGAGACCGTT AAT1£ A CAAAAGTAT 1650 

Stgcaca gccaccgttg tata^ctga cacaaggtaa ^ 

TATGTATACA ATGATAGGGA ACTTGATAAA £ GAAATGAATG 1750 

40 — ~ sss sss ib°5°o 



SSSS CTOGACTTC TAA 
45 

2 ) INFORMATION FOR SEQ ID NO: 1308 

(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 23 bases 
50 B T YPE: Nucleic acid 

C ) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

709 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308 
ATGTAYGTIA TIATGGAYMG IGC 



2) INFORMATION FOR SEQ ID NO: 1309 " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309 
ATIATYTTRT TICCYTTICC YTT 



2) INFORMATION FOR SEQ ID NO: 1310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310 
ATIATITSIA TIACYTCRTC 



2) INFORMATION FOR SEQ ID NO: 1311 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311 
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5 2} INFORMATION FOR SEQ ID NO: 1312 

(i) SEQUENCE CHARACTERISTICS: 
U \k) LENGTH: 23 bases 

ov T ype:- Nucleic acid . _ 

c STRANDEDNESS: Single 
10 (D ) TOPOLOGY: Linear 

(ii ) MOLECULE TYPE: DNA. 

15 (xi, SEQUENCE DESCRIPTION: SEQ ID NO: 1312 
AARTAYATIA TICARGARMG IGC 

20 2) INFORMATION FOR SEQ ID NO: 1313 

(i) SEQUENCE CHARACTERISTICS: 
1 (A) LENGTH: 23 bases 

in\ TYPE: Nucleic acid 
25 ; c) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1313 
AMIAYICKRT GIGGITTITT YTT 

35 

2) INFORMATION FOR SEQ ID NO: 1314 

m SEQUENCE CHARACTERISTICS: 
[ (A) LENGTH: 23 bases 
40 B TYPE: Nucleic acid 

C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314 
TAIGAITTYA CIGAISMICA RGC 

50 

2) INFORMATION FOR SEQ ID NO: 1315 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID~NO:" 1315 ~ " " 
ACIATIGCIT CIGCYTGIKS YTC 



2) INFORMATION FOR SEQ ID NO: 1316 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE:. Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316 
GTGAGTGAAA TAATTCAAGA TT 



2) INFORMATION FOR SEQ ID NO: 1317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317 
CACCAAAATC ATCTGTATCT AC 



2) INFORMATION FOR SEQ ID NO: 1318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

712 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



20 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318 
ACCTAYTCSA TGTACGTRAT CATGGA 26 

2) INFORMATION FOR SEQ ID NO: 1319 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319 
AGRTCGTCIA CCATCGGYAG YTT 23 

2) INFORMATION FOR SEQ ID NO: 1320 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K-12 MG1655 

40 (C) ACCESSION NUMBER: AE000384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320 

ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC 50 

45 CTACTTAAAC TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA 100 

TTGGTGATGG TCTGAAACCT GTTCAGCGCC GCATTGTGTA TGCGATGTCT 150 

GAACTGGGCC TGAATGCCAG CGCCAAATTT AAAAAATCGG CCCGTACCGT 200 

CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATAGC GCCTGTTATG 250 

AAGCGATGGT CCTGATGGCG CAACCGTTCT CTTACCGTTA TCCGCTGGTT 300 

50 GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC 350 

AATGCGTTAC ACCGAATCCC GGTTGTCGAA ATATTCCGAG CTGCTATTGA 4 00 

GCGAGCTGGG GCAGGGGACG GCTGACTGGG TGCCAAACTT CGACGGCACT 450 

TTGCAGGAGC CGAAAATGCT ACCTGCCCGT CTGCCAAACA TTTTGCTTAA 500 
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WO 01/23604 ^rrfcTATT CCACCGCATA 550 

CGGCACCRCC GGTATTGCCG TC~ «™ CO 

ACCTGCGTGA AGTGGCTCAG CGQ ATTAT CCGAC 650 

ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGb ATCTACGAGA 100 

TGAAGCGGAA ATTATCACTT C^CG^GA W**^ AGAAGATGGC 750 

ACGGACGTGG TTCAGTGCGT ^GCGCGCGG T CGCGCGTACT 800 

GCGGTGGTTA TCAGCGCATT GCCGCATCAG ATGGTTGACG 850 

GGAGCAAATT GCTGCGCAAA TGCGCAACAA GCCT GGTGATTGTG 900 

ATCTGCGCGA TGAATCTGAC CACGAGAA^ GTGATGAACC ACCTCTTCGC 950 

CCGCGTTCCA- ACCGCGTGGA- ^GGATC^G -ftTGATCGGTC - 000 . 

TACCACCGAT CTGGAAAAGA GCTATCGTA1 CT CTCC GAATGG 1050 

TGGATGGTCG TCCGGCGGTG AAAAACCTGC ATCG TCTGGA 1100 

cSgttcc gwbdb^c cggogoK a ggtttgctg gtggcgt^c 

GAAAGTCCTC AAGCGCCTGC Alft^ _ GTMTGAAG A TGAACCGAAA i 

TCAATATCGA CGAAGTGATT JAGATCATTC GTAA q CGGAAGCG AT 1250 

CCGGCGCTGA TGTCGCGGTT TGGCCTTACG £ G ATGAAGATTC 1300 

rPTPGAACTG AAACTGCGTC ATCTTGCCAA £ QTT GCAGGGCATT 1350 
C Gc"c G A GAGTGAACTG GAAAAAG^ GCGACC^ ^ 

TTGGCTTCCG AGCGTAAAAT GAATAACCl^ GTTG C AGGAACGCG 1450 

TgACGCGCAA GCCTACGGTG ACGATCf S ™^CGT C TGAACCTGTC 1500 

AAGAAGCGAA AGCGATGAGC GAGCACGACA lb aagGCCATGA 1550 

nrri?TGTGC TGTCGCAGAT GGGCTGGGTA ^ TTCAAAGCGG 1600 

SSSS CCGGGCCTGA ATTATAAAGC eMTG^ ? , 

cggtgaaagg taagagcaac caaccggtag i gtggtcaggg iioo 

cgtagctatg ccattgaccc gattacgctg ccg ACCGTTG acc 1150 

5 CGAGCCGCTC ACCGGCAAAT TAACGTTGCC GC TTCCGATGCG 1800 

^GTTACGGTT TCGTCTGCAC CTTTAAC^T ^ gggg ^ g ^t ATGCCGCCGG IK 

SmggS Satcacct taccggaaaa gcccmgtt a 

TGGTGATTGA AGATGCTTCC GATATGCTGC TGT CGAAGGGCAA 2000 

50 CGTATGTTGA TGTTCOGGT AJG^ qcTGACCATT flOO 

SSS ""AC ^^TSp^ G Q GG SS TACAGAAAGT 2150 
CATGTTGGGA AACGCAAAAT TAAACTGCGC CCGGAA TTGCAGCGTA 22Q0 

35 S S2K SSSE GTGCCAGCAG CGGTGATAGC JgO 
GAAGAGTAA 

40 2) INFORMATION FOR SEQ ID NO: 1321 

(i) SEQUENCE CHARACTERISTICS: 
A) LENGTH: 2403 bases 
(B) TYPE: Nucleic acid 
C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

Hi, MOLECULE TYPE: Genomic DNA 
50 (Vl) ^^ORGANlS^^taphylococcus aureus 

\l\ ACCESSION^NUMBER : 0.074 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321 

GTGAGTGAAA TAATTCAAGA TTTATCACTT GAAGATGTTT TAGGTGATCG 50 

5 CTTTGGAAGA TATAGTAAAT ATATTATTCA AGAGCGTGCA TTGCCAGATG 100 

TTCGTGATGG TTTAAAACCA GTACAACGTC GTATTTTATA TGCAATGTAT 150 

TCAAGTGGTA ATACACACGA TAAAAATTTC CGTAAAAGTG CGAAAACAGT 200 

- -CGGTGATGTT ATTGGTCAAT ATCATCCACA TGGAGACTTC TCAGTGTACA 250 

AAGCAATGGT CCGTTTAAGT CAAGACTGGA AGTTACGACA TGTCTTAATA 300 

10 GAAATGCATG GTAATAATGG TAGTATCGAT AATGATCCGC CAGCGGCAAT • .350 

GCGTTACACT GAAGCTAAGT TAAGCTTACT AGCTGAAGAG TTATTACGTG 400 

ATATTAATAA AGAGACAGTT TCTTTCATTC CAAACTATGA TGATACGACA 450 

CTCGAACCAA TGGTATTGCC ATCAAGATTT CCTAACTTAC TAGTGAATGG 500 

TTCTACAGGT ATATCTGCAG GTTACGCGAC AGATATACCA CCACATAATT 550 

15 TAGCTGAAGT GATTCAAGCA ACACTTAAAT ATATTGATAA TCCGGATATT 600 

ACAGTCAATC AATTAATGAA ATATATTAAA GGTCCTGATT TTCCAACTGG 650 

TGGTATTATT CAAGGTATTG ATGGTATTAA AAAAGCTTAT GAATCAGGTA 700 

AAGGTAGAAT TATAGTTCGT TCTAAAGTTG AAGAAGAAAC TTTACGCAAT 750 

GGACGTAAAC AGTTAATTAT TACTGAAATT CCATATGAAG TGAACAAAAG 800 

20 TAGCTTAGTA AAACGTATCG ATGAATTACG TGCTGACAAA AAAGTCGATG 850 

GTATCGTTGA AGTACGTGAT GAAACTGATA GAACTGGTTT ACGAATAGCA 900 

ATTGAATTGA AAAAAGATGT GAACAGTGAA TCAATCAAAA ATTATCTTTA 950 

TAAAAACTCT GATTTACAGA TTTCATATAA TTTCAACATG GTCGCTATTA 1000 

GTGATGGTCG TCCAAAATTG ATGGGTATTC GTCAAATTAT AGATAGTTAT 1050 

25 TTGAATCATC AAATTGAGGT TGTTGCAAAT AGAACGAAGT TTGAATTAGA 1100 

TAATGCTGAA AAACGTATGC ATATCGTTGA AGGTTTGATT AAAGCGTTGT 1150 

CAATTTTAGA TAAAGTAATT GAATTGATTC GTAGCTCTAA AAACAAGCGT 1200 

GACGCTAAAG AAAACCTTAT CGAAGTATTC GAGTTCACAG AAGAACAGGC 1250 

TGAAGCAATT GTAATGTTAC AGTTATATCG TTTAACAAAC ACTGACATAG 1300 

30 TTGCGCTTGA AGGTGAACAT AAAGAACTTG AAGCATTAAT CAAACAATTA 1350 

CGTCATATTC TTGATAACCA TGATGCATTA TTGAATGTCA TAAAAGAAGA 1400 

ATTGAATGAA ATTAAAAAGA AATTCAAATC TGAACGACTG TCTTTAATTG 1450 

AAGCAGAAAT TGAAGAAATT AAAATTGACA AAGAAGTTAT GGTGCCTAGT 1500 

GAAGAAGTTA TTTTAAGTAT GACACGTCAT GGATATATTA AACGTACTTC 1550 

35 TATTCGTAGC TTTAATGCTA GCGGTGTTGA GGATATTGGT TTAAAAGATG 1600 

GTGACAGTTT ACTTAAACAT CAAGAAGTAA ATACGCAAGA TACCGTACTA 1650 

GTATTTACAA ATAAAGGTCG TTATCTATTT ATACCGGTTC ATAAATTAGC 1700 

AGATATTCGT TGGAAAGAAT TGGGGCAACA TGTATCACAA ATAGTTCCTA 1750 

TCGAAGAAGA TGAAGTGGTT ATTAATGTCT TTAATGAAAA GGACTTTAAT 1800 

40 ACAGATGCAT TTTATGTTTT TGCGACTCAA AATGGCATGA TTAAGAAAAG 1850 

TACAGTGCCT CTATTTAAAA CAACGCGTTT TAATAAACCT TTAATTGCTA 1900 

CTAAAGTTAA AGAAAATGAT GATTTGATTA GTGTTATGCG CTTTGAAAAA 1950 

GATCAATTAA TTACCGTCAT TACTAATAAA GGTATGTCAT TAACGTATAA 2000 

TACAAGTGAA CTATCAGATA CCGGATTAAG GGCAGCTGGT GTTAAATCAA 2050 

45 TAAATCTTAA AGCTGAAGAT TTCGTTGTTA TGACAGAAGG TGTTTCTGAA 2100 

AATGATACTA TATTGATGGC CACACAACGC GGCTCGTTAA AACGTATTAG 2150 

TTTTAAAATC TTACAAGTTG CTAAAAGAGC ACAACGTGGA ATAACTTTAT 2200 

TAAAAGAATT AAAGAAAAAT CCACATCGTA TTGTAGCTGC ACATGTAGTG 2250 

ACAGGTGAAC ATAGTCAATA TACATTATAT TCAAAATCAA ATGAAGAACA 2300 

50 TGGTTTAATT AATGATATTC ATAAATCTGA ACAATATACA AATGGCTCAT 2350 

TCATTGTAGA TACAGATGAT TTTGGTGAAG TAATAGACAT GTATATTAGC 2400 

TAA 2403 
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2) INFORMATION FOR SEQ ID NO: 1322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TY~PE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1322 
RTIGAIAAYI SIGTIGAYGA RG 22 



2) INFORMATION FOR SEQ ID NO: 1323 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323 
ACIAWRSAIG GIGGIACICA YG 22 

30 

2) INFORMATION FOR SEQ ID NO: 1324 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1324 
45 CCICCIGCIS WRTCICCYTC 20 



2) INFORMATION FOR SEQ ID NO: 1325 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C ) STRANDEDNESS: Sxngle 
; D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
> (xi) SEQUENCE DESOTO*: SEQ 10 NO: »» 
- RT 7CATYTCI CCIARICCYT T „ 

2) INFORMATION FOR SEQ ID NO:, 

(i) SEQUENCE "ERISTICS: 

SI T^Nucle^cid 
* STRANDEDNESS : Single 
Jo? TOPOLOGY: Linear 

2Q MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326 
TGATTCAATA CAGGTTTTAG AG 

25 

2) INFORMATION FOR SEQ ID NO: 1327 
,m SEOUENCE CHARACTERISTICS: 
30 (1) ^LENGTH: 22 bases 

* TYPE: Nucleic acid 
5 STRANDEDNESS: Single 
JdS TOPOLOGY: Linear 

35 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132, 
40 CTAGATTTCC TCCTCATCAA AT 

2) INFORMATION FOR SEQ ID NO: 1328 
45 m SEQUENCE CHARACTERISTICS: 

{1) (M LENGTH: 1992 bases 
* TYPE: Nucleic acid 
5 STRANDEDNESS: Double 
D TOPOLOGY: Linear 

Hi, MOLECULE TYPE: Genomic 0« 



717 



SUBSTITUTE SHEET (RULE 26) 



PCT/CA00/01150 

WO 01/23604 

T "oR^S-'stap^ococcus aureus 



[^} 

M) SEQDENCE DESCRIPTOR SEQ ID NO: 1326 

> AAGT ™I?J aacItaaaac aaaagataaa ^ttaga mttaT 

- mmmmm i 

ESSS ESS Sjp-g g-ss G & mo 

45 ssssssssss^ssssssssss ig 

SSSSS S GftGGAMTCT ,3 



750 
800 
850 
900 
950 
1000 
1050 



50 



2) INFORMATION FOR SEQ ID NO: 1329 
(i) SEQUENCE CHARACTERISTICS 
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iM LENGTH: 25 bases 

5 TYPE: Nucleic acid 

? STRANDEDNESS: Single 

D TOPOLOGY: Linear 



Hi, MOLECULE TYPE: DNA 

v nFSCR iPTION: SEQ ID NO: 1329 
(xiV - SEQUENCE. DESCRirn _ 

10 TGTAGAGCGC GGTATCATCA AAGTA 

2)INF 0RMATI0N FOR SEQ ID NO: 1330 
15 r» SEQUENCE CHARACTERISTICS: 

(1) (A) LENGTH: 22 bases 
B TYPE: Nucleic acid 
S STRANDEDNESS: Single 
D TOPOLOGY: Linear 

20 

MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID 1330 

25 AGATTCGAAC TTGGTGTGCG GG 



30 2) INFORMATION FOR SEQ ID NO: 1331 
(i) SEQUENCE CHARACTERISTICS: 
[ ip.) LENGTH: 30 b ases 

B TYPE: Nucleic acid 
S STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 

nv nFSCRI PTION: SEQ ID NO: 1331 
40 (Ki) SEQUENCE DESCRIPTl 

GCCCTTGAGG TACAGAATGG TAATGAAGTT 

45 2) INFORMATION FOR SEQ ID NO: 1332 
m SEQUENCE CHARACTERISTICS: 
U) ^LENGTH: 20 bases 
n! TYPE: Nucleic acid 
*° o STRANDEDNESS: Single 

Jo, TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332 
5 GACCGCGGCG CAGACCATCA 20 



10 



2) INFORMATION FOR SEQ ID NO: 1333 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single' 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333 

20 

TCATGGTGAC TTATCTATTT ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1334 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334 

CATCTATTTA TAAAGCAATG GTA 23 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1335 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335 

CTATTTATGG AGCAATGGT 19 
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2) INFORMATION FOR SEQ ID NO: 1336 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
■ (xi) SEQUENCE DESCRIPTION: SEQ ID "NO: 1336 
TGGAGACTAC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1337 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337 
TGGAGACTTC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1338 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338 
GTGTACGGAG CAATG 



2) INFORMATION FOR SEQ ID NO: 1339 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15' bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339 
CCAGCGGAAA TGCGT 15 



2) INFORMATION FOR SEQ ID NO: 1340 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340 
GAACAAGGTA TGACACCGGA TAAAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1341 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341 
40 GATAACTGAA ATCCTGAGCC ATACG 25 



2) INFORMATION FOR SEQ ID NO: 1342 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1342 

GATGTTATTG GTCAATATCA TCCA 24 

5 

2) INFORMATION FOR SEQ ID NO: 1343 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) ' LENGTH: 29 bases - 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343 
AAGAAACTGT CTCTTTATTA ATATCACGT 29 

20 

2) INFORMATION FOR SEQ ID NO: 1344 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1344 
35 AGCAGCAACG ATGTTACGCA GCAG 24 



40 



2) INFORMATION FOR SEQ ID NO: 1345 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 5 

50 

CCCGCCGAGC ATTTCAACTA TTG 23 
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2) INFORMATION FOR SEQ ID NO: 1346 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1346 
GATGTTACGC AGCAGGGCAG TC 22 



2) INFORMATION FOR SEQ ID NO: 1347 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1347 
ACCAAGCAGG TTCGCAGTCA AGTA 24 



2) INFORMATION FOR SEQ ID NO: 1348 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Unidentified bacterium 
(C) ACCESSION NUMBER: X04555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1348 

ATGCGCTCAC GCAACTGGTC CAGAACCTTG ACCGAACGCA GCGGTGGTAA 50 

CGGCGCAGTG GCGGTTTTCA TGGCTTGTTA TGACTGTTTT TTTGTACAGT 100 

CTATGCCTCG GGCATCCAAG CAGCAAGCGC GTTACGCCGT GGGTCGATGT 150 

TTGATGTTAT GGAGCAGCAA CGATGTTACG CAGCAGGGCA GTCGCCCTAA 200 

AACAAAGTTA GGCCGCATGG ACACAACGCA GGTCACATTG ATACACAAAA 250 
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TTCTAGCTGC GGCAGATGAG CGAAATCTGC CGCTCTGGAT CGGTGGGGGC 300 

TGGGCGATCG ATGCACGGCT AGGGCGTGTA ACACGCAAGC ACGATGATAT 350 

TGATCTGACG TTTCCCGGCG AGAGGCGCGG CGAGCTCGAG GCAATAGTTG 400 

AAATGCTCGG CGGGCGCGTC ATGGAGGAGT TGGACTATGG ATTCTTAGCG 450 

5 GAGATCGGGG ATGAGTTACT TGACTGCGAA CCTGCTTGGT GGGCAGACGA 500 

AGCGTATGAA ATCGCGGAGG CTCCGCAGGG CTCGTGCCCA GAGGCGGCTG 550 

AGGGCGTCAT CGCCGGGCGG CCAGTCCGTT GTAACAGCTG GGAGGCGATC 600 

ATCTGGGATT ACTTTTACTA TGCCGATGAA GTACCACCAG TGGACTGGCC 650 

TACAAAGCAC ATAGAGTCCT ACAGGCTCGC ATGCACCTCA CTCGGGGCGG 700 

10 AAAAGGTTGA- GGTCTTGGGT - GCCGCTTTCA GGTCGCGATA TGCGGCCTAA -750 



2) INFORMATION FOR' SEQ ID NO: 1349 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349 

25 

CAGCCGACCA ATGAGTATCT TGCC 24 



30 2) INFORMATION FOR SEQ ID NO: 1350 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350 

TAATCAGGGC AGTTGCGACT CCTA 24 



45 



2) INFORMATION FOR SEQ ID NO: 1351 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 531 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
5 (B) STRAIN: Stone 130 

(C) ACCESSION NUMBER: L06157 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351 

10 ATGTTATGGA GCAGCAACGA -TGTTACGCAG CAGGGCAGTC GCCCTAAAAC ... 50. 

AAAGTTAGGT GGCTCAATGA GCATCATTGC AACCGTCAAG ATCGGCCCTG 100 

ACGAAATTTC AGCCATGAGG GCTGTGCTCG ATCTCTTCGG CAAAGAGTTT 150 

GAGGACATTC CAACCTACTC TGATCGCCAG CCGACCAATG AGTATCTTGC 200 

CAATCTTCTG CACAGCGAGA CGTTCATCGC GCTCGCTGCT TTTGACCGCG 250 

15 GAACAGCAAT AGGTGGGCTC GCCGCCTACG TTCTACCCAA GTTCGAGCAA 300 

GCGCGAAGCG AGATCTACAT TTATGACTTG GCAGTCGCTT CCAGCCATCG 350 

AAGGCTAGGA GTCGCAACTG CCCTGATTAG CCACCTGAAG CGTGTGGCGG 400 

TTGAACTTGG CGCGTATGTA ATCTATGTGC AAGCAGACTA CGGTGACGAT 450 

CCGGCAGTCG CTCTCTACAC AAAGCTTGGA GTTCGGGAAG ACGTCATGCA 500 

20 CTTCGACATT GATCCAAGAA CCGCCACCTA A 531 



25 



2) INFORMATION FOR SEQ ID NO: 1352 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1352 

35 

CCACGCTGAC AGAGCCGCAC CG 22 

40 2) INFORMATION FOR SEQ ID NO: 1353 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353 

GGCCAGCTCC CATCGGACCC TG 22 
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2) INFORMATION FOR SEQ ID NO: 1354 



PCT/CA00/01150 



15 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354 
CACGCTGACA GAGCCGCACC G 21 

2) INFORMATION FOR SEQ ID NO: 1355 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355 
ATGCCGTTGC TGTCGAAATC CTCG 24 

30 



2) INFORMATION FOR SEQ ID NO: 1356 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
45 (C) ACCESSION NUMBER: M97172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356 

ATGAACACGA TCGAATCGAT CACGGCGGAC CTGCACGGAC TGGGCGTCCG 50 

50 GCCCGGCGAC CTGATCATGG TCCATGCATC GCTGAAAGCC GTCGGCCCGG 100 

TCGAGGGAGG TGCGGCCTCG GTGGTGTCGG CCCTTCGCGC CGCGGTCGGG 150 

TCCGCAGGGA CCCTGATGGG TTATGCCTCA TGGGACCGCT CGCCCTATGA 200 

GGAGACGCTG AACGGCGCGC GGATGGACGA AGAACTGCGC CGCCGGTGGC 250 
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CACCCTTCGA TCTGGCCACA TCCGGTACCT ATCCCGGCTT CGGCCTGCTC 300 

AACCGGTTTC TGCTTGAGGC GCCCGACGCA CGGCGCAGCG CGCATCCCGA 350 

CGCCTCCATG GTCGCGGTCG GCCCCCTTGC CGCCACGCTG ACAGAGCCGC 400 

ACCGGCTTGG GCAGGCGCTG GGCGAAGGCT CGCCGCTGGA GCGCTTCGTC 450 

5 GGGCATGGCG GAAAGGTCCT GCTTCTGGGA GCGCCGCTCG ACTCCGTCAC 500 

CGTGCTGCAT TACGCCGAGG CCATCGCCCC CATCCCGAAC AAACGCCGCG 550 

TGACCTATGA AATGCCGATG CTCGGCCCGG ATGGCAGGGT CCGATGGGAG 600 

CTGGCCGAGG ATTTCGACAG CAACGGCATT CTCGATTGCT TCGCGGTCGA 650 

TGGGAAGCCG GATGCCGTCG AGACGATCGC CAAGGCTTAT GTCGAACTGG 700 

-10 - GCCGGCA-TCG GGAAGGCATC GTCGGTCGCG-CACCCTCCTA TCTGTT-TGAA -750 

GCGCAGGATA TCGTCTCGTT CGGCGTCACC TATCTCGAAC AGCATTTCGG 800 

CGCGCCCTGA 810 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1357. 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357 
GCCCATCCAT TTGCCTTTGC 20 

2) INFORMATION FOR SEQ ID NO: 1358 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358 
GCGTACCAAC TTGCCATCCT GAAG 24 

45 



2) INFORMATION FOR SEQ ID NO: 1359 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359 

TGCCCCTGCC ACCTCACTC 19 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1360 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360 
CGTACCAACT TGCCATCCTG AAGA 24 

2) INFORMATION FOR SEQ ID NO: 1361 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X01385 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361 



GTGCAATACG AATGGCGAAA AGCCGAGCTC 

TGGGGTTACC CCCGGCGGTG TGCTGCTGGT 

45 TCCGGCCCCT CGAAGATGGG CCACTTGGAC 

GCGCTGGGTC CGGGAGGGAC GCTCGTCATG 

CGACGAGCCG TTCGATCCTG CCACGTCGCC 

TTGTCTCTGA CACATTCTGG CGCCTGCCAA 

CCATTTGCCT TTGCGGCAGC GGGGCCACAG 

50 TCCATTGCCC CTGCCACCTC ACTCGCCTGC 

ATGAACTCGA TGGGCAGGTA CTTCTCCTCG 

ACGACGCTGC ATCTTGCCGA GTTGATGGCA 

GAGACACTGC ACCATTCTTC AGGATGGCAA 



ATCGGTCAGC TTCTCAACCT 50 

CCACAGCTCC TTCCGTAGCG 100 

TGATCGAGGC CCTGCGTGCT 150 

CCCTCGTGGT CAGGTCTGGA 200 

CGTTACACCG GACCTTGGAG 250 

ATGTAAAGCG CAGCGCCCAT 300 

GCAGAGCAGA TCATCTCTGA 350 

AAGCCCGGTC GCCCGTGTCC 400 

GCGTGGGACA CGATGCCAAC 4 50 

AAGGTTCCCT ATGGGGTGCC 500 

GTTGGTACGC GTCGATTATC 550 
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TCGAGAATGA CCACTGCTGT GAGCGCTTTG CCTTGGCGGA CAGGTGGCTC 
AAGGAGAAGA GCCTTCAGAA GGAAGGTCCA GTCGGTCATG CCTTTGCTCG 
GTTGATCCGC TCCCGCGACA TTGTGGCGAC AGCCCTGGGT CAACTGGGCC 
GAGATCCGTT GATCTTCCTG CATCCGCCAG AGGGCGGGAT GCGAAGAATG 
CGATGCCGCT CGCCAGTCGA TTGGCTGAGC TCATGA 



600 
650 
700 
750 
786 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1362 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

t 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362 
CGCCGCCATC GCCCAAAGCT GG 22 



2) INFORMATION FOR SEQ ID NO: 1363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1363 
CGGCATAATG GAGCGCGGTG ACTG 24 



2) INFORMATION FOR SEQ ID NO: 1364 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1364 
TTTCTCGCCC ACGCAGGAAA AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1365 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365 
CATCCTCGAC GAATATGCCG CG 22 



2) INFORMATION FOR SEQ ID NO: 1366 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 
(C) ACCESSION NUMBER: M88012 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366 



ATGACTGATC CCCGCAAAAA CGGCGATTTG 

CGCGACGCCC TGGTCCAAAA GCGAGCTGGT 

35 GCGTGCGCTC AGGCGATATG GTGATGCCGC 

GGGCCGCTGG CGGACGGACC GCAGACACTT 

CGTCGGCCCC ACCGGGAATA TTCTCGCCTT 

CCTATGAACA GACGCTGGGT CATGATGCGC 

AGCTGGCCTG CGTTCGACCC CGACCATGCG 

40 CGCGATCAAC GAATTTATCC GAACCTATCC 

ATCCCGACGC ATCGATGGCG GCGATCGGGC 

GCGCCGCACG AGATGGGCGC CGCTTATGGC 

TTTTCTCGCC CACGCAGGAA AAATCCTGTC 

CAGTCACCGC GCTCCATTAT GCCGAAGCGG 

45 CGCCGCGTCA CTTATTCGAT GCCCTTACTG 

GGTCACCACG TCCGACTGGG ATTCGAACGG 

CGCCCGACGG CCCCGACGCG GTCGAACGGA 

CGCACCAGGG TTGCGCAAGG CCCGGTCGGC 

CGACGCGGCC GATATCGTTT CCTTCGGCAT 

50 ACGCCGCGCC AGCGGCGGCA GCGCTGAAGC 



CACGAACCCG 


CGACGGCACC 


50 


CCGGCAATTG 


CGCGACCTCG 


100 


ATGTGTCGTT 


GCGCGCCGTC 


150 


GTCGATGCGC 


TGATCGAGGC 


200 


CGTCTCGTGG 


CGCGATTCGC 


250 


CGCCCGCCGC 


CATCGCCCAA 


300 


CCCGCCTACC 


CCGGCTTTGG 


350 


GGGGTGTCGG 


CGCACGGCCC 


400 


CCGATGCGGC 


GTGGCTGGTG 


450 


CCCCGCTCGC 


CGATCGCGCG 


500 


GATCGGCGCC 


GGGCCCGATG 


550 


TGGCGCGGAT 


CGAGGGCAAG 


600 


CGCGAAGGCA 


AGCGCGTCTG 


650 


CATCCTCGAC 


GAATATGCCG 


700 


TCGCCCGCGA 


CTATCTCGCC 


750 


GGCGCGCAAT 


CCCGGCTGAT 


800 


CGAATGGCTC 


GAGGCGCGCC 


850 


CGAAACAACG 


CCGCGACTGA 


900 
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2) INFORMATION FOR SEQ ID NO: 1367 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1367 
CAAATATACT AACAGAAGCG TTCA ' 24 



2) INFORMATION FOR SEQ ID NO: 1368 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1368 
AGGATCTTGC CAATACCTTT AT 22 

30 

2) INFORMATION FOR SEQ ID NO: 1369 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369 
4 5 AAACCTTTGT TTCGGTCTGC TAAT 24 



2) INFORMATION FOR SEQ ID NO: 1370 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

732 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10- 



25 



50 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370 
AAGCGATTCC AATAATACCT TGCT 24 

2) INFORMATION FOR SEQ ID NO: 1371 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 558 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrojbacter di versus 
(C) ACCESSION NUMBER: M18967 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1371 



ATGAATTATC AAATTGTGAA TATTGCGGAA TGCAGCAATT ATCAGTTAGA 50 

AGCAGCAAAT ATACTAACAG AAGCGTTCAA TGATCTTGGT AACAATTCAT 100 

30 GGCCAGATAT GACGAGTGCA ACAAAAGAAG TAAAAGAATG TATTGAGAGT 150 

CCAAACCTTT GTTTCGGTCT GCTAATAAAT AACTCCTTAG TTGGCTGGAT 200 

AGGCTTAAGG CCAATGTACA AGGAAACCTG GGAATTGCAT . CCATTGGTTG 250 

TCAGACCAGA TTATCAAAAT AAAGGTATTG GCAAGATCCT GCTTAAGGAA 300 

TTAGAAAACA GAGCTAGAGA GCAAGGTATT ATTGGAATCG CTTTAGGAAC 350 

35 AGATGATGAA TACTATAGAA CAAGTCTCTC TTTAATAACT ATAACAGAAG 400 

ATAATATATT TGATTCAATA AAAAATATTA AAAATATTAA TAAACATCCA 450 

TATGAGTTTT ATCAGAAGAA TGGTTATTAT ATTGTTGGAA TAATTCCAAA 500 

TGCCAATGGT AAAAACAAAC CAGATATTTG GATGTGGAAA AGTTTAATCA 550 

AAGAGTAA 558 

40 



2) INFORMATION FOR SEQ ID NO: 1372 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372 
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GCTTTCGTTG CCTTTGCCGA GGTC 24 



2) INFORMATION FOR SEQ ID NO: 1373 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
1-0 (B) TYPE: -Nucleic acid- - 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373 
CACCCCTGTT GCTTCGCCCA CTC 23 

2) INFORMATION FOR SEQ ID NO: 1374 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374 
AGATATTGGC TTCGCCGCAC CACA 24 

35 

2) INFORMATION FOR SEQ ID NO: 1375 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1375 
50 CCCTGTTGCT TCGCCCACTC CTG 23 
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2) INFORMATION FOR SEQ ID NO: 1376 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
(C) ACCESSION NUMBER: M94066 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376 

ATGATCGTCA TCTGCGACCA CGACAACCTC GACGCCTGGC TGGCGCTGCG 50 

CACCGCGCTG TGGCCCTCCG GCTCGCCTGA AGATCACCGC GCGGAAATGC 100 

GCGAGATATT GGCTTCGCCG CACCACACCG CGTTTATGGC GCGGGGGCTG 150 

20 GACGGCGCTT TCGTTGCCTT TGCCGAGGTC GCGCTGCGCT ACGATTACGT 200 

CAACGGCTGC GAATCGTCGC CGGTGGCGTT TTTGGAAGGA ATTTATACCG 250 

CCGAACGCGC CCGCCGCCAG GGCTGGGCCG CGCGCCTGAT CGCGCAGGTG 300 

CAGGAGTGGG CGAAGCAACA GGGGTGCAGC GAGCTGGCGT CGGATACCGA 350 

TATCGCCAAT CTGGACTCCC AGCGCCTGCA TGCGGCGCTG GGCTTTGCCG 400 

25 AAACGGAGCG AGTAGTGTTT TACCGCAAAA CGCTGGGCTG A 441 



30 



2) INFORMATION FOR SEQ ID NO: 1377 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1377 

40 

GCCGTGGGTC GATGTTTGAT GTTA 24 



45 2) INFORMATION FOR SEQ ID NO: 1378 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1378 
GCTCGATGAC GCCAACTACC TCTG 24 

5 

2) INFORMATION FOR SEQ ID NO: 1379 

10 ■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE :< DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1379 
20 AGCAGCAACG ATGTTACGCA GCAG 24 



25 



2) INFORMATION FOR SEQ ID NO: 1380 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1380 

35 

CGCTCGATGA CGCCAACTAC CTCT 24 



40 2) INFORMATION FOR SEQ ID NO: 1381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: X02340 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1381 

GTGGTAACGG CGCAGTGGCG GTTTTCATGG CTTCTTGTTA TGACATGTTT 50 

TTTTGGGGTA CAGTCTATGC CTCGGGCATC CAAGCAGCAA GCGCGTTACG 100 

5 CCGTGGGTCG ATGTTTGATG TTATGGAGCA GCAACGATGT TACGCAGCAG 150 

GGCAGTCGCC CTAAAACAAA GTTAAACATC ATGAGGGAAG CGGTGATCGC 200 

CGAAGTATCG ACTCAACTAT CAGAGGTAGT TGGCGTCATC GAGCGCCATC 250 

TCGAACCGAC GTTGCTGGCC GTACATTTGT ACGGCTCCGC AGTGGATGGC 300 

GGCCTGAAGC CACACAGTGA TATTGATTTG CTGGTTACGG TGACCGTAAG 350 

10 GCTTGATGAA ACAACGCGGC- GAGCTTTGAT- CAACGACCTT - TTGGAAACTT 400 

CGGCTTCCCC TGGAGAGAGC GAGATTCTCC GCGCTGTAGA AGTCACCATT 450 

GTTGTGCACG ACGACATCAT TCCGTGGCGT TATCCAGCTA AGCGCGAACT 500 

GCAATTTGGA GAATGGCAGC GCAATGACAT , TCTTGCAGGT ATCTTCGAGC 550 

CAGCCACGAT CGACATTGAT CTGGCTATCT TGCTGACAAA AGCAAGAGAA 600 

15 CATAGCGTTG CCTTGGTAGG TCCAGCGGCG GAGGAACTCT TTGATCCGGT 650 

TCCTGAACAG GATCTATTTG AGGCGCTAAA TGAAACCTTA ACGCTATGGA 700 

ACTCGCCGCC CGACTGGGCT GGCGATGAGC GAAATGTAGT GCTTACGTTG 750 

TCCCGCATTT GGTACAGCGC AGTAACCGGC AAAATCGCGC CGAAGGATGT 800 

CGCTGCCGAC TGGGCAATGG AGCGCCTGCC GGCCCAGTAT CAGCCCGTCA 850 

20 TACTTGAAGC TAGACAGGCT TATCTTGGAC AAGAAGAAGA TCGCTTGGCC 900 

TCGCGCGCAG ATCAGTTGGA AGAATTTGTC CACTACGTGA AAGGCGAGAT 950 

CACCAAGGTA GTCGGCAAAT AA 972 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1382 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1382 
TAGATATGAT AGGCGGTAAA AAGC 24 

2) INFORMATION FOR SEQ ID NO: 1383 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1383 



737 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CCCAAATTCG AGTAAGAGGT ATT 



PCT/CAOO/01150_ 

23 



5 2) INFORMATION FOR SEQ ID NO: 1384 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

10 . .- . (C) STRANDEDNESS : Single - - 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1384 

GATATGATAG GCGGTAAAAA GC 22 



20 



30 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1385 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1385 
TCCCAAATTC GAGTAAGAGG TA 22 

2) INFORMATION FOR SEQ ID NO: 1386 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01282 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1386 

ATGAAAGAAA GATATGGAAC AGTATATAAA GGCTCTCAGA GGCTCATAGA 50 
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CGAGGAAAGT GGAGAAGTAA TAGAGGTAGA 
CGTCTGGTAA CTTTGTAAAA GCGTATATCG 
GATATGATAG GCGGTAAAAA GCTCAAGATT 
TGTACATCTA AGTAATAACA CAATGATAGC 
5 AAGGAACAAA TACAAGCACG AAAACCGTAA 
GAAGAAGGAA ATATCATTAA AAGAAGAACT 
AGAGCTACTC ATGAGAGGCG ATGACCAAAA 
AATTTGGGAA CTTTGAGCAA GAGGACGACC 
TCAGAATATT ATTCTTTCAA GGAGTAG 
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TAAGCTATAC CGTAAACAAA 100 

TCCAATTAAT AAGTATGTTA 150 

GTTAATTATA TATTAGATAA 200 

AACTGTTAGA GAAATAGCAG 250 

ATACAACGCT TAAAATCTTA 300 

GGAGCATTAA TGCTAAACCC 350 

ACAAAAATAC CTCTTACTCG 400 

AAAAGCAAGA AAATGCTTTA 450 

477 



2) INFORMATION FOR SEQ ID NO: 138,7 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1387 
25 TTATGCCTCT TCCGACCATC AAGC 24 



30 



2) INFORMATION FOR SEQ ID NO: 1388 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1388 

40 

TACGCTCGTC ATCAAAATCA CTCG 24 



45 2) INFORMATION FOR SEQ ID NO: 1389 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

739 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1389 
GAATAACGGT TTGGTTGATG CGAG 24 

5 

2) INFORMATION FOR SEQ ID NO: 1390 

10 " (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE:' DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1390 
20 ATGGCAAGAT CCTGGTATCG GTCT 24 



25 



2) INFORMATION FOR SEQ ID NO: 1391 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: J01839 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1391 

40 ATGAGCCATA TTCAACGGGA AACGTCTTGC TCGAGGCCGC GATTAAATTC 50 

CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC GATAATGTCG 100 

GGCAATCAGG TGCGACAATC TATCGATTGT ATGGGAAGCC CGATGCGCCA 150 

GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA 200 

TGAGATGGTC AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA 250 

45 TCAAGCATTT TATCCGTACT CCTGATGATG CATGGTTACT CACCACTGCG 300 

ATCCCCGGGA AAACAGCATT CCAGGTATTA GAAGAATATC CTGATTCAGG 350 

TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG TTGCATTCGA 400 

TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 450 

CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA 500 

50 TGACGAGCGT AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA 550 

AGCTTTTGCC ATTCTCACCG GATTCAGTCG TCACTCATGG TGATTTCTCA 600 

CTTGATAACC TTATTTTTGA CGAGGGGAAA TTAATAGGTT GTATTGATGT 650 

TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC ATCCTATGGA 700 
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ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 750 
TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT 800 
CGATGAGTTT TTCTAA 816 



2) INFORMATION FOR SEQ ID NO: 1392 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH :• 24 bases ----- 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1392 
TGGGTGGAGA GGCTATTCGG CTAT 24 

20 



2) INFORMATION FOR SEQ ID NO: 1393 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1393 
35 CAGTCCCTTC CCGCTTCAGT GAC 23 



40 



2) INFORMATION FOR SEQ ID NO: 1394 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1394 

50 

GACGTTGTCA CTGAAGCGGG AAGG 24 
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10 



15 



30 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1395 
CTTGGTGGTC GAATGGGCAG GTAG . 24 

2) INFORMATION FOR SEQ ID NO: 1396 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00618 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1396 



ATGATTGAAC 
GAGGCTATTC 

35 CCGCCGTGTT 
ACCGACCTGT 
ATCGTGGCTG 
TCACTGAAGC 
GATCTCCTGT 

40 TGATGCAATG 
ACCACCAAGC 
GGTCTTGTCG 
AGCCGAACTG 
TCGTCGTGAC 

45 GGCCGCTTTT 



TCTCCGGCCG 


CTTGGGTGGA 


50 


GACAATCGGC 


TGCTCTGATG 


100 


GCCCGGTTCT 


TTTTGTCAAG 


150 


CAGGACGAGG 


CAGCGCGGCT 


200 


CGCAGCTGTG 


CTCGACGTTG 


250 


TGGGCGAAGT 


GCCGGGGCAG 


300 


GAGAAAGTAT 


CCATCATGGC 


350 


TCCGGCTACC 


TGCCCATTCG 


400 


CACGTACTCG 


GATGGAAGCC 


450 


GAGCATCAGG 


GGCTCGCGCC 


500 


CATGCCCGAC 


GGCGAGGATC 


550 


CGAATATCAT 


GGTGGAAAAT 


600 


CGGCTGGGTG 


TGGCGGACCG 


650 


TATTGCTGAA 


GAGCTTGGCG 


700 


ACGGTATCGC 


CGCTCCCGAT 


750 


GACGAGTTCT 


TCTGA 


795 



2) INFORMATION FOR SEQ ID NO: 1397 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : DNA 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 1397 
GTGGGAGAAA ATGAAAACCT AT 22 



15 2) INFORMATION FOR SEQ ID NO: 1398 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1398 

ATGGAGTGAA AGAGCCTGAT 20 



2) INFORMATION FOR SEQ ID NO: 1399 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1399 
ACCTATGATG TGGAACGGGA AAAG 24 

2) INFORMATION FOR SEQ ID NO: 1400 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

743 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1400 
CGATGGAGTG AAAGAGCCTG ATG 



-2 ) INFORMATION FOR- SEQ ID NO: - 1401 ------- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: 1 Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
(C) ACCESSION NUMBER: V01547 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1401 

ATGGCTAAAA TGAGAATATC ACCGGAATTG AAAAAACTGA TCGAAAAATA 50 

CCGCTGCGTA AAAGATACGG AAGGAATGTC TCCTGCTAAG GTATATAAGC 100 

TGGTGGGAGA AAATGAAAAC CTATATTTAA AAATGACGGA CAGCCGGTAT 150 

AAAGGGACCA CCTATGATGT GGAACGGGAA AAGGACATGA TGCTATGGCT 200 

GGAAGGAAAG CTGCCTGTTC CAAAGGTCCT GCACTTTGAA CGGCATGATG 250 

GCTGGAGCAA TCTGCTCATG AGTGAGGCCG ATGGCGTCCT TTGCTCGGAA 300 

GAGTATGAAG ATGAACAAAG CCCTGAAAAG ATTATCGAGC TGTATGCGGA 350 

GTGCATCAGG CTCTTTCACT CCATCGACAT ATCGGATTGT CCCTATACGA 400 

ATAGCTTAGA CAGCCGCTTA GCCGAATTGG ATTACTTACT GAATAACGAT 450 

CTGGCCGATG TGGATTGCGA AAACTGGGAA GAAGACACTC CATTTAAAGA 500 

TCCGCGCGAG CTGTATGATT TTTTAAAGAC GGAAAAGCCC GAAGAGGAAC 550 

TTGTCTTTTC CCACGGCGAC CTGGGAGACA GCAACATCTT TGTGAAAGAT 600 

GGCAAAGTAA GTGGCTTTAT TGATCTTGGG AGAAGCGGCA GGGCGGACAA 650 

GTGGTATGAC ATTGCCTTCT GCGTCCGGTC GATCAGGGAG GATATCGGGG 700 

AAGAACAGTA TGTCGAGCTA TTTTTTGACT TACTGGGGAT CAAGCCTGAT 750 

TGGGAGAAAA TAAAATATTA TATTTTACTG GATGAATTGT TTTAG 795 



2) INFORMATION FOR SEQ ID NO: 1402 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

744 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1402 
TATTCAACAA TTTATCGGAA ACAG 24 

5 



2) INFORMATION FOR SEQ ID NO: 1403 

(i) SEQUENCE CHARACTERISTICS:. . ... . __ 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1403 
20 TCAGAGAGCC AACTCAACAT TTT 23 



10 - 



15 



2) INFORMATION FOR SEQ ID NO: 1404 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1404 

35 

AAACAGCGTT TTAGAGCCAA ATAA 24 

40 2) INFORMATION FOR SEQ ID NO: 1405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1405 

TTCTCAGAGA GCCAACTCAA CATT 24 

745 
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2) INFORMATION FOR SEQ ID NO: 1406 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetOibacter baumannii 

(B) STRAIN: BM2580 

(C) ACCESSION NUMBER: X07753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1406 

ATGGAATTGC CCAATATTAT TCAACAATTT ATCGGAAACA GCGTTTTAGA 50 

GCCAAATAAA ATTGGTCAGT CGCCATCGGA TGTTTATTCT TTTAATCGAA 100 

ATAATGAAAC TTTTTTTCTT AAGCGATCTA GCACTTTATA TACAGAGACC 150 

ACATACAGTG TCTCTCGTGA AGCGAAAATG TTGAGTTGGC TCTCTGAGAA 200 

ATTAAAGGTG CCTGAACTCA TCATGACTTT TCAGGATGAG CAGTTTGAAT 250 

TCATGATCAC TAAAGCGATC AATGCAAAAC CAATTTCAGC GCTTTTTTTA 300 

ACAGACCAAG AATTGCTTGC TATCTATAAG GAGGCACTCA ATCTGTTAAA 350 

TTCAATTGCT ATTATTGATT GTCCATTTAT TTCAAACATT GATCATCGGT 400 

TAAAAGAGTC AAAATTTTTT ATTGATAACC AACTCCTTGA C GAT AT AG AT 450 

CAAGATGATT TTGACACTGA ATTATGGGGA GACCATAAAA CTTACCTAAG 500 

TCTATGGAAT GAGTTAACCG AGACTCGTGT TGAAGAAAGA TTGGTTTTTT 550 

CTCATGGCGA TATCACGGAT AGTAATATTT TTATAGATAA ATTCAATGAA 600 

ATTTATTTTT TAGATCTTGG TCGTGCTGGG TTAGCAGATG AATTTGTAGA 650 

TATATCCTTT GTTGAACGTT GCCTAAGAGA GGATGCATCG GAGGAAACTG 700 

CGAAAATATT TTTAAAGCAT TTAAAAAATG ATAGACCTGA CAAAAGGAAT 750 

TATTTTTTAA AACTTGATGA ATTGAATTGA 780 



2) INFORMATION FOR SEQ ID NO: 1407 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1407 
CCCTGTAATA GAAAAGCAAG TAGG 24 



2) INFORMATION FOR SEQ ID NO: 1408 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1408 - 
TTGTCGTATC CCTCAAATCA CC 



2) INFORMATION FOR SEQ ID NO: 1409 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1409 
TGGGATTACA ATGGCAATCA GCG 



2) INFORMATION FOR SEQ ID NO: 1410 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1410 
GGGGAATAGG TCACAAGATC TGCTT 



2) INFORMATION FOR SEQ ID NO: 1411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

747 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: S46063 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1411 



ATGCTTTTAT ATAAAATGTG TGACAATCAA AATTATGGGG TTACTTACAT 50 

GAAGTTTTTA TTGGCATTTT CGCTTTTAAT ACCATCCGTG GTTTTTGCAA 100 

GTAGTTCAAA GTTTCAGCAA GTTGAACAAG, ACGTTAAGGC AATTGAAGTT 150 

TCTCTTTCTG CTCGTATAGG TGTTTCCGTT CTTGATACTC AAAATGGAGA 200 

15 ATATTGGGAT TACAATGGCA ATCAGCGCTT CCCGTTAACA AGTACTTTTA 250 

AAACAATAGC TTGCGCTAAA TTACTATATG ATGCTGAGCA AGGAAAAGTT 300 

AATCCCAATA GTACAGTCGA GATTAAGAAA GCAGATCTTG TGACCTATTC 350 

CCCTGTAATA GAAAAGCAAG TAGGGCAGGC AATCACACTC GATGATGCGT 400 

GCTTCGCAAC TATGACTACA AGTGATAATA CTGCGGCAAA TATCATCCTA 450 

20 AGTGCTGTAG GTGGCCCCAA AGGCGTTACT GATTTTTTAA GACAAATTGG 500 

GGACAAAGAG ACTCGTCTAG ACCGTATTGA GCCTGATTTA AATGAAGGTA 550 

AGCTCGGTGA TTTGAGGGAT ACGACAACTC CTAAGGCAAT AGCCAGTACT 600 

TTGAATAAAC TTTTATTTGG TTCCGCGCTA TCTGAAATGA ACCAGAAAAA 650 

ATTAGAGTCT TGGATGGTGA ACAATCAAGT CACTGGTAAT TTACTACGTT 700 

25 CAGTATTGCC GGCGGGATGG AACATTGCGG ATCGCTCAGG TGCTGGCGGA 750 

TTTGGTGCTC GGAGTATTAC AGCAGTTGTG TGGAGTGAGC ATCAAGCCCC 800 

AATTATTGTG AGCATCTATC TAGCTCAAAC ACAGGCTTCA ATGGCAGAGC 850 

GAAATGATGC GATTGTTAAA ATTGGTCATT CAATTTTTGA CGTTTATACA 900 

TCACAGTCGC GC 912 

30 



2) INFORMATION FOR SEQ ID NO: 1412 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1412 
45 GAGAAAACGC TCCAGCAGGG C 21 



2) INFORMATION FOR SEQ ID NO: 1413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

748 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1413 
CATGAGGCTT TCACTGCGGG G 21 



2) INFORMATION FOR SEQ ID NO:. 1414 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1414 
TATCGTTAAT CGCACCATCA C 21 

25 



2) INFORMATION FOR SEQ ID NO: 1415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1415 
40 ATGCAGTAAT GCGGCTTTAT C 21 



45 



2) INFORMATION FOR SEQ ID NO: 1416 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



74 9 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: HEL-1 

(C) ACCESSION NUMBER: X91840 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1416 



ATGATGAAAA 
CACATTTGCT 

10 GCACCATCAC 
GCCGTTATCT 
TATCGCCAAT 
CGGTTAGTAA 
GGCGAAATTA 

15 AGGCAAACAG 
CAGGCGGCCT 
TTACTGCATT 
GCGACTTTAC 
AACCCTCAGG 

20 CCATTAAAAC 
AGATTATGCC 
GACAACTTGA 
GCCCGCTGGG 
GCTCCAGCAG 

25 ATATGTACCA 
GATTCGATCA 
CGCCGTTGAG 
ATAAAACGGG 
GAAAAAAACC 

30 TGTCCGTGTC 



AATCGTTATG 
GCCGCAAAAA 
CCCGTTGATG 
ACCAGGGAAA 
AACCACCCAG 
GACGTTTAAC 
AGCTCAGCGA 
TGGCAGGGTA 
ACCGCTGCAG 
TTTATCAAAA 
GCTAACTCCA 
AATGAGTTAC 
TGGCGCATAC 
TGGGGCTATC 
CGCCGAAGCC 
TTCAGGCCAA 
GGCATTGCGC 
GGGATTAGGC 
TCAACGGCAG 
GTAAACCCGC 
CTCCACTGGT 
TTGGCATCGT 
GAGGCGGCCT 



CTGCGCTCTG 

CAGAACAACA 

CAGGAGCAGG 

ACCCTATTAT 

TCACGCAGCA 

GGCGTGTTGG, 

TCCGGTCACG 

TCCGCCTGCT 

ATCCCCGATG 

CTGGCAGCCG 

GCATTGGTCT 

GAAGAGGCAA 

CTGGATTACG 

GCGAAGGGAA 

TATGGCGTGA 

CATGGATGCC 

TTGCGCAGTC 

TGGGAGATGC 

CGACAGCAAA 

CCGCCCCCGC 

GGATTTGGCA 

GATGCTGGCA 

GGCGCATTCT 



CTGCTGACAG 
GATTGCCGAT 
CTA-TTCCGGG 
TTCACCTGGG 
AACGCTGTTT 
GCGGCGATGC 
AAATACTGGC 
GCACTTAGCC 
ACGTTAGGGA 
CAATGGACTC 
GTTTGGCGCG 
TGACCAGACG 
GTTCCGCAGA 
GCCCGTACAC 
AATCCAGCGT 
AGCCACGTTC 
TCGCTACTGG 
TGAACTGGCC 
GTGGCATTGG 
AGTGAAAGCC 
GCTACGTAGC 
AACAAAAGCT 
TGAAAAGCTG 



CCTCTTTCTC 
ATCGTTAATC 
TATGGCCGTT 
GTAAAGCCGA 
GAGCTAGGAT 
TATCGCCCGC 
CAGAACTGAC 
ACCTATACGG 
TAAAGCCGCA 
CGGGCGCTAA 
CTGGCGGTGA 
CGTCCTGCAA 
ACGAACAAAA 
GTTTCTCCGG 
TATTGATATG 
AGGAGAAAAC 
CGTATTGGCG 
GCTGAAAGCT 
CAGCGCTTCC 
TCATGGGTGC 
CTTCGTTCCA 
ATCCTAACCC 
CAATAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1146 



2) INFORMATION FOR SEQ ID NO: 1417 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1417 

45 

TGGTTAACTA YAATCCSATT GCGGA 25 



50 2) INFORMATION FOR SEQ ID NO: 1418 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1418 

ATGCTTTACC CAGCGTCAGA TT 22 

10 - 



2) INFORMATION FOR SEQ ID NO: 1419 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1419 
25 CGATGAATAA GCTGATTTCT CACG 24 



30 



2) INFORMATION FOR SEQ ID NO: 1420 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1420 

40 

TGCTTTACCC AGCGTCAGAT TACG 24 



45 2) INFORMATION FOR SEQ ID NO: 1421 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1421 
AATTAGAGCG GCAGTCGGGA GGAA 



2) INFORMATION FOR SEQ ID NO: 1422 

(i) SEQUENCE CHARACTERISTICS: " 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1422 
GAAATCAGCT TATTCATCGC CACG 



2) INFORMATION FOR SEQ ID NO: 1423 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: GRI-1 

(C) ACCESSION NUMBER: X92506 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1423 

ATGGTTAAAA AATCACTGCG TCAGTTCACG CTGATGGCGA CGGCAACCGT 50 

CACGCTGTTG TTAGGAAGTG TGCCGCTGTA TGCGCAAACG GCGGACGTAC 100 

AGCAAAAACT TGCCGAATTA GAGCGGCAGT CGGGAGGAAG ACTGGGTGTG 150 

GCATTGATTA ACACAGCAGA TAATTCGCAA ATACTTTATC GTGCTGATGA 200 

GCGCTTTGCG ATGTGCAGCA CCAGTAAAGT GATGGCCGTG GCCGCGGTGC 250 

TGAAGAAAAG TGAAAGCGAA CCGAATCTGT TAAATCAGCG AGTTGAGATC 300 

AAAAAATCTG ACTTGGTTAA CTATAATCCG ATTGCGGAAA AGCACGTCGA 350 

TGGGACGATG TCACTGGCTG AGCTTAGCGC GGCCGCGCTA CAGTACAGCG 400 

ATAACGTGGC GATGAATAAG CTGATTTCTC ACGTTGGCGG CCCGGCTAGC 450 

GTCACCGCGT TCGCCCGACA GCTGGGAGAC GAAACGTTCC GTCTCGACCG 500 

TACCGAGCCG ACGTTAAACA CCGCCATTCC GGGCGATCCG CGTGATACCA 550 

CTTCACCTCG GGCAATGGCG CAAACTCTGC GTAATCTGAC GCTGGGTAAA 600 

GCATTGGGTG ACAGCCAACG GGCGCAGCTG GTGACATGGA TGAAAGGCAA 650 

752 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TACCACCGGT GCAGCGAGCA TTCAGGCTGG 

TGGGGGATAA AACCGGCAGC GGTGACTATG 

GTGATCTGGC CAAAAGATCG TGCGCCGCTG 

CCAGCCTCAA CCTAAGGCAG AAAGCCGTCG 

5 CTAAAATCGT CACCAACGGT TTGTAA 



PCT/CA00/01150 

ACTGCCTGCT TCCTGGGTTG 700 
GCACCACCAA CGATATCGCG 750 
ATTCTGGTCA CTTACTTCAC 800 
CGATGTATTA GCGTCGGCGG 850 

876 



2) INFORMATION FOR SEQ ID NO: 1424 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases. 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1424 

20 

GTTAACGGTG ATGGCGACGC TAC 23 



25 2) INFORMATION FOR SEQ ID NO: 1425 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1425 

GAATTATCGG CGGTGTTAAT CAGC 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1426 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1426 

CACGCTCAAT ACCGCCATTC CA 22 
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2) INFORMATION FOR SEQ ID NO: 1427 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1427 
TTATCGCCCA CTACCCATGA TTTC , 24 

2) INFORMATION FOR SEQ ID NO: 1428 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: CAS-5 

30 (C) ACCESSION NUMBER: X92507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1428 

ATGATGACTC AGAGCATTCG CCGCTCAATG TTAACGGTGA TGGCGACGCT 50 

35 ACCCCTGCTA TTTAGCAGCG CAACGCTGCA TGCGCAGGCG AACAGCGTGC 100 

AACAGCAGCT GGAAGCCCTG GAGAAAAGTT CGGGAGGTCG GCTTGGCGTT 150 

GCGCTGATTA ACACCGCCGA TAATTCGCAG ATTCTCTACC GTGCCGATGA 200 

ACGTTTTGCG ATGTGCAGTA CCAGTAAGGT GATGGCGGCC GCGGCGGTGC 250 

TTAAACAGAG CGAGAGCGAT AAGCACCTGC TAAATCAGCG CGTTGAAATC 300 

40 AAGAAGAGCG ACCTGGTTAA CTACAATCCC ATTGCGGAGA AACACGTTAA 350 

CGGCACGATG ACGCTGGCTG AGCTTGGCGC AGCGGCGCTG CAGTATAGCG 400 

ACAATACTGC CATGAATAAG CTGATTGCCC ATCTGGGTGG TCCCGATAAA 450 

GTGACGGCGT TTGCTCGCTC GTTGGGTGAT GAGACCTTCC GTCTGGACAG 500 

AACCGAGCCC ACGCTCAATA CCGCCATTCC AGGCGACCCG CGTGATACCA 550 

45 CCACGCCGCT CGCGATGGCG CAGACCCTGA AAAATCTGAC GCTGGGTAAA 600 

GCGCTGGCGG AAACTCAGCG GGCACAGTTG GTGACGTGGC TTAAGGGCAA 650 

TACTACCGGT AGCGCGAGCA TTCGGGCGGG TCTGCCGAAA TCATGGGTAG 700 

TGGGCGATAA AACCGGCAGC GGAGATTATG GCACCACCAA CGATATCGCG 750 

GTTATCTGGC CGGAAAACCA CGCACCGCTG GTTCTGGTGA CCTACTTTAC 800 

50 CCAACCGGAG CAGAAGGCGG AAAGCCGTCG GGATATTCTG GCTGCGGCGG 850 

CGAAAATCGT AACCCACGGT TTCTGA 876 
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2) INFORMATION FOR SEQ ID NO: 1429 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1429 
TTTACGGCTA AAGATACTGA AAAGT 25 



2) INFORMATION FOR SEQ ID NO: 1430 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 30 
GTTTAATAAA ACAACCACCG AATAAT 26 

30. 

2) INFORMATION FOR SEQ ID NO: 1431 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1431 
45 TAATTGACAC TCCATTTACG GCTAA 25 



2) INFORMATION FOR SEQ ID NO: 1432 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

755 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



10 



25 



45 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1432 
ACCGAATAAT ATTTTCCTTT CAGGCA 26 

2) INFORMATION FOR SEQ ID NO: 1433 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 741 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: AJ223604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1433 



ATGAGCAAGT TATCTGTATT CTTTATATTT TTGTTTTGCA GCATTGCTAC 50 

CGCAGCAGAG TCTTTGCCAG ATTTAAAAAT TGAAAAGCTT GATGAAGGCG 100 

30 TTTATGTTCA TACTTCGTTT GAAGAAGTTA ACGGGTGGGG CGTTGTTCCT 150 

AAACATGGTT TGGTGGTTCT TGTAAATGCT GAGGCTTACC TAATTGACAC 200 

TCCATTTACG GCTAAAGATA CTGAAAAGTT AGTCACTTGG TTTGTGGAGC 250 

GTGGCTATAA AATAAAAGGC AGCATTTCCT CTCATTTTCA TAGCGACAGC 300 

ACGGGCGGAA TAGAGTGGCT TAATTCTCGA TCTATCCCCA CGTATGCATC 350 

35 TGAATTAACA AATGAACTGC TTAAAAAAGA CGGTAAGGTT CAAGCCACAA 400 

ATTCATTTAG CGGAGTTAAC TATTGGCTAG TTAAAAATAA AATTGAAGTT 450 

TTTTATCCAG GCCCGGGACA CACTCCAGAT AACGTAGTGG TTTGGTTGCC 500 

TGAAAGGAAA ATATTATTCG GTGGTTGTTT TATTAAACCG TACGGTTTAG 550 

GCAATTTGGG TGACGCAAAT ATAGAAGCTT GGCCAAAGTC CGCCAAATTA 600 

40 TTAAAGTCCA AATATGGTAA GGCAAAACTG GTTGTTCCAA GTCACAGTGA 650 

AGTTGGAGAC GCATCACTCT TGAAACTTAC ATTAGAGCAG GCGGTTAAAG 700 

GGTTAAACGA AAGTAAAAAA CCATCAAAAC CAAGCAACTA A 741 



2) INFORMATION FOR SEQ ID NO: 1434 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1434 
5 CACAATCAAG ACCAAGATTT GCGAT 25 



2.) INFORMATION FOR SEQ ID NO: 1435 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases. 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1435 

20 

GAAAGGGCAG CTCGTTACGA TAGAG 25 



25 2) INFORMATION FOR SEQ ID NO: 1436 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1436 

CAGCATCAAC ATTTAAGATC CCCA 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1437 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1437 

CTCCACTTGA TTAACTGCGG AAATTC 26 
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2) INFORMATION FOR SEQ ID NO: 1438 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

- - (ii)- MOLECULE TYPE:. Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X06046 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1438 

ATGGCAATCC GAATCTTCGC GATACTTTTC TCCATTTTTT CTCTTGCCAC 50 

TTTCGCGCAT GCGCAAGAAG GCACGCTAGA ACGTTCTGAC TGGAGGAAGT 100 

TTTTCAGCGA ATTTCAAGCC AAAGGCACGA TAGTTGTGGC AGACGAACGC 150 

CAAGCGGATC GTGCCATGTT GGTTTTTGAT CCTGTGCGAT CGAAGAAACG 200 

CTACTCGCCT GCATCGACAT TCAAGATACC TCATACACTT TTTGCACTTG 250 

ATGCAGGCGC TGTTCGTGAT GAGTTCCAGA TTTTTCGATG GGACGGCGTT 300 

AACAGGGGCT TTGCAGGCCA CAATCAAGAC CAAGATTTGC GATCAGCAAT 350 

GCGGAATTCT ACTGTTTGGG TGTATGAGCT ATTTGCAAAG GAAATTGGTG 400 

ATGACAAAGC TCGGCGCTAT TTGAAGAAAA TCGACTATGG CAACGCCGAT 450 

CCTTCGACAA GTAATGGCGA TTACTGGATA GAAGGCAGCC TTGCAATCTC 500 

GGCGCAGGAG CAAATTGCAT TTCTCAGGAA GCTCTATCGT AACGAGCTGC 550 

CCTTTCGGGT AGAACATCAG CGCTTGGTCA AGGATCTCAT GATTGTGGAA 600 

GCCGGTCGCA ACTGGATACT GCGTGCAAAG ACGGGCTGGG AAGGCCGTAT 650 

GGGTTGGTGG GTAGGATGGG TTGAGTGGCC GACTGGCTCC GTATTCTTCG 700 

CACTGAATAT TGATACGCCA AACAGAATGG ATGATCTTTT CAAGAGGGAG 750 

GCAATCGTGC GGGCAATCCT TCGCTCTATT GAAGCGTTAC CGCCCAACCC 800 

GGCAGTCAAC TCGGACGCTG CGCGATAA 828 



2) INFORMATION FOR SEQ ID NO: 1439 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: J03427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1439 

ATGAAAACAT TTGCCGCATA TGTAATTATC GCGTGTCTTT CGAGTACGGC 50 

758 
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ATTAGCTGGT TCAATTACAG AAAATACGTC TTGGAACAAA GAGTTCTCTG 100 

CCGAAGCCGT CAATGGTGTC TTCGTGCTTT GTAAAAGTAG CAGTAAATCC 150 

TGCGCTACCA ATGACTTAGC TCGTGCATCA AAGGAATATC TTCCAGCATC 200 

AACATTTAAG ATCCCCAACG CAATTATCGG CCTAGAAACT GGTGTCATAA 250 

5 AGAATGAGCA TCAGGTTTTC AAATGGGACG GAAAGCCAAG AGCCATGAAG 300 

CAATGGGAAA GAGACTTGAC CTTAAGAGGG GCAATACAAG TTTCAGCTGT 350 

TCCCGTATTT CAACAAATCG CCAGAGAAGT TGGCGAAGTA AGAATGCAGA 400 

AATACCTTAA AAAATTTTCC TATGGCAACC AGAATATCAG TGGTGGCATT 450 

GACAAATTCT GGTTGGAAGG CCAGCTTAGA ATTTCCGCAG TTAATCAAGT 500 

10 GGAGTTTCTA GAGTCTCTAT" ATTTAAATAA ATTGTCAGCA " TCTAAAGAAA 550 
ACCAGCTAAT AGTAAAAGAG GCTTTGGTAA CGGAGGCGGC ACCTGAATAT ■ 600 

CTAGTGCATT CAAAAACTGG TTTTTC.TGGT GTGGGAACTG AGTCAAATCC 650 

TGGTGTCGCA TGGTGGGTTG GGTGGGTTGA GAAGGAGACA GAGGTTTACT 700 

TTTTCGCCTT TAACATGGAT ATAGACAACG AAAGTAAGTT GCCGCTAAGA 750 

15 AAATCCATTC CCACCAAAAT CATGGAAAGT GAGGGCATCA TTGGTGGCTA 800 

A 801 



20 2) INFORMATION FOR SEQ ID NO: 1440 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1440 

AGACCGTTAT CGTAAACAGG GCTAAG 26 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1441 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1441 
TTTTTTGCTC AAACTTTTTC AGGATC 26 

2) INFORMATION FOR SEQ ID NO: 1442 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 " (A) ORGANISM: Pseudomonas aeruginosa- 

(B) STRAIN: RNL-1 
. (C) ACCESSION NUMBER: Z21957 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1442 



ATGAATGTCA TTATAAAAGC TGTAGTTACT GCCTCGACGC TACTGATGGT 50 

ATCTTTTAGT TCATTCGAAA CCTCAGCGCA ATCCCCACTG TTAAAAGAGC 100 

AAATTGAATC CATAGTCATT GGAAAAAAAG CCACTGTAGG CGTTGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTACTG ATTAATCCTT TTGAAAAATT 200 

20 CCCAATGCAA AGTGTATTTA AATTGCATTT AGCTATGTTG GTACTGCATC 250 

AGGTTGATCA GGGAAAGTTG GATTTAAATC AGACCGTTAT CGTAAACAGG 300 

GCTAAGGTTT TACAGAATAC CTGGGCTCCG ATAATGAAAG CGTATCAGGG 350 

AGACGAGTTT AGTGTTCCAG TGCAGCAACT GCTGCAATAC TCGGTCTCGC 400 

ACAGCGATAA CGTGGCCTGT GATTTGTTAT TTGAACTGGT TGGTGGACCA 450 

25 GCTGCTTTGC ATGACTATAT CCAGTCTATG GGTATAAAGG AGACCGCTGT 500 

GGTCGCAAAT GAAGCGCAGA TGCACGCCGA TGATCAGGTG CAGTATCAAA 550 

ACTGGACCTC GATGAAAGGT GCTGCAGAGA TCCTGAAAAA GTTTGAGCAA 600 

AAAACACAGC TGTCTGAAAC CTCGCAGGCT TTGTTATGGA AGTGGATGGT 650 

CGAAACCACC ACAGGACCAG AGCGGTTAAA AGGTTTGTTA CCAGCTGGTA 700 

30 CTGTGGTCGC ACATAAAACT GGTACTTCGG GTATCAAAGC CGGAAAAACT 750 

GCGGCCACTA ATGATTTAGG TATCATTCTG TTGCCTGATG GACGGCCCTT 800 

GCTGGTTGCT GTTTTTGTGA AAGACTCAGC CGAGTCAAGC CGAACCAATG 850 

AAGCTATCAT TGCGCAGGTT GCTCAGACTG CGTATCAATT TGAATTGAAA 900 

AAGCTTTCTG CCCTAAGCCC AAATTAA 927 

35 



2) INFORMATION FOR SEQ ID NO: 1443 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1443 
50 CTTCTGCTCT GCTGATGCTT GGC 23 
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2) INFORMATION FOR SEQ ID NO: 1444 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



_(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1444 
GGCGACCAGG TATTTTGTAA TACTGC 26 

2) INFORMATION FOR SEQ ID NO: 1445 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: JMC 

30 (C) ACCESSION NUMBER: X93314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1445 

ATGAATGTCA TCACAAAATG TGTTTTCACC GCTTCTGCTC TGCTGATGCT 50 

35 TGGCTTAAGT TCATTTGTAG TATCAGCCCA ATCCCCTTTG TTAAAAGAGC 100 

AGATTGAAAC CATAGTGACG GGTAAAAAGG CCACTGTAGG TGTAGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTGTTG CTGAATCCAT TTGAAAAGTT 200 

TCCGATGCAA AGTGTGTTTA AACTGCATTT AGCTATGTTA GTTCTGCATC 250 

AGGTCGATCA GGGGAAACTG GATTTAAATC AGTCTGTTAC TGTTAATCGT 300 

40 GCTGCAGTAT TACAAAATAC CTGGTCGCCA ATGATGAAAG ATCATCAGGG 350 

CGATGAATTT ACTGTTGCAG TACAGCAGTT ACTGCAGTAT TCGGTGTCAC 400 

ACAGCGACAA TGTGGCCTGC GATTTGTTAT TTGAACTGGT GGGCGGGCCG 4 50 

CAAGCTTTGC ATGCTTATAT CCAGTCTTTA GGCGTTAAAG AAGCTGCCGT 500 

GGTAGCAAAT GAAGCGCAAA TGCATGCGGA TGATCAGGTG CAATATCAAA 550 

45 ACTGGACGTC GATGAAAGCC GCAGCACAAG TTCTGCAAAA GTTTGAACAG 600 

AAAAAGCAGT TGTCTGAAAC CTCTCAGGCC TTGTTATGGA AATGGATGGT 650 

TGAAACCACC ACAGGACCAC AGCGGTTAAA AGGCTTGTTA CCTGCTGGTA 700 

CTATAGTGGC GCATAAAACC GGTACTTCGG GCGTCAGAGC AGGAAAAACT 750 

GCGGCGACTA ATGATGCGGG CGTCATTATG TTGCCTGATG GACGGCCTTT 800 

50 ATTGGTGGCG GTATTTGTCA AGGATTCGGC TGAATCAGAA CGAACCAATG 850 

AAGCTATTAT TGCGCAGGTT GCGCAAGCGG CTTATCAGTT TGAGCTGAAA 900 

AAACTCTCTG CAGTGAGTCC GGATTGA 927 
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2) INFORMATION FOR SEQ ID NO: 1446 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1446 
GGCCTGYGAT TTGTTATTTG AACTGGT , 27 

2) INFORMATION FOR SEQ ID NO: 1447 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1447 
CGCTSTGGTC CTGTGGTGGT TTC 23 

30 

2) INFORMATION FOR SEQ ID NO: 1448 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1448 
45 GATCAGGTGC ARTATCAAAA CTGGAC 26 



2) INFORMATION FOR SEQ ID NO: 1449 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1449 
AGCWGGTAAC AAYCCTTTTA ACCGCT 26 

2) INFORMATION FOR SEQ ID NO:. 1450 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1450 
ACCACTGGGA ATACACTTGT AATGGC 26 

25. 

2) INFORMATION FOR SEQ ID NO:. 1451 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1451 
40 ATCTACCTGG TCAATCATTG CTTCGT 26 



2) INFORMATION FOR SEQ ID NO: 1452 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: BM10393 

(C) ACCESSION NUMBER: AF045472 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1452 

ATGACATTAT CAATAATTGT CGCTCACGAT AAACAAAGAG TCATTGGGTA 50 

. CCAAAATCAA TTACCTTGGC ACTTACCAAA TGATTTAAAG CATATTAAAC 100 

10 AACTGACCAC TGGGAATACA" CTTGTAATGG CACGGAAAAC TTTTAATTCT - 150 

ATAGGGAAGC CATTGCCAAA TAGACGTAAC GTCGTACTCA CTAACCAAGC 200 

TTCATTTCAC CATGAAGGGG TAGATGTTAT AAACTCTCTT GATGAAATTA 250 

AAGAGTTATC TGGTCATGTT TTTATATTTG, GAGGACAAAC GTTATACGAA 300 

GCAATGATTG ACCAGGTAGA TGATATGTAT ATCACAGTAA TAGATGGAAA 350 

15 GTTTCAAGGA GACACATTCT TTCCACCATA CACATTCGAA AACTGGGAAG 400 

TCGAATCTTC AGTAGAAGGT CAACTAGATG AAAAAAATAC TATACCGCAT 450 

ACATTCTTAC ATTTAGTGCG TAGAAAAGGG AAATAG 486 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1453 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1453 
ATCGAAGAAT GGAGTTATCG GRAATG 26 

2) INFORMATION FOR SEQ ID NO: 1454 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1454 
CCTAAAAYTR 'CTGGGGATTT CWGGA 25 

50 



2) INFORMATION FOR SEQ ID NO: 1455 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
- (xi-) SEQUENCE- DESCRIPTION : . SEQ . ID NO:. 1455 . 
CAGGTGGTGG GGAGATATAC AAAA 



2) INFORMATION FOR SEQ ID NO: 1456 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1456 
TATGTTAGAS RCGAAGTCTT GGKTAA 



2) INFORMATION FOR SEQ ID NO: 1457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1457 
CAAAGGTGAA CAGCTCCTGT TT 



2) INFORMATION FOR SEQ ID NO: 1458 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1458 

TCCGTTATTT TCTTTAGGTT GGTTAAA 27 



10" 



20 



25 



50 



2) INFORMATION FOR SEQ ID NO: 1459 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1459 
AAGGTGAACA GCTCCTGTTT 20 

2) INFORMATION FOR SEQ ID NO: 1460 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1460 
GATCACTACG TTCTCATTGT CA 22 

40 



2) INFORMATION FOR SEQ ID NO: 1461 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AJ238350 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1461 

5 

GTGAAACTAT CACTAATGGT AGCTATATCG AAGAATGGAG TTATCGGGAA 50 

TGGCCCTGAT ATTCCATGGA GTGCCAAAGG TGAACAGCTC CTGTTTAAAG 100 

CTATTACCTA TAACCAATGG CTGTTGGTTG GACGCAAGAC TTTTGAATCA 150 

A-TGGGAGCAT TACCCAACCG AAAGTATGCG GTCGTAACAC GTCCAAGTTT 200 

10 TACATCTGAC AATGAGAACG TAGTGATCTT TCCATCAATT AAAGATGCTT -250 

TAACCAACCT AAAGAAAATA ACGGATCATG TCATTGTTTC AGGTGGTGGG 300 

GAGATATACA AAAGCCTGAT CGATCAAGTA GATACACTAC ATATATCTAC 350 

AATAGACATC GAGCCGGAAG GTGATGTTTA CTTTCCTGAA ATCCCCAGCA 4 00 

ATTTTAGGCC AGTTTTTACC CAAGACTTCG CCTCTAACAT AAATTATAGT 4 50 

15 TACCAAATCT GGCAAAAGGG TTAA 474 



20 



50 



2) INFORMATION FOR SEQ ID NO: 1462 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1462 

30 

GCACTCCCYA ATAGGAAATA CGC 23 



35 2) INFORMATION FOR SEQ ID NO: 1463 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1463 

AGTGTTGCTC AAAAACAACT TCG 23 



2) INFORMATION FOR SEQ ID NO: 1464 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1464 
10 ACGTTYGAAT CTATGGGMGC ACT 23 



2) INFORMATION FOR SEQ ID NO: 1465 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1465 

25 

GTCGATAAGT GGAGCGTAGA GGC 23 



30 2) INFORMATION FOR SEQ ID NO: 1466 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1466 

AAGCATTGAC CTACAATCAG TGT 23 



45 



2) INFORMATION FOR SEQ ID NO: 1467 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1467 
5 AATACAACTA CATTGTCATC ATTTGAT 27 



2) INFORMATION FOR SEQ ID NO: 1468 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases, 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1468 

20 

CGTTACCCGC TCAGGTTGGA CATCAA 26 



25 2) INFORMATION FOR SEQ ID NO: 1469 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1469 

CATCCCCCTC TGGCTCGATG TCG 23 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1470 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 74 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: Z50804 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1470 

TTGAAAGTAT CATTGATAGC TGCGAAACGA AAAAACGGCG TGATTGGTTG 50 

5 CGGTCCAGAC ATACCGTGGT CCGCGAAAGG GGAGCAGCTA CTTTTTAAAG 100 

CATTGACCTA CAATCAGTGT CTTCTGGTGG GTCGCAAGAC GTTTGAATCT 150 

ATGGGCGCAC TCCCCAATAG GAAATACGCG GTCGTTACCC GCTCAGGTTG 200 

GACATCAAAT GATGACAATG TAGTTGTATT TCAGTCAATC GAAGAGGCCA 250 

TGGACAGGCT AGCTGAATTC ACCGGTCACG TTATAGTGTC TGGTGGCGGA 300 

10 GAAATTTACC GAGAAACATT ACCCATGGCC TCTACGCTCC ACTTATCGAC 350 

GATCGACATC GAGCCAGAGG GGGATGTTTT CTTCCCGAGT ATTCCAAATA 400 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA CTTCAAACAT TAACTATTGC 450 

TATCAAATTT GGAAAAAGGG TTAA 474 



15 



2) INFORMATION FOR SEQ ID NO: 1471 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1471 
GATAATGACA ACGTAATAGT ATTCCC 26 

30 



2) INFORMATION FOR SEQ ID NO: 1472 T 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1472 
45 GCTCAATATC AATCGTCGAT ATA 23 



2) INFORMATION FOR SEQ ID NO: 1473 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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10 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1473 
TTAAAGCCTT GACGTACAAC CAGTGG 26 



2) INFORMATION FOR SEQ ID NO: 1474 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1474 
TGGGCAATGT TTCTCTGTAA ATCTCC 26 

25 



2) INFORMATION FOR SEQ ID NO: 1475 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
40 (C) ACCESSION NUMBER: X12868 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1475 



GTGAAAGTAT CATTAATGGC TGCAAAAGCG 

45 CGGTCCACAC ATACCCTGGT CCGCGAAAGG 

CCTTGACGTA CAACCAGTGG CTTTTGGTGG 

ATGGGAGCAC TCCCTAATAG GAAATACGCG 

GACGGCCGAT AATGACAACG TAATAGTATT 

TGTACGGGCT GGCTGAACTC ACCGATCACG 

50 GAGATTTACA GAGAAACATT GCCCATGGCC 

GATTGATATT GAGCCGGAAG GAGATGTTTT 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA 

TATCAAATTT GGCAAAAGGG TTAA 



AAAAACGGAG TGATTGGTTG 50 

AGAGCAGCTA CTCTTTAAAG 100 

GCCGCAAGAC GTTCGAATCT 150 

GTCGTTACTC GCTCAGCCTG 200 

CCCGTCGATC GAAGAGGCCA 250 

TTATAGTGTC TGGTGGCGGG 300 

TCTACGCTCC ATATATCGAC 350 

CTTTCCGAAT ATTCCCAATA 400 

GCTCAAACAT TAACTATTGC 450 

474 
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15 



2) INFORMATION FOR SEQ ID NO: 1476 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1476 
GGCGAGCAGC TCCTATTCAA AG 22 



2) INFORMATION FOR SEQ ID NO: 1477 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

2.5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1477 
TAGGTAAGCT AATGCCGATT CAACA 25 

30 

2) INFORMATION FOR SEQ ID NO: 1478 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1478 
45 GAGAATGGAG TAATTGGCTC TGGATT 26 



2) INFORMATION FOR SEQ ID NO: 1479 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
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40 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1479 
GCGAAATACA CAACATCAGG GTCAT 25 



2) INFORMATION FOR SEQ ID NO: 1480 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: J120 

25 (C) ACCESSION NUMBER: Z86002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1480 

ATGAAAATAT CTCTTATGGC AGCTGTTTCC GAGAATGGAG TAATTGGCTC 50 

30 TGGATTGGAT ATACCTTGGC ATGTACAAGG CGAGCAGCTC CTATTCAAAG 100 

CCATGACTTA CAATCAATGG CTTCTAGTTG GTCGTAAAAC CTTCGACTCA 150 

ATGGGTAAAC TTCCGAATAG AAAATATGCA GTGGTTACTC GTTCTAAAAT 200 

TATCTCGAAT GACCCTGATG TTGTGTATTT CGCAAGTGTT GAATCGGCAT 250 

TAGCTTACCT AAACAATGCG ACAGCACATA TCTTTGTTTC TGGTGGTGGT 300 

35 GAAATATATA AAGCTTTAAT CGATCAAGCA GATGTTATCC ATCTTTCAGT 350 

GATTCACAAG CATATCTCTG GCGATGTGTT TTTTCCTCCA GTTCCACAGG 4 00 

GCTTCAAGCA AACATTTGAG CAAAGTTTCA GTTCAAATAT TGATTACACG 450 

TACCAAATTT GGGCAAAGGG CTAA 474 



2) INFORMATION FOR SEQ ID NO: 1481 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1481 
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24 



5 2) INFORMATION FOR SEQ ID NO: 1482 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1482 

TAATTTATAT TAGACAWAAA AAACTG 26 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1483 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic. acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1483 
CARYGTCAGA AAATGGCGTA ATC 23 

2) INFORMATION FOR SEQ ID NO: 1484 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1484 
TKCAAAGCRW TTTCTATTGA AGGAAA 26 

50 

2) INFORMATION FOR SEQ ID NO: 1485 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ -ID NO:- 1485 
AAAATGGCGT AATCGGTAAT GGC 



2) INFORMATION FOR SEQ ID NO: 1486 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1486 
CATTTGAGCT TGAAATTCCT TTCCTC 



2) INFORMATION FOR SEQ ID NO: 1487 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1487 
AATCGAAAAT ATGCAGTAGT GTCGAG 



2) INFORMATION FOR SEQ ID NO: 1488 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1488 

AGACTATTGT AGATTTGACC GCCA 24 



10 



20 



25 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1489 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: VA292 

(C) ACCESSION NUMBER: U31119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1489 



TTGAAAATTT CATTGATTTC TGCAACGTCA GAAAATGGCG TAATCGGTAA 50 

TGGCCCTGAT ATCCCATGGT CAGCAAAAGG TGAGCAGTTA CTCTTTAAAG 100 

30 CGCTCACATA TAATCAGTGG CTCCTTGTTG GAAGGAAAAC ATTTGACTCT 150 

ATGGGTGTTC TTCCAAATCG AAAATATGCA GTAGTGTCGA GGAAAGGAAT 200 

TTCAAGCTCA AATGAAAATG TATTAGTCTT TCCTTCAATA GAAATCGCTT 250 

TGCAAGAACT ATCGAAAATT ACAGATCATT TATATGTCTC TGGTGGCGGT 300 

CAAATCTACA ATAGTCTTAT TGAAAAAGCA GATATAATTC ATTTGTCTAC 350 

35 TGTTCACGTT GAGGTTGAAG GTGATATCAA TTTTCCTAAA ATTCCAGAGA 4 00 

ATTTCAATTT GGTTTTTGAG CAGTTTTTTT TGTCTAATAT AAATTACACA 4 50 

TATCAGATTT GGAAAAAAGG CTAA 474 



2) INFORMATION FOR SEQ ID NO: 1490 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1490 

GACCTATGAG AGCTTGCCCG TCAAA 25 
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2) INFORMATION FOR SEQ ID NO: 1491 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

-(ii) -MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1491 
TCGCCTTCGT ACAGTCGCTT AACAAA 



2) INFORMATION FOR SEQ ID NO: 1492 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1492 
CATTTTAGCT GCCACCGCCA ATGGTT 



2) INFORMATION FOR SEQ ID NO: 14 93 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1493 
GCGTCGCTGA CGTTGTTCAC GAAGA 



2) INFORMATION FOR SEQ ID NO: 1494 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS:' Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BL26A 

-(C) ACCESSION NUMBER: U10186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 94 



ATGATCGAGC TTCATGCCAT TTTAGCTGCC ACCGCCAATG GTTGCATTGG 50 

GAAGGACAAC GCACTTCCCT GGCCACCACT AAAAGGCGAT CTGGCCAGAT 100 

15 TCAAAAAATT GACCATGGGG AAGGTGGTCA TTATGGGGCG CAAGACCTAT 150 

GAGAGCTTGC CCGTCAAATT AGAAGGTCGC ACCTGCATCG TTATGACGCG 200 

CCAAGCGCTG GAGCTTCCGG GTGTTCGTGA CGCTAACGGC GCTATCTTCG 250 

TGAACAACGT CAGCGACGCC ATGCGGTTCG CTCAAGAAGA GAGCGTGGGC 300 

GATGTGGCCT ACGTCATTGG TGGCGCTGAG ATATTCAAGC GACTTGCCTT 350 

20 GATGATCACG CAGATTGAAT TGACCTTTGT TAAGCGACTG TACGAAGGCG 400 

ACACCTACGT TGATCTGGCC GAAATGGTCA AAGACTACGA GCAGAATGGC 450 

ATGGAAGAAC ATGACCTTCA CACTTACTTC ACTTACCGTA AAAAGGAGCT . 500 

TACAGAATGA 510 



2) INFORMATION FOR SEQ ID NO: 1495 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 95 
TCTCTAAACA TGATTGTCGC TGTC 24 

40 



2) INFORMATION FOR SEQ ID NO: 1496 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1496 
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CAGTGAGGCA AAAGTTTTTC TACC 24 



2) INFORMATION FOR SEQ ID NO: 1497 

(i) SEQUENCE CHARACTERISTICS: 
(A) 'LENGTH: -25 -bases- _ . 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



15 



20 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 97 
CGGACGACTT CATGTGGTAG TCAGT 25 

2) INFORMATION FOR SEQ ID NO: 1498 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1498 
TTTGTTTTCA GTAATGGTCG GGACCT 26 

35 



2) INFORMATION FOR SEQ ID NO: 1499 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
50 (C) ACCESSION NUMBER: X57730 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:" 1499 
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ATGGCTTCTC TAAACATGAT 
ATTTGAAAAT CAGATTCCGT 
AAGCTGTTAC AATGAACTCA 
TCACTGCCTA AAGTGCTGCC 
5 AGTACCACCC ACCCAGAACA 
AGATCGCAGT AAGAACTGCA 
CAAATTTTTG TAATTGGTGG 
CGTGGACAAA CTCTACTTAA 
CTGAACTGGA TTTATCCCTA 
10 CCGACCATTA CTGAAAACAA 
CCCTAACCCT ATTAGTGAGG 



TGTCGCTGTC AATAAGACAG 
GGCATGAACC AGAAGATTTA 
GTTTTGATTA TGGGTAGAAA 
CGGACGACTT CATGTGGTAG 
CTGATCAAGT TGTGTATGTA 
AGCTTGTTGG TTGACAAACC 
GAAGAGTGCG TACGAGAACT 
CTAGAGTACA GCTCAACACA 
TTCAAGTCAT GGAAACTCGT 
AACAAAACTT ATTTTCCAAA 
AACCCACATG TTAG 



PCT/CAOO/01150 

GAGGTATCGG 50 

AAACACTTCA 100 

AACTTTTGCC 150 

TCAGTAAAAC 200 

AGTACATACC 250 

AGAGTATTCT 300 

TAGCTGCCTA 350 

CAACAAGACA 400 

ATCTGAGGTC 450 

TTTGGATTAA 500 
534 



15 2) INFORMATION FOR SEQ ID NO: 1500 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1500 , 

ATCGGGTTAT TGGCAATGGT CCTA 24 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1501 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1501 
GCGGTAGTTA GCTTGGCGTG AGATT 25 

2) INFORMATION FOR SEQ ID NO: 1502 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1502 
GCGGGCGGAG CTGAGATATA CA 22 



- -10 -2) INFORMATION -FOR- SEQ ID. NO: 1503 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1503 

AACGGAGTGG GTGTACGGAA TTACAG 26 



25 

2) INFORMATION FOR SEQ ID NO: 1504 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 98 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: TKS84 

(C) ACCESSION NUMBER: Z21672 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1504 



ATGAACTCGG AATCAGTACG CATTTATCTC 

TCGGGTTATT GGCAATGGTC CTAATATCCC 

45 AGAAGATTTT TCGCAGACTC ACTGAGGGAA 

AAGACCTTTG AGTCTATCGG CAAGCCTCTA 

AATCTCACGC CAAGCTAACT ACCGCGCCAC 

CGCTGTCGCA CGCTATCGCT TTGGCATCCG 

GTCGCGGGCG GAGCTGAGAT ATACACTCTG 

50 CGTGTTTCTA TCTGAGGTAC ATCAAACCTT 

CAATGCTCAA CGAAACAGAA TTCGAGCTTG 

GCTGTAATTC CGTACACCCA CTCCGTTTAT 



GTTGCTGCGA TGGGAGCCAA 50 

CTGGAAAATT CCGGGTGAGC 100 

AAGTCGTTGT CATGGGGCGA 150 

CCGAACCGTC ACACATTGGT 200 

TGGCTGCGTA GTTGTTTCAA 250 

AACTCGGCAA TGAACTCTAC 300 

GCACTACCTC ACGCCCACGG 350 

CGAGGGTGAC GCCTTCTTCC 400 

TCTCAACCGA AACCATTCAA 450 

GCGCGTCGAA ACGGCTAA 4 98 
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WO 01/23604 PCT/CAOO/01150 
2) INFORMATION FOR SEQ ID NO: 1505 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1505 
ATTTTTCGCA GGCTCACCGA GAGC 24 



2) INFORMATION FOR SEQ ID NO: 1506 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1506 
CGGATGAGAC AACCTCGAAT TCTGCTG 27 

30 



2) INFORMATION FOR SEQ ID NO: 1507 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
45 (B) STRAIN: RA33.2 

(C) ACCESSION NUMBER: Z50802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1507 

50 ATGAACCCGG AATCGGTCCG CATTTATCTG GTCGCTGCCA TGGGTGCCAA 50 

TCGGGTTATT GGCAATGGTC CCGATATCCC CTGGAAAATC CCAGGTGAGC 100 

AGAAGATTTT TCGCAGGCTC ACCGAGAGCA AAGTGGTCGT TATGGGCCGC 150 

AAGACATTTG AGTCCATAGG CAAGCCCTTA CCAAACCGCC ACACAGTGGT 200 
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GCTCTCGCGC CAAGCTGGTT ATAGCGCTCC 
CGCTGTCACA CGTATCGCCA TCGACAGCCG 
GTAGCGCGCG GAGCCGAGGT ATATGCGCTG 
CGTCTTTCTA TCTGAGGTAC ATCAMCCTT 
5 CAGTGCTTAA CGCAGCAGAA TTCGAGGTTG 
GGCACAATCA CGTACACGCA CTCCGTCTAT 



PCT/CAOO/01150 

TGGTTGTGCA GTTGTTTCAA 250 

AACACGGCAA AGAACTCTAC 300 

GCGCTACCGC ATGCCAACGG 350 

TGAGGGTGAC GCCTTCTTCC 400 

TCTCATCCGA AACCATTCAA 4 50 

GCGCGTCGTA ACGGCTAA 4 98 



10 2) INFORMATION FOR" SEQ" I'D" NO: 1508 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: ' Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1508 

AGAATGTATT GGTATTTCCA TCTATCG 27 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1509 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1509 
CAATGTCGAT TGTTGAAATA TGTAAA 26 

2) INFORMATION FOR SEQ ID NO: 1510 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1510 
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5 2) INFORMATION FOR SEQ ID NO: 1511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

10 " " (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1511 

CAGACACAAT CACATGATCC GTTATCG 27 



20 



2) INFORMATION FOR SEQ ID NO: 1512 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

3C 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

<B) STRAIN: UI14 

(C) ACCESSION NUMBER: Z83331 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1512 

GTGAAACTAT CACTAATGGC AGCAATTTCG AAGAATGGAG TTATCGGAAA 50 

TGGCCCAGAT ATTCCATGGA GTGCCAAAGG GGAACAATTA CTCTTCAAAG 100 

40 CGATTACCTA TAATCAGTGG CTTTTGGTAG GCCGAAAGAC TTTCGAGTCA 150 

ATGGGGGCTT TACCCAACCG AAAATATGCC GTTGTAACTC GTTCAAGCTT 200 

CACTTCCAGT GATGAGAATG TATTGGTATT TCCATCTATC GATGAAGCGC 250 

TAAATCATCT GAAGACGATA ACGGATCATG TGATTGTGTC TGGTGGTGGT 300 

GAAATATACA AAAGCCTGAT CGATAAAGTT GATACTTTAC ATATTTCAAC 350 

45 AATCGACATT GAGCCAGAAG GTGATGTCTA TTTTCCAGAA ATCCCCAGTA 400 

GTTTTAGGCC AGTTTTTAGC CAAGACTTCG TGTCTAACAT AAATTATAGT 450 

TACCAAATCT GGCAAAAGGG TTAA 474 



50 

2) INFORMATION FOR SEQ ID NO: 1513 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1513 
TTCAAGCTCA AATGAAAACG TCC 



2) INFORMATION FOR SEQ ID NO: 1514 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1514 
GAAATTCTCA GGCATTATAG GGAAT 



2) INFORMATION FOR SEQ ID NO: 1515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1515 
GTGGTCAGTA AAAGGTGAGC AAC 



2) INFORMATION FOR SEQ ID NO: 1516 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear ' 
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MISSING AT THE TIME OF PUBLICATION 
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CTATGTCTCA AGGCCGTGCA ACATACTCTA TGGAATTTGC TAAATATGCT 50 
GAAACTCCAC GTAACGTGGC TGAAGGCATC ATTTCTAAAT TTCAGTCTGG 100 
CGGTAAAAAA GGTGACGACG AGTAA 125 



2) INFORMATION FOR SEQ ID NO: 1519 

10 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double, 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter lwoffi 
20. (B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1519 

TCTTTCGATT ACTATAAGCC CAAACTAATT CATAGTTAAA AACCAAGTGC 50 
25 TCATGCAGTG ATCCTGCATG AGTAGTTTAA AAAGGAAGAT CTC 93 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1520 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Acinetobacter lwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1520 



45 ATGGCTAAGG 
AATCGGTCAC 
CTGTATGTGC 
GACTCTGCAC 
CGTAGAATAC 

50. GCCACGCCGA 
GGCGCGATCC 
TGAACACATC 
TCCTTAACAA 



CTAAGTTTGA 
GTTGACCATG 
GAAGAAATTC 
CAGAAGAAAA 
GATTCTCCAA 
TTATGTTAAA 
TTGTATGTGC 
CTTCTTTCTC 
GTGTGACCTT 



ACGTAATAAG 
GTAAAACAAC 
GGTGGCGAAG 
AGCACGTGGT 
CTCGTCACTA 
AACATGATTA 
TGCGACTGAT 
GTCAGGTTGG 
GTTGATGATG 



CCACACGTTA 
TTTAACAGCT 
CGAAAGACTA 
ATTACAATTA 
CGCACACGTA 
CTGGTGCTGC 
GGTCCAATGC 
TGTACCTTAC 
AAGAACTTCT 



ACGTGGGCAC 50 

GCAATTGCAA 100 

CGCTGCAATT 150 

ATACTTCACA 200 

GACTGCCCGG 250 

TCAGATGGAC 300 

CACAGACTCG 350 

ATTCTTGTAT 400 

TGAGCTAGTG 450 
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GAAATGGAAG TTCGTGAACT TCTTTCTACT TATGACTTCC CAGGTGATGA 500 

CACTCCAGTT ATCCGTGGTT CAGCTCTTCT TGCACTTAAC GGTGACGCTG 550 

GTCAGTATGG CGAAGAAGCA GTTGTTGCGC TTGTTGACGC ACTTGACACT 600 

TACATTCCAG AGCCAGTACG TGCAATCGAC CAAGCATTCT TAATGCCAAT 650 

5 CGAAGACGTA TTCTCTATTT CTGGTCGTGG TACAGTAGTA ACTGGCCGTG 700 

TAGAAACTGG TATTGTGAAA GTAGGCGAAT CAGTTGAAAT CGTTGGTATC 750 

CGTGATACTC AAGTAACTAC AGTTACTGGC GTAGAAATGT TCCGTAAATT 800 

GCTTGACGAA GGTCGTGCGG GCGAGAACTG TGGTGTTCTT CTACGTGGTA 850 

CTAAGCGTGA AGACGTACAA CGTGGTCAAG TACTTGCTAA ACCAGGTGCA 900 

10 ATCAAGCCAC ACACTAAATT CGATGCAGAA GTATA'CGTAC TTTCTAAAGA 950 

AGAAGGTGGT CGTCACACTC CATTCCTTAA CGGTTACCGT CCACAGTTCT 1000 

ACTTCCGTAC AACTGACGTA ACTGGCGCGA TCAAATTACA AGATGGCGTT 1050 

GAAATGGTTA TGCCTGGTGA CAACGTAGAA ATGTCAGTAG AATTAATCCA 1100 

CCCAAT 1106 

15 



2) INFORMATION FOR SEQ ID NO: 1521 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 
30 (B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1521 

ACAAACTCAA GGTCGTGCAT CTTACTCAAT GGAACCGTTA AAATATGCTG 50 
35 AAGCTCCAAC AAGTGTTGCG GCTGCAGTAA TTGAAGCGCG TAAAAAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1522 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1522 
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TTTTTGTAAA CCAGCGGTGT AAAATATGAT TGTTTTATAC CGCACTTCTT 50 
AGGAAACATT AGAA 64 



2) INFORMATION FOR SEQ ID NO: 1523 

(i) SEQUENCE CHARACTERISTICS: 
10 - (A) "LENGTH:- 1098 bases ~ 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1523 



ATGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTAA ACGTGGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC TTTAACAGCA GCAATTACAA 100 

25 CCGTATTAGC AAAACACTAC GGTGGTGCAG CGCGTGCATT TGACCAAATC 150 

GATAACGCGC CAGAAGAAAA AGCGCGTGGT ATTACCATCA ACACTTCACA 200 

TGTTGAATAC GATACACCAA CTCGCCACTA TGCACACGTA GACTGTCCAG 250 

GACACGCCGA CTATGTTAAA AACATGATTA CCGGTGCGGC GCAAATGGAT 300 

GGTGCTATTT TAGTAGTAGC AGCAACAGAT GGTCCTATGC CACAAACTCG 350 

30 TGAACATATC TTATTAGGTC GCCAAGTAGG TGTTCCATAC ATCATCGTAT 400 

TCTTAAACAA ATGCGACATG GTAGATGATG AAGAGTTATT AGAATTAGTA 450 

GAAATGGAAG TGCGTGAACT TCTATCTCAA TATGACTTCC CAGGTGACGA 500 

TACACCAATC GTACGTGGTT CAGCATTACA AGCATTGAAC GGCGTAGCAG 550 

AATGGGAAGA AAAAATCCTT GAATTAGCTG GTCACTTAGA TACTTACATC 600 

35 CCAGAACCAG AACGTGCGAT TGACCAACCG TTCCTTCTTC CAATTGAAGA 650 

CGTATTCTCA ATTTCAGGTC GTGGTACAGT AGTAACTGGT CGTGTAGAAC 700 

GTGGTATCAT CCGTACTGGT GATGAAGTAG AAATCGTCGG TATCAAAGAT 750 

ACAGCGAAAA CTACTGTAAC AGGTGTTGAA ATGTTCCGTA AATTACTTGA 800 

CGAAGGTCGT GCAGGTGAAA ACATCGGTGC ATTATTACGT GGTACCAAAC 850 

40 GTGAAGAAAT CGAACGTGGT CAAGTATTAG CGAAACCAGG TTCAATCACA 900 

CCACACACTG ATTTTGAATC AGAAGTATAC GTATTATCAA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGTTA CCGTCCACAA TTCTATTTCC 1000 

GTACAACAGA CGTAACTGGT ACAATTGAAT TACCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAGATGACA GTAAGCTTAA TCCACCCA 1098 

45 



2) INFORMATION FOR SEQ ID NO: 1524 

50 (i) SEQUENCE CHARACTERISTICS : ? 

(A) LENGTH: 77 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
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10- 



15 



25 



35 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus jnirabilis 

(B) STRAIN: ATCC 25933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1524 

CAATGGAGTT CTTGAAGTAC AACGAAGCGC CTAGCAACGT CGCTCAGGCT 50 
ATTATCGAAG CTCGTAAAGC GAAATAA 77 



2) INFORMATION FOR SEQ ID NO: 1525 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 67 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1525 

GATCCTTTCG AGTTCAATTT AGTTTACGCT CCCTCTGTGA GAGGGAGCGA 50 
TATTAAGGAA TATAGTC 67 



2) INFORMATION FOR SEQ ID NO: 1526 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1112 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1526 

GTGTCTAAAG AAAAATTTGA ACGTTCAAAA CCGCACGTTA ACGTTGGTAC 50 
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TATCGGCCAC GTTGACCACG GTAAAACAAC TCTGACTGCT GCAATCACTA 100 

CAGTTTTAGC TAAAACTTAC GGTGGTGCTG CTCGTGCATT CGACCAAATC 150 

GATAATGCAC CAGAAGAAAA AGCGCGTGGT ATCACCATCT CTACTTCACA 200 

CGTAGAATAC GATACTCCAA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

5 GTCACGCCGA CTATGTTAAA AACATGATCA CTGGTGCTGC GCAAATGGAC 300 

GGCGCTATTC TGGTAGTAGC AGCAACTGAT GGTCCAATGC CACAAACTCG 350 

TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC ATCATCGTAT 400 

TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGCTGTT AGAATTAGTT 4 50 

GAAATGGAAG TTCGTGAACT TCTGTCTCAA TACGATTTCC CAGGTGATGA 500 

10 CACTCGAGTA ATCCGTGGTT- CAGCGCTGAA- AGCAGTGGAA GGCGAAGCAG 550 

AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA TTCTTATATC 600 

CCAGAGCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC CAATCGAAGA 650 

TGTATTCTCA ATCTCAGGCC GTGGTACAGT, AGTTACTGGT CGTGTAGAGC 700 

GTGGTATCAT CAAAGTAGGT GATGAAGTTG AGATTGTTGG TATCAAAGAA 750 

15 ACCACCAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA AATTACTTGA 800 

CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTACAAAAC 850 

GTGAAGAAAT CGAACGTGGA CAAGTACTGG CAAAACCAGG CTCAATCAAC 900 

CCACACAACA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

TGGTCGTCAC ACACCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

20 GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG CGTAGAAATG 1050 

GTAATGCCAG GCGACAACGT GAACATGATC GTTGAACTGA TCCACCCAAT 1100 

CGCAATGGAC GA 1112 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1527 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 800 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvvs 

(B) STRAIN: ATCC 35224 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1527 

ATCAACGAAG CTATCGAGGT TTATTTTGAG GTTGAGGGCA AGAAAAATAG 50 

ATTGATCCTG GAGGTCGCGG CTCACTTGGG TGATAACCGC GTCAGAACGA 100 

TCGCTATGGA TATGAGTGAG GGGCTTACTC GCGGGCTTGA AGCTACCGCT 150 

45 CTTGGTGCGC CTATTAGTGT GCCGGTTGGC GAGAAGGTTT TGGGAAGAAT 200 

TTTTAACGTC GTCGGCGATC TCATCGACGA GGGCGAGGGC GTAAATTTTG 250 

ATAAACATTG GTCTATCCAC CGCGATCCGC CACCATTTGA AGAACAAAGC 300 

ACGAAAAGTG AAATTTTTGA AACCGGTATA AAGGTTGTGG ATCTTCTTGC 350 

GCCTTACGCA AAGGGCGGTA AGGTCGGACT ATTTGGCGGT GCAGGTGTCG 400 

50 GTAAAACGGT CATCATCATG GAGCTCATCC ACAATGTCGC CTTCAAACAC 450 

AGCGGATACT CTGTATTTGC AGGTGTTGGC GAGAGGACGC GCGAAGGAAA 500 

CGACCTTTAT CACGAGATGA AAGAAAGTAA CGTTTTGGAT AAAGTCGCCT 550 

TGTGCTACGG ACAGATGAAC GAGCCGCCAG GGGCGAGAAA TCGTATCGCA 600 
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CTGACTGGTC TAACGATGGC TGAGTATTTC CGCGATGAGA TGGGACTTGA 
TGTGCTTATG TTTATCGACA ACATCTTCCG CTTCTCTCAA TCTGGTGCAG 
AGATGTCGGC ACTCCTCGGA CGTATCCCAT CAGCCGTTGG TTACCAGCCG 
ACGCTGGCAA GCGAGATGGG TAAATTTCAA GAAAGGATCA CATCGACTAA 



650 
700 
750 
800 
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2) INFORMATION FOR SEQ ID NO: 1528 

(i)~ SEQUENCE- CHARACTERISTICS: - 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:' DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1528 
AACTTGAGCG ATTTTCGGAT ACCCTG 26 



2) INFORMATION FOR SEQ ID NO: 1529 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1529 
TTGCCGATGA AATAACCGCC GACT 24 



2) INFORMATION FOR SEQ ID NO: 1530 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: M11277 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1530 



ATGCGATTGG TTTGGAAATG TGGGGCGATT CAGGCATCCC GGTTATCTGA 50 

ATGGCTCAAC TCAACAGCCG GTGCTCATGA ACTTGAGCGA TTTTCGGATA 100 

5 CCCTGACCTT TTCTGTGTAT GGCTCAGTGC TGATCTGGCT GAAATCATAT 150 

CTCCGCGAAT CAGGAAGAAA ACTGCAGTTA GTCGGAATCG CCTTACCCAA 200 

CACCCTGAAC CCAAGGGACG ACCTAGCGCA ATTGGCCGAA ATTATCCAGC 250 

TCATCGATCA CCTCATGAAA CCGCACGTTG ATATGTTGAC TCACTTGTTG 300 

- • GCGTCCATTG ATGGCCAGTC GGCGGTTATT TCATCGGCAA AATGGGGGGA 350 

10 GCTAGAAACG GCTCGGCAGG AGAAAGCTAT CTCAGGGGTA ACCAGATTGA 400 

AGCTCCGCTT GGCGTCGCTT GCCCCCGTCC TGAAAAAACA CGTCAACAGC 450 

GATTTGTTCC GAAAAGCCTC TGATCGAATA GAGTCGATAG AGTATACGTT 500 

GGAAACCTTG CGTATAATGA AAACTTTCTT, CGATGGTACC TCTCTTGAGG 550 

GAGATACTTC CGTACGTGAC TCGTATATGG CGGGCGTAGT AGATGGAATG 600 

15 GTTCGAGCGA ATCCGGATGT GAAGATAATT CTGCTGGCGC ACAACAATCA 650 

TCTACAAAAA ACTCCAGTCT CCTTTTCAGG CGAGCTTACG GCTGTTCCCA 700 

TGGGGCAGCA CCTCGCAGAG AGGGTGAATT ACCGTGCGAT TGCATTCACC 750 
CATCTTGGAC CCACCGTGCC GGAAATGCAT TTCCCATCGC CAAAAAGTCC . 800 

TCTTGGATTC TCTGTTGTGA CCACGCCTGC CGATGCAATC CGTGAGGATA 850 

20 GTATGGAACA GTATGTCATC GACGCCTGTG GTACGGAGAA TTCATGTCTG 900 

ACATTGACAG ATGCCCCCAT GGAAGCAAAG CGAATGCGGT CTCAAAGCGC 950 

CTCTGTAGAA ACGAAATTGA GCGAGGCATT TGATGCCATC GTCTGTGTTA 1000 

CAAGCGCCGG CAAGGACAGC CTGGTTGCCC TATAG 1035 



25 



2) INFORMATION FOR SEQ ID NO: 1531 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1531 
TCTTTTTGTT ACGACATACG CTTTT 25 

40 



2) INFORMATION FOR SEQ ID NO: 1532 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1532 

793 
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AGTGCTTCTT TATCCGCTGT TCTA 24 



2) INFORMATION FOR SEQ ID NO: 1533 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1533 
CAGCGGATAA AGAAGCACTA CACATT 26 



2) INFORMATION FOR SEQ ID NO: 1534 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1534 
CCTCCTGAAA TAAAGCCCGA CAT 23 



2) INFORMATION FOR SEQ ID NO: 1535 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: A15097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1535 
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ATGAGGTTCG AAGAATGGGT CAAAGATAAG CATATTCCTT NCAAACNGAA 50 

TCACCCTGAT GATAATTACG ATGATTTTAA GCCATTAAGA AANATAATTG 100 

GAGATACCCG AGTTGTAGCA TTAGGTGAAA ATTCTCATTT CATAAAAGAA 150 

TTCTTTTTGT T AC G AC AT AC GCTTTTGCGT TTTTTTATCG AAGATCTAGG 200 

5 TTTTACTACG TTTGCTTTTG AATTTGGTTT TGCTGAGGGT CAAATCATCA 250 

ATAACTGGAT ACATGGACAA GGAACTGACG ATGAAATAGG CAGATTCTTA 300 

AAACACTTCT ATTATCCAGA AGAGCTCAAA ACCACATTTC TATGGCTAAG 350 

GGAGTACAAT AAAGCAGCAA AAGAAAAAAT CACATTTCTT GGCATTGATA 400 

TACCCAGAAA TGGAGGTTCA TACTTACCAA ATATGGAGAT AGTGCATGAC 450 

-10 TTTTTTAGAA CAGCGGATAA AGAAGCACTA CACATTATCG ATGATGCATT 500 

TAATATTGCA AAAAAGATTG ATTACTTCTC CACATCACAG GCAGCCTTAA 550 
ATTTACATGA GCTAACAGAT TCTGAGAAAT GCCGTTTAAC TAGCCAATTA . 600 

GCTCGAGTAA AAGTTCGCCT TGAAGCTATG GCTCCAATTC ACATTGAAAA 650 

ATATGGGATT GATAAATATG AGACAATTCT GCATTATGCC AACGGTATGA 700 

15 TATACTTGGA CTATAACATT CAAGCTATGT CGGGCTTTAT TTCAGGAGGC 750 

GGAATGCAGG GCGATATGGG TGCAAAAGAC AAATACATGG CAGATTCTGT 800 

GCTGTGGCAT TTAAAAAACC CACAAAGTGA GCAGAAAGTG ATAGTAGTAG 850 

CACATAATGC ACATATTCAA AAAACACCCA TTCTGTATGA TGGATTTCTA 900 

AGTTGCCTAC CAATGGGCCA AAGACTTAAA AATGCCATTG GTGATGATTA 950 

20 TATGTCTTTA GGTATTACTT CTTATAGTGG GCATACTGCA GCCCTCTATC 1000 

CGGAAGTTGA TACAAAATAT GGTTTTCGAG TTGATAACTT CCAACTGCAG 1050 

GAACCAAATG AAGGTTCTGT CGAGAAAGCT ATTTCTGGTT GTGGAGTTAC 1100 

TAATTCTTTT GTCTTTTTTA GAAATATTCC TGAAGATTTA CAATCCATCC 1150 

CGAACATGAT TCGATTTGAN TCTATTTACA TGAAAGCAGA ACTCGAGAAA 1200 

25 GCTTTCGATG GAATATTTCA AATTGAAAAG TCATCTGTAT CTGAGGTCGT 1250 

TTATGAATAA 1260 



30 2) INFORMATION FOR SEQ ID NO: 1536 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1536 

AGATGTATTA ACTGGAAAAC AACAA 25 



45 



2) INFORMATION FOR SEQ ID NO: 1537 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1537 
5 CTTTGTAATT AGTTTCTGAA AACCA 25 



10 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1538 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1538 

20 

TTAGAAGATA TAGGATACAA AATAGAAG 28 

25 2) INFORMATION FOR SEQ ID NO: 1539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1539 

GAATGAAAAA GAAGTTGAGC TT 22 



2) INFORMATION FOR SEQ ID NO: 1540 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 86 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
(C) ACCESSION NUMBER: M14039 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1540 

ATGAAAAATA ATAATGTAAC AGAAAAAGAA TTATTTTATA TTTTAGATTT 50 

5 ATTTGAACAC ATGAAAGTAA CTTATTGGTT AGATGGTGGC TGGGGGGTAG 100 

ATGTATTAAC TGGAAAACAA CAAAGAGAAC ACAGAGATAT AGATATAGAT 150 

TTTGACGCTC AACACACTCA AAAAGTTATA CAAAAATTAG AAGATATAGG 200 

ATACAAAATA GAAGTTCATT GGATGCCTTC ACGTATGGAA CTTAAGCATG 250 

AAGAATATGG GTATTTAGAT ATTCATCCTA TAAATCTAAA TGATGATGGA 300 

10 TCAATTACCC AAGCAAACCC AGAAGGTGGT AATTATGTTT TCCAAAATGA - 350 

CTGGTTTTCA GAAACTAATT ACAAAGATCG AAAAATACCA TGTATTTCAA 4 00 

AAGAAGCTCA ACTTCTTTTT CATTCTGGTT ATGATTTAAC AGAAACAGAC 4 50 

CATTTTGATA TAAAAAATTT AAAATCAATA ACATAA 486 



15 



2) INFORMATION FOR SEQ ID NO: 1541 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1541 
■ TGATAATCTT ATACGTGGGG AATTT 25 

30 



2) INFORMATION FOR SEQ ID NO: 1542 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid. 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1542 
45 ATAATTTTCT AATTGCCCTG TTTCAT 26 



2) INFORMATION FOR SEQ ID NO: 1543 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

797 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1543 
GGGCAATTAG AAAATTATTT ATCAGA 26 

2) INFORMATION FOR SEQ ID NO: 1544 



(1) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1544 
TTTTACTCAT GTTTAGCCAA TTATCA 26 

25 



2) INFORMATION FOR SEQ ID NO: 1545 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
40 (C) ACCESSION NUMBER: AF110130 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1545 

ATGTTAAAAC AAAAAGAATT AATTGCAAAC GTTAAGAATC TTACTGAGTC 50 

45 AGATGAACGA ATTACAGCTT GTATGATGTA TGGATCGTTT ACCAAAGGAG 100 

AAGGTGACCA ATACTCTGAT ATAGAGTTCT ATATATTTTT GAAACATAGT 150 

ATAACCTCGA ACTTTGATTC ATCCAACTGG TTGTTTGACG TAGCTCCGTA 200 

CTTGATGCTT TATAAAAATG AGTACGGAAC AGAGGTAGTT ATTTTTGATA 250 

ATCTTATACG TGGGGAATTT CATTTCCTTT CTGAAAAAGA TATGAACATA 300 

50 ATCCCCTCGT TTAAAGATTC AGGTTATATT CCTGATACGA AGGCTATGCT 350 

TATTTACGAT GAAACAGGGC AATTAGAAAA TTATTTATCA GAGATAAGTG 400 

GTGCAAGACC AAATAGACTT ACTGAAGAAA ATGCTAATTT TTTGTTGTGT 450 

AATTTCTCTA ATCTATGGTT GATGGGAATC AACGTTCTAA AAAGAGGAGA 500 

798 
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ATATGCTCGT TCATTAGAAC TCTTATCACA 

AACTTATACG TATGGCAGAA AAAAATGCTG 

AAAAACCTTG AAAAAGAAAT TAGCCTTGAA 

GACCACTGCT CGATTAGATA AGGTAGAATT 

CTTTGCTATT AGTTATGGAT TTGCAAAGTC 

TTAAAAGTTA CACATGACAT TTTAGAAAGA 
ATAG 



PCT/CA00/01150 

ACTTCAAAAA AATACACTAC 550 

ATAATTGGCT AAACATGAGT 600 

AATTATAAAA AATTTGCAAA 650 

ATTTGAAGCC TATAAAAATT 700 

ACCTTATTGA ACAATACAAC 750 

TTGTTGAATT ACATTAGTGA 800 

804 



2) INFORMATION FOR SEQ ID NO: 1546 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1546 
CAAGAAGGAA TGGCTGTACT AC 22 



2) INFORMATION FOR SEQ ID NO: 1547 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1547 
TAATTCCCAA ATAACCCTAA TAATAGA 27 



2) INFORMATION FOR SEQ ID NO: 1548 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

799 
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(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: U70055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1548 

5 

ATGGAAAAAT ACAACAATTG GAAACTTAAG TTTTATACAA TATGGGCAGG 50 

GCAAGCAGTA TCATTAATCA CTAGTGCCAT CTTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACT GGATCCGCGA TGGTCTTGTC TATGGCTTCA 150 

CTATTAGGTT TTTTACCCTA TGCGGTCTTT GGACCTGCAA TTGGTGTGCT 200 

10" -AGTGGATCGT- CATGATAGGA- AGAAGATAAT - GATTGGTGCT- GATT-TAATTA 250 

TCGCAGCAGC TGGTTCGGTG CTTACTATTG TTGCATTCTA TATGGAGCTA 300 

CCTGTCTGGA TGGTTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCACACC CCGGCTCTCA ATGCGGTTAC, GCCACTTTTA GTACCAGAAG 4 00 

AACAGCTTAC GAAATGTGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 4 50 

15 TATATTGTTA GTCCGGCGGT TGCAGCACTC TTATACTCCG TTTGGGAACT 500 

AAATGCTATT ATTGCCATCG ATGTATTGGG TGCTGTGATT GCATCTATTA 550 

CGGTAGCAAT TGTACGTATT CCTAAGCTGG GTGATCGCGT GCAAAGTTTG 600 

GACCCAAATT TCATAAGAGA AATGCAAGAA GGAATGGCTG TACTACGGCA 650 

AAATAAAGGA TTATTTGCTT TATTACTCGT TGGAACATTA TATATGTTTG 700 

20 TTTATATGCC AATTAATGCA CTATTCCCTT TAATTAGCAT GGATTACTTT 750 

AATGGAACAC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATCTGG 800 

AATGTTGATA GGGGGTCTAT TATTAGGGTT ATTTGGGAAT TACCAAAAGC 850 

GAATCTTATT AATAACGGCA TCCATTTTTA TGATGGGGAT AAGCTTAACC 900 

ATTTCAGGAT TACTTCCCCA AAGTGGATTT TTCATTTTTG TAGTCTGCTG 950 

25 TGCAATAATG GGGCTTTCTG TTCCGTTTTA CAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTA 1050 

ACTGGAAGTA TCATGTCTCT TGCTATGCCA ATTGGATTAA TTCTTTCTGC 1100 

ACTCTTTGCT GATAGAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

CTTTAATTAT TTGCATTGCA ATAGTTTGCC CAATGATAAA TGAGATTAGA 1200 

30 AAATTAGATT TAAAATAA 1218 



35 



2) INFORMATION FOR SEQ ID NO: 1549 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1549 

45 

GCTTATTATT AGGAAGATTA GGGGGC 26 



50 2) INFORMATION FOR SEQ ID NO: 1550 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 

800 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1550 
TAGCAAGTGA CATGATACTT CCGA 



2) INFORMATION FOR SEQ ID NO: 1551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
(C) ACCESSION NUMBER: U83667 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1551 

ATGGAAAAAT ACAACAATTG GAAACGAAAA TTTTATGCAA TATGGGCAGG 50 

GCAAGCAGTA TCATTAATCA CTAGTGCCAT CCTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACA GGATCTGCGA TGGTCTTGTC TATGGCTTCA 150 

TTAGTAGGTT TTTTACCCTA TGCGATTTTG GGACCTGCCA TTGGTGTGCT 200 

AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCC GATTTAATTA 250 

TCGCAGCAGC TGGTGCAGTG CTTGCTATTG TTGCATTCTG TATGGAGCTA 300 

CCTGTCTGGA TGATTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCATACC CCAGCACTCA ATGCGGTTAC ACCACTTTTA GTACCAGAAG 400 

AACAGCTAAC GAAATGCGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 450 

TATATTGTTA GTCCGGCAGT TGCAGCACTC TTATACTCCG TTTGGGATTT 500 

AAATGCTATT ATTGCCATCG ACGTATTGGG TGCTGTGATT GCATCTATTA 550 

CGGTAGCAAT TGTACGTATA CCTAAGCTGG GTAATCAAGT GCAAAGTTTA 600 

GAACCAAATT TCATAAGGGA GATGAAAGAA GGAGTTGTGG TTCTGAGACA 650 

AAACAAAGGA TTGTTTGCCT TATTACTCTT AGGAACACTA TATACTTTTG 700 

TTTATATGCC AATCAATGCA CTATTTCCTT TAATAAGCAT GGAACACTTT 750 

AATGGAACGC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATTTGG 800 

GATGCTAGCA GGAGGCTTAT TATTAGGAAG ATTAGGGGGC TTCGAAAAGC 850 

ATGTATTACT AATAACAAGT TCATTTTTTA TAATGGGGAC CAGTTTAGCC 900 

GTTTCGGGAA TACTTCCTCC AAATGGATTT GTAATATTCG TAGTTTGCTG 950 

TGCAATAATG GGGCTTTCGG TGCCATTTTA TAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTG 1050 

ATCGGAAGTA TCATGTCACT TGCTATGCCA ATTGGGTTAA TTCTTTCTGG 1100 

ATTCTTTGCT GATAAAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

TTTTAATTAT TGGCATTGCT ATAGTTTGCC AAATGATAAC TGAGGTTAGA 1200 

AAATTAGATT TAAAATAA 1218 
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2) INFORMATION FOR SEQ ID NO: 1552 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1552 
GGCAAGCAGT ATCATTAATC ACTA , 24 

2) INFORMATION FOR SEQ ID NO: 1553 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1553 
CAATGCTACG GATAAACAAT ACTATC 26 

30 



2) INFORMATION FOR SEQ ID NO: 1554 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1554 
45 AGAAAATTAA GCCTGAATAT TTAGGAC 27 



2) INFORMATION FOR SEQ ID NO: 1555 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

802 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1555 
TAGTAAAAAC CAATGATTTA CACCG 25 



2) INFORMATION FOR SEQ ID NO:. 1556 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1556 
ACTGTACGCA CTTGCAGCCC GACAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1557 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1557 
4 0 GAACGGCAGG CGATTCTTGA GCAT 24 



2) INFORMATION FOR SEQ ID NO: 1558 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



803 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1558 



GTGGTGGTGC ATGGCGATCT CT 22 



2) INFORMATION FOR SEQ ID NO: 1559 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1559 
GCCGCAGCGA GGTACTCTTC GTTA 24 

20 



2) INFORMATION FOR SEQ ID NO: 1560 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
35 (C) ACCESSION NUMBER: D16251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1560 

ATGACCGTAG TCACGACCGC CGATACCTCC CAACTGTACG CACTTGCAGC 50 

40 CCGACATGGG CTCAAGCTCC ATGGCCCGCT GACTGTCAAT GAGCTTGGGC 100 

TCGACTATAG GATCGTGATC GCCACCGTCG ACGATGGACG TCGGTGGGTG 150 

CTGCGCATCC CGCGCCGAGC CGAGGTAAGC GCGAAGGTCG AACCAGAGGC 200 

GCGGGTGCTG GCAATGCTCA AGAATCGCCT GCCGTTCGCG GTGCCGGACT 250 

GGCGCGTGGC CAACGCCGAG CTCGTTGCCT ATCCCATGCT CGAAGACTCG 300 

45 ACTGCGATGG TCATCCAGCC TGGTTCGTCC ACGCCCGACT GGGTCGTGCC 350 

GCAGGACTCG GAGGTCTTCG CGGAGAGCTT CGCGACCGCG CTCGCCGCCC 400 

TGCATGCCGT CCCCATTTCC GCCGCCGTGG ATGCGGGGAT GCTCATCCGT 450 

ACACCGACGC AGGCCCGTCA GAAGGTGGCC GACGACGTTG ACCGCGTCCG 500 

ACGCGAGTTC GTGGTGAACG ACAAGCGCCT CCACCGGTGG CAGCGCTGGC 550 

50 TCGACGACGA TTCGTCGTGG CCAGATTTCT CCGTGGTGGT GCATGGCGAT 600 

CTCTACGTGG GCCATGTGCT CATCGACAAC ACGGAGCGCG TCAGCGGGAT 650 

GATCGACTGG AGCGAGGCCC GCGTTGATGA CCCTGCCATC GACATGGCCG 700 

CGCACCTTAT GGTCTTTGGT GAAGAGGGGC TCGCGAAGCT CCTCCTCACG 750 

804 
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TATGAAGCGG CCGGTGGCCG 
GGAGCGCCTT GCGTTCGGGG 
CGGGTAACGA AGAGTACCTC 
GAATGA 



GGTGTGGCCG CGGCTCGCCC 
CGGTCACCTA CGCACTCTTC 
GCTGCGGCGA AGGCGCAGCT 



ACCACATCGC 800 
GCCCTCGACT 850 
CGCCGCAGCG 900 
906 



2) INFORMATION FOR SEQ ID NO: 1561 

10" " " (i ) "SEQUENCE CHARACTERISTICS :- 

(A) LENGTH: 1048 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE:' Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 
20 (B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1561 

GATCATGGTA AAACTACATT GACTGCTGCT ATCACCAAAG TTTTAGCCGA 50 

25 ACAAGGTGGT GCCAACTTCT TGGATTAYGG TTCTATTGAT AGAGCTCCAG 100 

AAGAAAGAGC TAGAGGTATC ACTATTTCCA CTGCCCACGT TGAATACGAA 150 

ACCAAGAACA GACACTATGC CCACGTTGAT TGTCCAGGAC ACGCTGATTA 2 00 

TATCAAAAAT ATGATTACTG GTGCCGCTCA AATGGATGGT GCTATCATTG 250 

TTGTTGCTGC CACTGATGGT CAAATGCCTC AAACCAGAGA ACATTTGTTA 300 

30 TTGGCCAGAC AAGTTGGTGT TCAAGACTTG GTTGTGTTTG TCAACAAAGT 350 

CGATACTATT GATGACCCTG AAATGTTGGA ATTAGTCGAA ATGGAAATGA 400 

GAGAATTGTT ATCCACCTAC GGTTTTGATG GTGACAACAC TCCAGTTATT 450 

ATGGGATCTG CTTTAATGGC TTTGGAAGAC AAGAAACCAG AAATTGGTAA 500 

GGAAGCTATC TTGAAATTGT TAGATGCTGT CGATGAACAC ATTCCAACTC 550 

35 CATCAAGAGA CTTGGAACAA CCATTTTTGT TACCAGTTGA AGACGTGTTC 600 

TCCATCTCCG GTAGAGGAAC TGTTGTCACT GGTAGAGTTG AAAGAGGTGT 650 

TTTGAAGAAG GGTGAAGAAA TCGAAATTGT TGGTGGTTTT GACAAACCTT 700 

ACAAGACTAC TGTTACCGGT ATTGAAATGT TCAAAAAAGA ATTAGACTCT 750 

GCTATGGCTG GTGACAACTG TGGTGTTTTG TTAAGAGGTG TTAAAAGAGA 800 

40 TGAAATCAAG AGAGGTATGG TTTTGGCCAA ACCAGGTACT GCCACTTCTC 850 

ACAAGAAGTT CTTGGCTTCC TTGTATATTT TGACTTCCGA AGAAGGTGGY 900 

CGTTCCACTC CATTTGGTGA AGGTTACAAG CCTCAATGCT TCTTCAGAAC 950 

TAACGATGTC ACTACCACAT TTTCATTCCC AGAAGGAGAA GGTGTTGACC 1000 

ATTCTCAAAT GATCATGCCA GGTGACAACA TTGAAATGGT TGGTGAAT 1048 

45 



2) INFORMATION FOR SEQ ID NO: 1562 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1074 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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15 
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25 



30 
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40 



45 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1562 

TGATCACGGT AAAACCACAT TAACTGCTGC CATTACCAAA GTATTAGCTG 50 

AACAAGGTGG TGCCAACTTT TTGGATTACG GTTCCATTGA TAGAGCTCCA 100 

GAAGAAAGAG CCAGAGGTAT CACTATTTCC ACTGCCCACG TTGAATACGA 150 

AACCAAGAAC AGACACTATG CCCACGTTGA TTGTCCAGGA CACGCTGATT 200 

ATATCAAAAA CATGATTACT GGTGCTGCTC AAATGGATGG TGCTATCATT 250 

GTTGTTGCTG CTACTGACGG TCAAATGCCA CAAACCAGAG AACATTTATT 300 

GTTGGCAAGA CAAGTTGGTG TTCAAGACTT GGTTGTCTTT GTCAACAAAG 350 

TTGATACTAT TGATGACCCT GAGATGTTGG AATTAGTCGA AATGGAAATG 400 

AGAGAATTGT TGTCCACCTA CGGTTTTGAT GGTGACAACA CTCCTGTTAT 450 

TATGGGATCT GCTTTAATGG CCTTGGAAGG CAAAAAACCA GAAATTGGTA 500 

AGGAAGCTAT TTTGAGATTG TTAGATGCTG TCGATGAACA CATTCCAACT 550 

CCATCAAGAG ACTTGGAACA ACCATTTTTG TTGCCAGTTG AAGACGTGTT 600 

CTCCATCTCT GGTAGAGGAA CTGTTGTCAC CGGTAGAGTT GAAAGAGGTG 650 

TCTTGAAGAA GGGTGAAGAA ATCGAAATTG TTGGTGGTTT TGACAAACCA 700 

TACAAGACCA CTGTTACTGG TATTGAAATG TTCAAAAAGG AATTAGATTC 750 

TGCTATAGCT GGTGACAACT GTGGTGTTTT GTTGAGAGGT GTTAAAAGAG 800 

ATGAAATCAA GAGAGGTATG GTTTTGGCCA AGCCAGGTAC TGCTACTTCT 850 

CACAAGAAAT TTTTAGCATC TTTGTATATT TTGACTTCAG AAGAAGGTGG 900 

TCGTTCCACT CCATTTGGAG AAGGTTACAA GCCTCAATGT TTCTTCAGAA 950 

CTAATGACGT CACTACCACA TTTTCATTCC CAGAAGGAGA AGGTGTTGAC 1000 

CACTCCCAAA TGGTCATGCC AGGTGATAAC ATTGAAATGG TTGGTGAATT 1050 

GATCAAATCA TGTCCATTGG AAGT 1074 



2) INFORMATION FOR SEQ ID NO: 1563 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1563 

GATCACGGGA AGACTACTTT GACCGCTGCC ATCACCAAAG TTTTAGCCGA 50 
AAAAGGTGGT GCTAACTTCT TGGACTACGG TTCTATCGAT AAAGCTCCAG 100 
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AAGAAAGAGC CAGAGGTATT ACTATTTCTG 
ACTGACAAGA GACACTATGC CCATGTTGAT 
TATCAAGAAT ATGATTACTG GTGCTGCTCA 
TTGTTGCTGC TTCCGATGGT CAAATGCCTC 
TTGGCCAGAC AAGTTGGTGT TCAACACTTG 
CGACACCATT GACGATCCAG AAATGTTGGA 
GAGATTTGTT AACTACTTAC GGTTTTGATG 
ATGGGATCTG CTTTGTGTGC TTTGGAATCC 
AAAAGCCATT GAAAAATTGT TAGATGCCGT 
CAGTCAGAGA- CTTGGAACAA CCATTCTTGA 
TCCATTTCCG GTAGAGGTAC CGTTGTTGCT 
CTTGAAGAAG GGTGAAGAAA TCGAAATTGT 
TCAAGGCCAC TGTTACTGGT ATTGAAATGT 
GCTCTTGCTG GTGACAACTG TGGTATCTTG 
CGAAGTTAAG AGAGGTATGG TCTTGACCAA 
ACAAGAAGAT CTTGGCCTCG TTGTATATCT 
AGACACTCTC CATTTGGAGC CAACTACAAG 
CACCGATGTT ACCGGTACCA TGACCTTCCC 
CCATGGTCAT GCCAGGTGAC AACGTTGAAA 



CTGCCCATGT TGAATACGAA 150 

TGTCCAGGTC ACGCAGATTA 200 

AATGGATGGT GCCATTATTG 250 

AAACCAGAGA ACATTTGTTA 300 

GTTGTTTTCG TCAACAAGGT 350 

ATTGGTTGAA ATGGAAATGA 400 

GTGATAACAC CCCAGTTATC 450 

AGAGAACCAG AAATTGGTCA 500 

CGATGAATAC ATTCCAACCC 550 

TGCCAGTTGA- AGAAGTTTTC 600 

GGTAGAGTCG AAAGAGGTAC 650 

TGGTGGTTTC GACAAGCCAT 700 

TCAAGAAGGA ATTGGACTCC 750 

TTGAGAGGTG TCAAGAGAGA 800 

GCCAAACACC GTCACTTCCC 850 

TGACCAAGGA AGAAGGTGGT 900 

CCCCAATTGT TCATGAGAAC 950 

AGAAGGTGCC GACCAATCTG 1000 

TGC 1033 



2) INFORMATION FOR SEQ ID NO: 1564 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1564 

GATCACGGTA AGACTACATT GACAGCTGCT ATCACCAAGA CATTGGCCAA 50 

GAACGGTGGT GCTGATTTCT TGGACTACTC TTCCATTGAC AAAGCTCCAG 100 

AGGAGAGAGC CCGTGGTATC ACTATCTCTA CTGCCCATGT CGAGTACGAG 150 

ACCGCCAAGA GACATTACTC CCACGTCGAC TGTCCAGGTC ACGCCGACTA 200 

CATCAAGAAC ATGATTACTG GTGCTGCCCA AATGGACGGT GCTATCATCG 250 

TTGTCGCCGC CACCGATGGT CAAATGCCAC AAACTAGAGA GCATTTGCTG 3 00 

TTGGCCAGAC AAGTCGGTGT TCAACGTATC GTTGTCTTTG TCAACAAGGT 350 

GGACACCATC GATGACCCTG AAATGTTGGA ATTAGTGGAA ATGGAAATGA 400 

GAGAATTGTT GAACGAATAC GGTTTTGACG GTGACAATGC CCCTATCATT 450 

ATGGGTTCCG CTTTGTGTGC CCTAGAAGGT CGTCAACCTG AAATTGGTGA 500 

GCAAGCTATC ATGAAACTAT TGGACGCTGT TGATGAATAC ATTCCAACCC 550 

CAGAAAGAGA CTTGAACAAG CCATTCTTGA TGCCTGTTGA AGACATCTTC 600 

TCCATCTCTG GTAGAGGTAC CGTCGTCACT GGACGTGTCG AAAGAGGTAA 650 

CTTGAAGAAG GGTGAAGAAG TTGAAATTGT TGGTCACAAC ACTACCCCAT 700 

TGAAGACCAC CGTTACTGGT ATCGAAATGT TCAGAAAGGA ATTGGACCAA 750 
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GCTATGGCTG GTGACAACGC 
CCAATTGAAG AGAGGTATGG 
ACACCAAGAT TTTGGCTTCT 
AGACATTCTG GTTTCGGTGA 
5 CGCAGATGTC ACTGTTGTGA 
CTATGCAAGT TATGCCAGGT 
CACCCA 



CGGTATCCTA TTGAGAGGTA 
TCATGGCCAA GCCAGGTACC 
TTGTACATCT TGTCTAAGGA 
AAACTACAGA CCTCAGATGT 
TGAAGTTCCC AGAATCTGTG 
GACAACGTCG AAATGGTCTG 



TCAGAAGAGA 800 

GTCAAGGCTC 850 

AGAAGGTGGT 900 

TTATCAGAAC 950 

GAAGACCACT 1000 

TGAACTAGTC 1050 
1056 



10 



20 



50 



2) INFORMATION FOR SEQ ID NO: 1565 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1061 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 6260 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1565 

GATCATGGTA AGACCACTTT GACCGCTGCT ATCACCAAGG TTTTGTCGGA 50 

AAAAGGAGGT GCTAATTTCT TGGATTACGG CTCCATCGAC AGAGCTCCAG 100 

AAGAGAGAGC CAGAGGTATC ACCATTTCCA CTGCCCATGT TGAGTACCAA 150 

30 ACTGATAAGA GACATTATGC CCACGTTGAC TGTCCAGGTC ACGCCGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCCATTATTG 250 

TTGTTGCTGC CACTGACGGT CAAATGCCTC AGACCAGAGA GCACTTGTTG 300 

TTGGCCAGAC AAGTTGGTGT GCAACACTTG GTAGTTTTTG TGAACAAGGT 350 

GGACACCATT GACGATCCCG AGATGTTGGA ATTGGTCGAG ATGGAAATGA 400 

35 GAGAATTGTT GAGTCAGTAC GGTTTCGATG GTGACAACAC CCCAGTTATC 450 

ATGGGATCTG CTTTGTGTGC TTTGGAAAGT AAGCAGCCAG AAATTGGTGT 500 

GCAAGCCATT GAAAAATTGT TGGACGCTGT CGATGAGCAC ATTCCTACTC 550 

CTACCCGTGA CTTGGAACAG CCATTCTTGT TGCCTGTTGA AGATGTGTTC 600 

TCCATTTCTG GTAGAGGAAC TGTGGTTACT GGTAGAGTCG AAAGAGGTTC 650 

4 0 GTTGAAGAAG GGTGAGGAAA TCGAGATTGT TGGTGACTTT GACAAGCCAT 700 

TCAAGACCAC TGTGACTGGA ATTGAAATGT TCAAGAAGGA ATTGGATGCT 750 

GCTATGGCTG GTGACAATGC TGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

CGATGTCAAG AGAGGTATGG TTTTGGCCAA GCCTTCCACC GTCACTTCTC 850 

ACAAGAAGGT GTTGGCTTCC TTGTACATCT TGAGTAAGGA AGAAGGTGGC 900 

45 CGTCACTCTC CTTTTGGTGA GAACTACAAG CCTCAATTGT TCATCAGAAC 950 
TACTGACGTT ACCGGTACTT TAAGATTCCC AGCCGGCGAG GGTGTCGACC . 1000 

ACTCGCAAAT GGTTATGCCA GGTGACAATG TTGAGATGGA AATTGAGCTT 1050 

GTGAGAAAGA C 1061 



2) INFORMATION FOR SEQ ID NO: 1566 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 



(B) STRAIN: ATCC 22991 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1566 

15 GATCACGGTA AGACTACCTT GACTGCTGCT ATCACCAAGG TTTTGGCTTC 50 

TAAGGGTGGT GCTAGCTTCT TGGACTATGG TTCCATTGAC AGAGCCCCAG 100 

AGGAGAGAGC TAGAGGTATT ACTATTTCTA CTGCCCACGT TGAGTACCAA 150 

ACCGAAAAGA GACACTACGC CCACGTCGAC TGTCCAGGTC ACGCTGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCTATCATTG 250 

20 TTGTTGCTGC TTCTGATGGT CAGATGCCTC AGACCAGAGA GCACCTTTTG 300 

TTGGCCAGAC AGGTTGGTGT TCAGAACTTG GTTGTTTTCG TTAACAAGGT 350 

TGACACCATT GACGACCCTG AAATGTTGGA ATTGGTTGAG ATGGAAATGA 400 

GAGAATTGTT GACTACTTAC GGTTTTGACG GTGATGAGAC TCCTGTTATC 450 

ATGGGTTCTG CTTTGTGCGC TTTGGAAGAG AAGCAACCAG AGATTGGTGA 500 

25 GCAGGCTATC ATGAAGTTGT TGGACGCTGT CGATGAGTAC ATTCCAACCC 550 

CACAGCGTGA CTTGGAGCAG CCATTCTTGA TGCCTGTTGA GGATGTTTTC 600 

TCCATTTCTG GTAGAGGTAC TGTCGTTACT GGTAGAGTTG AGAGAGGTTC 650 

TTTGAAGAAG GGTGAGGAAA TCGAGATTGT CGGTGACTTC GCCAAGACTT 700 

TCAAGGCTAC CGTTACTGGT ATTGAGATGT TCAAGAAGGA ATTGGATGCT 750 

30 GCTATGGCTG GTGACAACGC CGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

TGAGATCTCC CGTGGTGATG TCTTGGCCAA GCCAGGTACT GTTACTCCAC 850 

ACAAGAAGAT CTTGGCTTCT TTGTACGTTT TGACCAAGGA AGAAGGTGGT 900 

CGTCACAACC CATTCGCTGA GAACTACAAG CCACAGTTGT TCCTCAGAAC 950 

CACCAACGTC ACTGGTACCA TGAGATTCCC AGAAGGTGAA GATGTTGACC 1000 

35 ACTCTGCCAT GGTTAACCCA GGTGACAACG TTGAGATGGA AATCGAGTTG 1050 

GGTAGAAAGG CCCCACTTGA GTT 1073 



40 2) INFORMATION FOR SEQ ID NO: 1567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

809 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1567 



CATGGTAAGA CCACTTTGAC TGCTGCCATC ACCAAGACTC TAGCTGAACG 50 

TGGTGGTGCT GACTTTTTGG ACTACTCTTC TATTGACAAG GCTCCAGAAG 100 

5 AAAGAGCYAG AGGTATCACT ATTTCTACTG CTCATGTTGA ATACGAGACT 150 

GAAAAGAGAC ATTACTCCCA CGTTGACTGT CCAGGTCACG CTGATTACAT 200 

CAAGAACATG ATTACTGGTG CTGCTCAAAT GGACGGTGCT ATTATTGTTG 250 

TTGCTGCTAC TGATGGTCAA ATGCCTCAAA CCAGAGAGCA TTTGTTGTTG 300 

GCCAGACAAG TTGGTGTCCA ACACATTGTT GTTTTCGTTA ACAAGGTTGA 350 

10" CACCATCGAT'GATCCAGAAA TGTTGGAATT GGTTGAAATG -GAAATGAGAG 400 

AATTRTTGAC TCAATATGGC TTTGACGGTG ACAACACTCC AGTGATCATG 450 

GGTTCTGCTT TGTGTGCCTT GGAAGGTAAG CAACCAGAAA TTGGTGAGCA 500 

AGCCATCATG AAGTTGTTGG ACGCTGTTGA, CGAATACATC CCAACCCCAG 550 

CCCGTGACTT GGAAAAVCCA TTCTTGATGC CTGTTGAAGA TATCTTCTCC 600 

15 ATTTCCGGTA GAGGTACTGT CGTCACTGGT AGAGTTGAAC GTGGTAACTT 650 

GAAGAAGGGT GAAGAAATCG AAATTGTTGG TCACAACACC ACTCCTTTCA 700 

AGACTACTGT TACTGGTATT GAAATGTTCA GAAAGGAATT GGACCAAGCC 750 

ATGGCTGGTG ACAACGCTGG TGTCCTTTTG AGAGGTGTCA GAAGAGACCA 800 

ATTGAAGAGA GGTATGGTTT TGGCTAAGCC AGGTACTGTT AAGGCCCACA 850 

20 CCAAGTTCTT GGCTTCCTTG TACATTTTGA CCAAGGAAGA AGGTGGTAGA 900 

CACTCCGGTT TCGGTGAAAA CTACAGACCA CAAATCTACG TCAGAACTGC 950 

TGACGTTACC GTDGTCTTGA AGTTCCCAGA ATCTGTTGAA GACCATTCCA 1000 

TGCAAGTCAT GCCAGGTGAC AATGTCGAAA TGGAGTGTGA ATTGGTTCAC 1050 

CCAACTCCAT TG 1062 

25 



2) INFORMATION FOR SEQ ID NO: 1568 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 
40 (B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1568 

CGGAAAGACC ACCTTGACCG CCGCCATTAC CAAGGTTTTG GCTGACAAGG 50 

45 GTGGCGCCAA CTTCTTGGAC TACGGTGCCA TTGACAAGGC TCCTGAAGAA 100 

AGAGCACGTG GTATCACCAT TTCCACTGCC CACGTTGAAT ACGAGACCGA 150 

CAACAGACAC TACGCCCACG TTGACTGTCC AGGTCACGCC GATTACATCA 200 

AGAACATGAT CACGGGTGCC GCTCAAATGG ACGGTGCCAT TATTGTTGTT 250 

GCAGCCACCG ACGGCCAAAT GCCTCAAACC AGAGAGCACT TGTTGTTGGC 300 

50 CAGACAAGTT GGTGTGCAAC ACTTGGTTGT TTTCGTGAAC AAGGTTGATA 350 

CCATCGACGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGAGAA 4 00 

TTGTTGACTC AATACGGATT TGACGGCGAT GAAACCCCTG TTGTTATGGG 450 

CTCTGCTCTT TGCGCTTTGG AAGGTAGAGA ACCAGAGATT GGTGAGCAAG 500 
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CGTGACTTGG AACAACCATT CTTGATGCCT 
TTCTGGTAGA GGTACTGTTG TCACTGGTAG 
AGAAGGGTGA GGAGATCGAG ATTGTTGGTG 
5 ACTACTGTTA CTGGTATTGA GATGTTCAAG 
GGCTGGTGAC AATGCTGGTA TCTTGTTGAG 
TTTCCCGTGG TATGGTTTTG GCCAAGCCAG 
AAGGTTTTGG CTTCTTTGTA CATTTTGTCT 
CTCTCCATTT GGCGAGAACT ACAAGCCTCA 
10 ATGTCACTGG TACTTTGAGA TTCCCAGCAG 
GCTATGGTTT CTCCAGGTGA CAATGTCGAG 
AAAGACTCCT CT 
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AGTACATCCC AACCCCACAA 550 

GTTGAAGATG TTTTCTCCAT 600 

AGTGGAGAGA GGTTCCTTGA 650 

ACTTTGACAA GCCTTTCAAG 700 

AAGGAATTGG ACGCTGCTAT 750 

AGGTGTCAAG AGAGAACAAG 800 

GCACTGTGAC CTCGCACAAG 850 

AAGGAAGAAG GTGGTCGTCA 900 

ATTGTTCCTT AGAACTACCG 950 

GTGAGGACGT TGACCACTCC - 1000 - 

ATGGAAATCG AGTTGGTCAG 1050 

1062 



15 



2) INFORMATION FOR SEQ ID NO: 1569 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 990 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida spha erica 

(B) STRAIN: ATCC 2504 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1569 

TCACCAAGAC TTTGGCTGAA CGTGGTGGTG CTGATTTCTT GGACTACTCT 50 

35 TCCATTGACA AGGCTCCAGA AGAAAGAGCR AGAGGTATCA CTATTTCTAC 100 

TGCACATGTT GAATATGAAA CTGACAAGAG ACATTACTCT CACGTCGACT 150 

GTCCAGGTCA TGCTGATTAC ATCAAGAATA TGATTACTGG TGCTGCCCAA 2 00 

ATGGATGGTG CTATCATTGT TGTTGCTGCT ACAGATGGTC AAATGCCTCA 250 

AACCAGAGAA CATTTGTTGT TGGCTAGACA AGTTGGTGTT CAACAYATCG 300 

40 TTGTTTTCGT TAACAAGGTT GACACTATCG ATGACCCTGA AATGTTAGAA 350 

TTGGTTGAAA TGGAAATGAG AGAATTATTG ACCCAATACG GTTTCGATGG 400 

TGACAACACT CCAGTCATCA TGGGTTCTGC TTTGTGTGCT TTAGAAGGTA 450 

AGCAACCAGA AATTGGTGAA CAAGCAATCA TGAAGTTATT GGACGCTGTT 500 

GACGAATACA TCCCAACTCC AGCTCGTGAT TTGGAAAAGC CTTTCTTGAT 550 

45 GCCTGTTGAA GATATCTTCT CCATCTCCGG TAGAGGTACC GTCGTAACTG 600 

GTAGAGTTGA ACGTGGTAAC TTGAAKAAGG GTRAAGAAAT CGAAATCGTT 650 

GGTCACAACA CCACTCCATT CAAGACCACT GTTACTGGTA TTGAAATGTT 700 

CAGAAAGGAA TTGGACCAAG CTATGGCTGG TGATAACGCT GGTGTCWTGT 750 

TGAGAGGTGT CAGAAGAGAC CAATTAAAGA GAGGTATGGT CTTGGCCAAG 800 

50 CCAGGTACTG TCAAGGCTCA CACCRAATTC TTGGCCTCTT TGTATATCTT 850 

GACCAAGGAA GAAGGTGGTA GACATTCCGG TTTCGGTGAA AATTACAGAC 900 

CTCAAATCTA CGTTAGAACT GCTGATGTCA CCGTTGTTTT GAAGTTCCCA 950 

GAAGCTGTTG AAGATCACTC TATGCAAGTC ATGCCAGGTG 990 

811 
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2) INFORMATION FOR SEQ ID NO: 1570 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1184 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

— (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 750 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1570 

GATCATGGTA AAACCACTTT GACTGCTGCC ATTACTAAAG TCTTGGCTGA 50 

TAAAGGTCAA GCTAACTTCT TAGATTACGG TTCTATTGAT AGAGCTCCAG 100 

AAGAAAGAGC CAGAGGTATC ACCATCTCTA CTGCCCACGT TGAATATGAA 150 

20 ACCGAAAAAA GACATTACGC CCATGTTGAG TATGTATACT TTTTTTTGTT 200 

GTGTAATTGT TTTAAAGATT TTCTTTAAAG CTGAAGAAGT CAAATCAGTT 250 

CTTTGATAAC TTCTATTAAA AAAAGGGAAA AATTAACAAG ATATACTAAC 300 

ACTATAACAG TTGTCCTGGA CATCAAGATT ATATCAAGAA TATGATTACC 350 

GGTGCCGCTC AAATGGATGG TGCTATTATT GTTGTTGCTG CCACTGATGG 4 00 

25 TCAAATGCCA CAAACCAGAG AACATTTGTT GTTGGCTAGA CAAGTCGGTG 450 

TTCAAGATTT GGTTGTCTTT GTTAACAAAG TCGACACTAT TGATGACCCA 500 

GAAATGTTGG AATTGGTTGA AATGGAAATG AGAGAATTAT TGACTACTTA 550 

CGGTTTTGAT GGTGATAACA CTCCTGTTAT CATGGGTTCT GCTTTGATGG 600 

CCTTGCAAGG TAAACAACCA GAAATTGGTG AACAAGCTAT CATGAAATTG 650 

30 ATGGACGCTA TTGATGAACA CATTCCAACC CCAACCAGAG ACTTGGAACA 700 

ATCTTTCTTG ATGCCAGTTG AAGATGTTTT CTCCATTTCT GGTAGAGGTA 750 

CTGTTGTTAC TGGTAGAGTC GAAAGAGGTG TCTTAAAGAA GGGTGAAGAA 800 

ATTGAAATTG TTGGTGGTTT CGAAAAACCA TTCAAGACCA CTGTTACTGG 850 

TATTGAAATG TTCAAGAAAG AATTAGATGC TGCTATGGCT GGTGACAACT 900 

35 GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG ACGAAATCAA GAGAGGTATG 950 

GTTTTGGCTA AACCAGGTAC TGCTACTTCC CACAAGAAAT TCTTGGCTTC 1000 

CATGTATATC TTAACTGCTG AAGAAGGTGG TCGTTCCACT CCATTCGGTG 1050 

AAGGTTACAA GCCACAATGT TTCTTCAGAA CTAACGATGT TACCACTTCC il00 

TTCTCTTTCC CAGAAGGTGA AGGTGTTGAC CACTCCCAAA TGGTTATGCC 1150 

40 AGGTGACAAC ATTGAAATGG TCGGTGAATT GATT 1184 



45 



2) INFORMATION FOR SEQ ID NO: 1571 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

812 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1571 

CGATCACGGT AAGACCACCT TGACCGCCGC CATCACCAAG GTCTTGGCCG 50 

ACAAGGGTCA GGCTAACTTC TTGGACTACG GATCCATTGA CAGAGCCCCT 100 

GAAGAAAGAG CAAGAGGTAT CACTATCTCC ACTGCCCACG TTGAATACGA 150 

10 GACTGATAAG AGACACTATG CCCACGTTGA TTGCCCGGGC CATCAAGATT 2 00 

ATATCAAGAA TATGATCACT GGTGCTGCCC AAATGGACGG TGCTATCATT 250 

GTTGTTGCTG CTACTGACGG TCAGAT.GCCA CAAACCAGAG AACACTTGTT 300 

GTTGGCTAGA CAAGTTGGTG TCCAAGACTT GGTTGTTTTC GTTAACAAGG 350 

TTGACACTAT CGATGACCCA GAAATGTTGG AATTGGTTGA AATGGAAATG 400 

15 AGAGAATTAT TATCTTCTTA YGGCTTTGAC GGTGACAACA CCCCAGTTGT 450 

CATGGGTTCC GCTTTGATGG CTTTGCAAGG TAAGCAACCA GAAATTGGTG 500 

AACAAGCTAT TATTAAGTTG ATGGACGCTA TTGATGAACA CATYCCAACC 550 

CCAACCAGAG ACTTGGAACA ACCATTCTTG TTGCCAGTTG AAGATGTCTT 600 

TTCTATTTCC GGTAGAGGTA CCGTCGTCAC TGGTAGAGTC GAAAGAGGTG 650 

20 TCTTGAAGAA GGGTGAAGAA ATTGAAATTG TCGGTAACTT TGAAAAGCCA 700 

TTCAAGACCA CCGTTACTGG TATTGAAATG TTCAAGAAGG AATTGGATGC 750 

TGCTATGGCT GGTGACAACT GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG 800 

ACGAAATCAG CAGAGGTATG GTTTTGGCCA AGCCAGGTAC CGTCACTTCC 850 

CACAAGAAGT TCTTGGCCTC CATGTACATC TTGACTGGTG AAGAAGGTGG 900 

25 TCGTCGTACC CCATTCGGTG AAGGTTACAA GCCACAATGT TTCTTCAGAA 950 

CCAATGACAT CACCACCACT TTCACTTTCC CAGAAGGTGA AGGTGTCGAC 1000 

CACTCCCAAA TGGTTATGCC AGGTGACAAC ATCGAAATGG TTGGTGAATT 1050 

GTACAAGGCT TGTCCTTGGA A 1071 



30 



45 



2) INFORMATION FOR SEQ ID NO: 1572 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D> TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1572 



TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 50 

ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 100 

50 AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 150 

GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 200 

CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 250 

CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 300 

813 
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TCCTACGCCT GAGCGTGCCG TTGACGGTAC 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA 
5 ACCAGGGCGA AGCTGGCGAT AACGTCGGTC 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG 
GCCACACACT GACTTCGACG CCGAGGTGTA 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT 
CGTACAACTG ACGTGACCGG CACCATCGAG 
10 " GGTTCTGCCA GGCGACAACA TTTCGATGAA 
TCGCCATGGA AGAAGGT 



PCT/CAOO/01150 

GTTCCTGATG CCTGTTGAAG 350 

TTGTGACCGG TCGTATTGAG 400 

GAAATCGTGG GTATCAAAGA 4 50 

AATGTTCCGC AAACTGCTGG 500 

TGCTGCTGCG TGGTACCAAG 550 

GCCAAGCCAG GCTCGATCAA 600 

CATTCTGTCC AAAGAAGAAG 650 

ACCGTCCTCA GTTCTACTTC 700 

CTGCCAGAAG ACAAGGAAAT 750 

AGTGTCCCTG " ATCGCTCCTA 800 

817 



15 2) INFORMATION FOR SEQ ID NO: 1573 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 



30 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1573 



TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC . 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 200 

35 TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 300 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 350 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 4 00 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 

40 GTCTGTTGTA ACAGGCGTTG AAATGTTCCG TAAGATTCTT TCTGAAGGTG 500 

AAGCAGGTGA TAACGTAGGA CTTTTGCTCC GCGGTATCGA TAAGGATGAA 550 

GTAAAGCGTG GTATGGTTGT TGTACACCCA GGTGCCATCA CTCCTCACGA 600 

TCACTTCAAG GCTTCCATCT ATGTATTGAA GAAGGAAGAG GGTGGACGTC 650 

ATACTCCATT CGGAAACAAG TATCGTCCTC AGTTCTATCT CCGTACAATG 700 

45 GACTGTACAG GTGAGATCAC TTTGCCAGAA GGCGTAGAGA TGGTGATGCC 750 

TGGTGACAAC GTAGAGATTG AGGTTACCTT GATTTACAAG GTTGCC 796 



50 2) INFORMATION FOR SEQ ID NO: 1574 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 bases 

814 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1574 



GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

15 TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGAAGATAT 350 

20 CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 400 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

25 CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 
ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC ' 800 

TATGGCTGAA GGCGAGAGAT 820 

30 



2) INFORMATION FOR SEQ ID NO: 1575 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 
45 (B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1575 

GGTAGTGTCT GCAGCTGACG GCCCAATGCC ACAAACTCGT GAACATATCC 50 

50 TATTGTCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

ATGGATATGG TTGACGATGA AGAATTACTT GAATTAGTTG AAATGGAAGT 150 

ACGTGATCTA TTAACTGAAT ACGACTTCCC AGGTGATGAT ACTCCTGTTA 200 

TTTCTGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA ATATGAACAA 250 

815 
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AAAGTCTTAG ACTTAATGGC AGCTGTAGAT GATTTCATCC CAACTCCTGA 300 

ACGTGACCAT GACAAACCGT TCATGATGCC GATTGAAGAT GTTTTGTCAA 350 

TCACTGGTCG TGGAACTGTT GCTACAGGTC GTGTTGAACG TGGAACTATT 400 

AAAGTCGGTG ATGAAGTTGA TATCATCGGT ATTCATGAAA ATGTTAAAAA 450 

5 GACAACTGTT ACGGGTGTAG AAATGTTCCG TAAATTGTTG GATTACGCTG 500 

AAGCTGGCGA TAACATCGGT ACATTATTGC GTGGTGTTTC TCGTGATGAT 550 

ATCGAACGTG GTCAAGTGTT GGCTAAACCA GGCACAATCA CACCACATAC 600 

AAAATTCTCA GCTGAAGTTT ATGTATTAAC AAAAGAAGAA GGCGGACGTC 650 

ATACTCCATT CTTCTCAAAC TATCGCCCAC AATTTTACTT CCGTACAACT 700 

10 GACATCACTG GTGTCATTGA" ATTGCCAGAA GGTACTGAAA TGGTTATGCC 750 

AGGTGATAAC GTAGCAATGG AAGTTGAATT AATTCACCCT GTTGCTATTG 800 

AAA 803 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 1576 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 805 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subp. jejuni 

(B) STRAIN: ATCC 33292 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1576 

TTAATGAAGC AATTGTTGTA AATTTTGAAA GCGAAGGAAA AAAACATAAA 50 

CTTGTTTTAG AAGTAGCAGC TCATTTAGGA GATAATAGAG TTAGAACTAT 100 

TGCTATGGAT ATGACAGATG GTTTGGTAAG GGGCTTAAAA GCTGAGGCTT 150 

35 TAGGTGCTCC TATTAGTGTT CCTGTTGGTG AGAAAGTTTT AGGAAGAATT 200 

TTCAATGTTA CTGGAGATTT GATCGATGAA GGTGAAGAAA TTTCTTTTGA 250 

TAAAAAATGG GCAATTCATA GAGATCCGCC AGCTTTTGAA GATCAAAGCA 300 

CAAAAAGTGA GATTTTTGAA ACAGGGATTA AAGTTGTAGA TTTGCTTGCT 350 

CCTTATGCAA AAGGTGGTAA AGTAGGTCTT TTTGGTGGTG CAGGTGTTGG 400 

40 TAAAACTGTT ATTATTATGG AGCTTATTCA CAATGTTGCA TTTAAGCATA 450 

GCGGCTATTC TGTATTTGCA GGTGTGGGTG AGAGAACTCG TGAAGGAAAT 500 

GACCTTTATA ATGAAATGAA AGAAAGTAAT GTTTTAGACA AAGTTGCTCT 550 

ATGTTATGGA CAAATGAATG AACCACCAGG AGCAAGAAAT CGTATTGCTT 600 

TAACAGGTTT AACAATGGCT GAGTATTTTA GAGATGAAAT GGGTCTTGAT 650 

45 GTGCTTATGT TTATTGATAA TATCTTTAGA TTTTCACAAT CAGGTTCTGA 700 

AATGTCAGCA CTTTTAGGAA GAATTCCATC AGCTGTGGGT TATCAACCAA 750 

CCCTAGCAAG TGAAATGGGT AAATTCCAAG AAAGAATTAC TTCAACTAAA 800 

AAAGG 805 



2) INFORMATION FOR SEQ ID NO: 1577 



816 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Campylobacter rectus 

(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1577 

15 ATTTACCCAA GATCAACGAA GCCGTCGAAG TAAATTTCGA GGTTGAGGGC 50 

AAGCAAAACA GACTTGTGTT AGAGGTTGCC GCACACCTTG GCGATAACCG 100 

CGTAAGAACG ATCGCTATGG ATATGAGCGA AGGCTTAACC AGAGGCCTTG 150 

AGGCTACGGC TCTTGGCGCG CCTATTAGCG TTCCGGTCGG CGAAAAAGTT 200 

TTGGGAAGGA TTTTTAACGT CGTCGGCGAT CTGATCGACG AGGGTGAAGG 250 

20 CATAGAATTT GATAAAAAAT GGTCTATCCA CCGCGATCCT CCGCCGTTTG 300 

AAGAGCAAAG CACGAAGAGT GAAATTTTTG AAACGGGTAT AAAAGTGGTC 350 

GATCTTCTAG CCCCTTATGC AAAAGGCGGT AAGGTCGGAT TATTCGGCGG 400 

TGCCGGCGTC GGTAAGACGG TTATTATCAT GGAGCTTATC CACAACGTTG 450 

CATTTAAGCA TAGCGGTTAT TCCGTGTTTG CCGGCGTGGG CGAGCGAACC 500 

25 CGCGAAGGAA ACGACCTTTA TCACGAGATG AAAGAGAGTA ACGTTTTGGA 550 

CAAAGTCGCC TTGTGCTACG GCCAGATGAA CGAGCCGCCG GGAGCAAGAA 600 

ACCGCATCGC TCTAACAGGC CTAACGATGG CTGAATACTT CCGCGACGAG 650 

ATGGGACTTG ACGTTTTGAT GTTTATAGAC AACATCTTCC GTTTCTCTCA 700 

GTCTGGCGCT GAGATGTCGG CGCTTCTTGG ACGTATCCCG TCAGCCGTTG 750 

30 GTTATCAGCC GACTTTGGCG AGCGAAATGG GCAAATTCCA AGAGAGAATT 800 

ACATCAACC 809 



35 2) INFORMATION FOR SEQ ID NO: 1578 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1578 

AATCGAAGTG CAACGTTCAT TGCGTGTACT TGATGGTGCT GTAACTGTGT 50 
TGGACTCACA ATCAGGTGTA GAACCTCAAA CAGAAACAGT TTGGCGTCAA 100 
GCGACAGATT ACCGCGTACC GCGTGTTGTA TTCTGTAACA AAATGGACAA 150 

817 
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AATTGGTGCA GACTTCTTAT ACTCTGTATC AACTTTACAT GATCGTTTAC 200 

AAGCAAATGC TCACCCAATC CAATTACCAA TTGGTGCGGA AGATGACTTT 250 

ACTGGTATTA TCGACTTAGT AAAAATGAAA GCTGAAATCT ACACAAATGA 300 

CTTAGGAACT GAAATCCAAG AGACTGAAAT TCCTGAAGAA TACGTAGAAT 350 

5 TAGCTGAAGA ATGGCGCGAA AAATTAATTG AAGCTGTTGC TGATACTGAT 400 

GAAGAACTAA TGATGAAATT CTTGGAAGGT GAAGAAATCA CTGAAGAAGA 450 

ATTGAAAGCT GGTATTCGTC AAGCAACATT GACTGTTGAC TTTTTCCCTG 500 

TTCTTTGCGG ATCTGCCTTT AAAAACAAAG GGGTTCAATT GATGTTGGAT 550 

GCAGTCATCG ACTACTTGCC TTCACCACTT GATGTTCCTG CGATTAAAGG 600 

10" GATCAATCCT AAAACAGACG AAGAAACTGA TCGTCCGGCT GACGATGAAG- 650 

CACCATTTGC TTCATTAGCA TTTAAAGTAA TGACTGACCC ATTCGTARGT 700 

CGTTTGACAT TCTTYCGTGT GTATTCARGT ATCTTGAACT CTGGATCATA 750 

CGTATTGAAT GCTTCAAAAG GCAAACGCGA, ACGTATCGGT CGGATCCTAC 800 

AAATGCACGC CAACACTCGT GCTGAAATCC AAACAGTATA CTCAGGCGAT 850 

15 ATCGCCGCTG CTGTTGGTTT GAAAGACACA ACAACTGGGG ATCCACTATG 900 

TGATGAAAAA TCCCCAGTAA TCCTTGAATC AATCGAATTC CCAGAACCAG 950 

TTATCGAAGT CGCTGTTGAG CTTAAATCAA AAGCTGACCA AGATAAAATG 1000 

GGGGTTGCTT TACAAAAACT TGCTGAAGAA GATCCATCAT TCCGTGTGGA 1050 

AMCAAACGCT GAAACAGGCG AAACTGTTAT CGCTGGTATG GGAGAACTTC 1100 

20 ACTTGGACGT CTTAGTTGAC CGTATGCGTC GCGAATTTAA AGTTGAAGCA 1150 

AACGTAGGTG CGCCTCAAGT TTCTTATCGT GAAACATTCC GTGCAGCAAC 1200 

ACAAGCGGAA GGTAAATTTG TACGTCAGTC TGGTGGTAAA GGTCAATACG 1250 

GTCACGTATG GGTCGAATTT ACACCAAACG AAGAAGGTAA AGGCTTCGAA 1300 

TTCGAAAACG CGATTGTCGG TGGTGTGGTT CCTCGTGAAT ACATCCCAGC 1350 

25 AGTTGAAAAA GGACTTGAAG AATCAATGGC GAACGGTGTC TTAGCCGGTT 1400 

ACCCATTAGT AGACATCAAA GCAAAACTTT ATGATGGTTC ATACCATGAT 14 50 

GTCGATTCAA GTGAAACTGC CTTCCGTGTT GCAGCTTCTA TGGCTTTACG 1500 

TGCTGCAGCG AAGAAAGCAA ACCCAGTAAT TCTTGAACCA ATGATGAAAG 1550 

TAGTTATCAC TGTACCAGAA GATTACTTAG GTGATGTTAT GGGTCACGTA 1600 

30 ACTGCTCGTC GTGGACGCGT AGAAGGAATG GAAGCACWCG GTAACTCACA 1650 

AATCGTGAAC GCAATCGTGC C 1671 



35 2) INFORMATION FOR SEQ ID NO: 1579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1579 

GAAGTACAAC GTTCACTACG GGTTCTTGAC GGWGCTGTAA CAGTATTGGA 50 
CTCACAATCT GGTGTAGAAC CACAAACTGA AACAGTTTGG CGTCAAGCGA 100 
CAGATTACCG CGTACCACGT ATCGTATTCT GTAACAAAAT GGATAAAATC 150 

818 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GGTGCAGACT 
AAATGCTCAT 
GTATCATCGA 
GGAACAGAGA 
5 TGAAGAATGG 
AGCTAATGAT 
AAAGCTGGTA 
TTGTGGTTCA 
TCATCGACTA 

10- AATCCTAAAA 
TTTTGCTTCA 
TGACATTCTT 
TTGAATGCTT 
GCACGCGAAC 

15 CTGCAGCTGT 
AAAAATCACC 
CGAAGTAGCT 
TGGCTTTACA 
AACGCTGAAA 

20 GGACGTCCTT 
TTGGTGCGCC 
GCCGAAGGTA 
CGTATGGATC 
AAAACGCAAT 

25 GAAAAAGGAC 
ATTGGTTGAC 
ATTCAAGTGA 
GCAGCGAAGA 
TATCACTGTT 

30 CTCGTCGTGG 
GTTAACGCGA 



TCTTATACTC 
CCAATCCAAT 
TCTAGTAAAA 
TTCAAGAAAC 
CGCGAAAAAT 
GAAATTCTTG 
TCCGTCAAGC 
GCCTTCAAAA 
CCTTCCTTCA 
CTGACGAAGA 
CTAGCATTTA 
CCGTGTGTAT 
CAAAAGACAA 
ACTTGTGCAG 
TGGTTTGAAA 
CAGTAATCCT 
GTTGAACYTA 
AAAACTTGCT 
CTGGCGAAAC 
GTTGACCGTA 
TCAAGTTTCT 
AATTTGTACG 
GAATTTACAC 
TGTCGGTGGT 
TTGAAGACTC 
ATCAAAGCCA 
AACAGCCTTC 
AAGCTAATCC 
CCTGAAGATT 
ACGTGTTGAA 
TT 



TGTATCTACT 
TACCAATTGG 
ATGAAAGCTG 
TGAAATTCCT 
TAGTTGAAGC 
GAAGGTGAAG 
GACATTGACT 
ACAAAGGGGT 
CCACTTGATG 
AAGTGATCGT 
AAGTAATGAC 
TCAGGTGTCT 
ACGCGAACGT 
AAATCCAAAC 
GATTCCACAA 
TGAATCAATC 
AATCAAAAGC 
GAAGAAGATC 
TGTTATCGCA 
TGCGTCGCGA 
TACCGCGAAA 
TCAGTCTGGT 
CAAACGAAGA 
GTGGTTCCAC 
AATGGCTAAC 
AGCTTTACGA 
CGTGTGGCAG 
AGTGATTCTT 
ACTTAGGTGA 
GGTATGGAAG 



TTACATGATC 
TGCGGAAGAT 
AGATCTATAC 
GAAGAGTACA 
TGTTGCAGAT 
AAATCACTGA 
GTTGAATTTT 
TCAATTGATG 
TTCCTGCAAT 
CCTGCTGACG 
TGACCCATTC 
TGAACTCTGG 
ATCGGTCGTA 
AGTTTATTCA 
CAGGGGATAC 
GAATTCCCAG 
TGACCAAGAT 
CTTCATTCCG 
GGGATGGGTG 
ATTTAAAGTT 
CATTCCGTGC 
GGTAAAGGTC 
AGGTAAAGGC 
GTGAATACAT 
GGTGTTCTAG 
TGGTTCATAC 
CTTCAATGGC 
GAACCAATGA 
TATTATGGGA 
CGCATGGTAA 



GCTTGCAAGC 
GACTTTACTG 
AAACGATTTA 
AAGAATTAGC 
ACTGACGAAG 
AGAAGAATTG 
TCCCAGTTCT 
TTGGATGCAG 
CAAAGGGATC 
AT-GAAGCGGC 
GTAGGTCGTT 
ATCATATGTC 
TTCTGCAAAT 
GGCGATATCG 
ATTGTGTGCG 
AMCCAGTTAT 
AAAATGGGTG 
TGTAGAAMCA 
AACTTCACTT 
GAAGCAAACG 
TTCTACGCAA 
AATACGGTCA 
TTCGAATTCG 
CCCAGCAGTT 
CTGGTTATCC 
CATGATGTCG 
TTTACGTGCT 
TGAAAGTTGT 
CACGTAACTG 
CTCACAAATC 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1662 



35 2) INFORMATION FOR SEQ ID NO: 1580 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1580 

CAATCGAAGT ACAACGTTCT CTTCGTGTAT TGGATGGTGC TGTTACCGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TATGGAGTTC CACGTATCGT ATTTGCCAAC AAAATGGACA 150 
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AAATCGGTGC 
CAAGCAAATG 
CCGTGGTATC 
ACCTTGGTAC 
5 CAAGCTCAAG 
CGAAGAATTG 
AATTGAAAGC 
GTATTGTGTG 
TGCGGTTATT 

10 GTATCAACCC 
GAGCCATTTG 
TCGTTTGACA 
ACGTATTGAA 
CAAATGCACG 

15 TATCGCTGCT 
CAGATGAAAA 
GTTATCCAAT 
GGGTATCGCC 
AAMCAAACGT 

20 CACCTTGACG 
AAACGTAGGT 
CTCAAGCACG 
GGTGATGTAT 
ATTCGAAAAC 

25 • CGGTTGAAAA 
TACCCAATGG 
TGTCGACTCA 
AAGAAGCTGC 
GTAACAATCA 

30 AACTGCTCGT 
AAATCGTTCG 



TGACTTCCTT 
CACACCCAAT 
ATCGACTTGA 
AGATATCCTT 
AATACCGTGA 
ATGATGAAAT 
TGGTATCCGT 
GCTCTGCCTT 
GACTACCTTC 
AGATWCAGAT 
CAGCTCTTGC 
TTCTTCCGTG 
CACTTCTAAA 
CCAACAGCCG 
GCCGTTGGTT 
AGCTAAAATC 
TGATGGTTGA 
CTTCAAAAAT 
TGAAACTGGT 
TCCTTGTTGA 
GCTCCTCAAG 
TGGATTCTTC 
GGATTGAATT 
GCAATCGTCG 
AGGTTTGGTA 
TTGACGTTAA 
TCTGAAACTG 
TAAATCAGCA 
CTGTTCCAGA 
CGTGGACGTG 
TGCTTACGT 



TACTCTGTAA 
CCAATTGCCA 
TCAAGATGAA 
GAAGAAGACA 
AAAATTGATC 
ACCTCGAAGG 
AAAGCGACTA 
CAAGAACAAA 
CAAGCCCACT 
GAAGAAGAAA 
CTTCAAGATC 
TTTACTCARG 
GGKAAACGTG, 
TCAAGAAATT 
TGAAAGATAC 
ATTCTTGAGT 
GCCAAAATCT 
TGGCTGAAGA 
GAAMCAGTTA 
CCGTATGCGT 
TATCTTACCG 
AAACGTCAGT 
TACTCCAAAC 
GTGGTGTGGT 
GAATCTATGG 
AGCTAAGCTT 
CCTTCAAGAT 
CAACCAGCTA 
AGAAAACCTT 
TAGATGGTAT 



GCACACTTCA 
ATCGGTTCTG 
AGCTGAAATC 
TCCCAGCTGA 
GAAGCAGTTG 
TGAAGAAATC 
TCAACGTTGA 
GGTGTTCAAT 
TGACATCCCA 
CTCGTCCAGC 
ATGACAGACC 
TGKTCTTCAA 
AGCGTATCGG 
GACACTGGTT 
TCCAACTGGT 
CAATCAACGT 
AAAGCTGACC 
AGATCCAACA 
TCTCTGGTAT 
CGTGAGTTCA 
TGAAACATTC 
CTGGTGGTAA 
GAAGAAGGTA 
TCCTCGTGAA 
CTAACGGTGT 
TACGATGGTT 
TGCGGCTTCA 
TCCTTGAGCC 
GGTGATGTTA 
GGAAGCACWC 



CGATCGTCTT 
AAGATGACTT 
TATACTAACG 
ATACCTTGAC 
CTGAAACTGA 
ACTAACGAAG 
ATTCTTCCCA 
TGATGCTTGA 
GCGATCAAAG 
ATCTGATGAA 
CATTCGTAAG 
TCAGGTTCAT 
ACGTATCCTT 
ACTCAAGAGA 
GACTCATTGC 
TCCAGAMCCA 
AAGCCAAGAT 
TTCCGCGTTG 
GGGTGAGCTT 
AAGTTGAAGC 
CGCGCTTCTA 
AGGTCAATTC 
AAGGATTCGA 
TTTATCCCAG 
ACTTGCAGGT 
CATACCACGA 
CTTGCTCTTA 
AATGATGCTT 
TGGGTCACGT 
GGTAACAGCC 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1669 



35 2) INFORMATION FOR SEQ ID NO: 1581 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1581 

AATTGGGGAC TACACCTATT ATGATG 26 



50 



2) INFORMATION FOR SEQ ID NO: 1582 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1582 
10- -GGCAAATCAG TCAGTTCAGG AGT . . - . . . 23. 



2) INFORMATION FOR SEQ ID NO: 1583 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1583 

25 

CGATTGGCAA CAATACACTC CTG 23 



30 2) INFORMATION FOR SEQ ID NO: 1584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1584 

TCACCTATTT TTACGCCTGG TAGGAC 26 



45 

2) INFORMATION FOR SEQ ID NO: 1585 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 645 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (C) ACCESSION NUMBER: AF139725 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1585 





ATGACTATAC 


CTGACGCAAA 


TGCAATCTAT 


CCTAACTCAG 


CCATCAAAGA 


50 


10 


" GGTTGTCTTT 


ATCAAGAACG" 


TGATCAAAAG 


TCCCAATATT- 


GAAATTGGGG - 


100 




ACTACACCTA 


TTATGATGAC 


CCAGTAAATC 


CCACCGATTT 


TGAGAAACAC 


150 




GTTACCCATC 


ACTATGAATT 


TCTAGGCGAC 


AAATTAATCA 


TCGGTAAATT 


200 




TTGTTCTATC 


GCCAGTGGCA 


TTGAATTTAT 


CATGAACGGT 


GCCAACCACG 


250 




TAATGAAAGG 


TATTTCGACT 


TATCCATTTA 


ATATTTTAGG 


TGGCGATTGG 


300 


15 


CAACAATACA 


CTCCTGAACT 


GACTGATTTG 


CCGTTGAAAG 


GTGATACTGT 


350 




AGTCGGAAAT 


GACGTGTGGT 


TTGGGCAAAA 


TGTGACCGTC 


CTACCAGGCG 


400 




TAAAAATAGG 


TGACGGTGCC 


ATTATCGGAG 


CAAATAGTGT 


TGTAACAAAA 


450 




GACGTCGCTC 


CATATACAAT 


TGTCGGTGGC 


AATCCAATTC 


AACTCATCGG 


500 




ACCAAGATTT 


GAACCGGAAG 


TTATTCAAGC 


ATTAGAAAAT 


CTGGCATGGT 


550 


20 


GGAATAAAGA 


TATTGAATGG 


ATAACTGCTA 


ATGTTCCTAA 


ACTAATGCAA 


600 




ACAACACCCA 


CACTTGAATT 


GATAAACAGT 


TTAATGGAAA 


AATAA 


645 



25 2) INFORMATION FOR SEQ ID NO: 1586 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1586 

ATTCCCACAA TCTTTTTTAT C AATAA 26 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1587 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1587 

CATTGTTCAG ATTCGGTAAA GTTC 24 
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2) INFORMATION FOR SEQ ID NO: 1588 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
" (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1588 
GTTTTTGAAG TTAAATAGTG TTCTT , 25 



2) INFORMATION FOR SEQ ID NO: 1589 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1589 
CTTCCATTTG TACTTTCCCT A 21 



2) INFORMATION FOR SEQ ID NO: 1590 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cloning vector pFW16 
(C) ACCESSION NUMBER: U50983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1590 

ATGAAAATTA TTAATATTGG AGTTTTAGCT CATGTTGATG CAGGAAAAAC 50 

TACCTTAACA GAAAGCTTAT TATATAACAG TGGAGCGATT ACAGAATTAG 100 

GAAGCGTGGA CAAAGGTACA ACGAGGACGG ATAATACGCT TTTAGAACGT 150 

CAGAGAGGAA TTACAATTCA GACAGGAATA ACCTCTTTTC AGTGGGAAAA 200 

TACGAAGGTG AACATCATAG ACACGCCAGG ACATATGGAT TTCTTAGCAG 250 

823 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AAGTATATCG TTCATTATCA GTTTTAGATG 
GCAAAAGATG GCGTACAAGC ACAAACTCGT 
GAAAATGGGG ATTCCCACAA TCTTTTTTAT 
GAATTGATTT ATCAACGGTT TATCAGGATA 
5 GAAATTGTAA TCAAACAGAA GGTAGAACTG 
GAACTTTACC GAATCTGAAC AATGGGATAC 
ACCTTTTAGA GAAATATATG TCCGGTAAAT 
GAACAAGAGG AAAGCATAAG ATTTCAGAAT 
TCATGGAAGT GCAAAAAGTA ATATAGGGAT 

10 " TtACTAATAA" ATTTTATTCA TCAACACATC 
GGAAATGTTT TCAAAATTGA ATATACAAAA 
TATACGCCTT TATAGTGGAG TACTACATTT 
CAGAAAAGGA AAAAATAAAA GTTACAGAAA 
GAATTATGTA AGATTGATAG AGCTTATTCT 

15 AAATGAGTTT TTGAAGTTAA ATAGTGTTCT 
CACAGAGAAA AAAGATTGAA AATCCGCACC 
GAACCGAGTA AACCTGAACA GAGAGAAATG 
AATCTCAGAT AGTGATCCGC TTCTACGATA 
ATGAAATTAT ACTTTCTTTC TTAGGGAAAG 

20 GCACTGTTGC AAGAAAAGTA TCATGTGGAG 
AGTCATTTAT ATGGAGAGAC CGTTAAAAAA 
TCGAAGTGCC GCCAAATCCT TTCTGGGCTT 
CCGCTTCCGT TGGGAAGTGG AATGCAGTAT 
ATACTTAAAT CAATCATTTC AAAATGCAGT 

25 GTTGTGAACA AGGATTGTAT GGTTGGAATG 
TTTAAGTATG GCTTATACTA TAGCCCTGTT 
GATGCTTGCT CCTATTGTAT TGGAACAAGT 
AATTGTTAGA GCCATATCTT AGTTTTAAAA 
CTTTCACGAG CATACAACGA TGCTCCTAAA 

30 CACTCAATTG AAAAATAATG AGGTCATTCT 
GGTGTATT.CA AGAATATCGT AGTGATTTAA 
AGTGTTTGTT TAACAGAGTT AAAAGGGTAC 
TGTTTGCCAG CCCCGTCGTC CAAATAGTCG 
TGTTCAATAA AATAACTTAG 

35 
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GGGCAATTCT ACTGATTTCT 300 

ATATTATTTC ATGCACTTAG 350 

CAATAAGATT GACCAAAATG 4 00 

TTAAAGAGAA ACTTTCTGCC 4 50 

TATCCTAATG TGTGTGTGAC 500 

GGTAATAGAG GGAAACGATG 550 

CATTAGAAGC ATTGGAACTC 600 

■TGTTCTCTGT TCCCTCTTTA 650 

TGATAACCTT ATAGAAGTTA 700 
GAGGTCCGTC TGAACTTTGC - 750 

AAAAGACAAC GTCTTGCATA 800 

ACGAGATTCG GTTAGAGTAT 850 

TGTATACTTC AATAAATGGT 900 

GGAGAAATTG TTATTTTGCA 950 

TGGAGATACA AAACTATTGC 1000 

CTCTACTACA AACAACTGTT 1050 

TTGCTTGATG CCCTTTTGGA 1100 

TTACGTGGAT TCTACGACAC 1150 

TACAAATGGA AGTGATTAGT 1200 

ATAGAAATAA CAGAGCCTAC 1250 

TGCAGAATAT ACCATTCACA 1300 

CCATTGGTCT ATCTGTATCA 1350 

GAGAGCTCGG TTTCTCTTGG 1400 

TATGGAAGGG ATACGCTATG 14 50 

TGACGGACTG TAAAATCTGT 1500 

AGTACCCCAG CAGATTTTCG 1550 

CTTAAAAAAA GCTGGAACAG 1600 

TTTATGCGCC ACAGGAATAT 1650 

TATTGTGCGA ACATCGTAGA 1700 

TAGTGGAGAA ATCCCTGCTC 1750 

CTTTCTTTAC AAATGGACGT 1800 

CATGTTACTA CCGGTGAACC 1850 

GATAGATAAA GTACGATATA 1900 

1920 



2) INFORMATION FOR SEQ ID NO: 1591 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1591 
50 ATGAGGTAAT AGAACGGATT 20 



2) INFORMATION FOR SEQ ID NO: 1592 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi-) SEQUENCE DESCRIPTION: SEQ ID NO: 1592 
CAGTATTTCA GTAAGCGTAA A 



2) INFORMATION FOR SEQ ID NO: 1593 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1593 
CCGAGCGATT TACCGGATAC TTGGCTGCIC GCTCGG 



2) INFORMATION FOR SEQ ID NO: 1594 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: N97-330 

(C) ACCESSION NUMBER: AF175293 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1594 

ATGTTTAGAA TTAAAGTTGC AGTTCTGTTT GGGGGCTGTT CAGAGGAACA 50 

TAATGTTTCG ATAAAATCTG CGATGGAGAT TGCCGCAAAC ATAGATACAA 100 

AAAAGTATCA GCCTTATTAT ATTGGAATCA CAAAATCCGG CGTTTGGAAA 150 

ATGTGTGAAA AACCTTGTTT GGAGTGGGAA CAATATGCGG GGGATCCGGT 200 

TGTTTTTTCA CCGGACAGAA GTACGCATGG TCTGCTGATA CAAAAAGACA 250 
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CTGGGTATGA AATCCAGCCT GTGGATGTGG GATTACCGAT GATTCATGGC 300 

AAGTTTGGCG AGGATGGCTC CATACAAGGC TTGCTTGAAT TGTCAGGCAT 350 

TCCGTATGTG GGATGCGATA TTCAAAGCTC CGTGACCTGC ATGGATAAGG 400 

CGCTTGCATA TACCGTTGTG AAAAATGCGG GTATCGCTGT GCCTGGGTTC 450 

5 CGGATCCTTC AGGAGGGGGA TCGCCTGGAA ACAGAGGATT TCGTATATCC 500 

CGTTTTTGTA AAGCCTGCCC GTTCCGGCTC ATCCTTTGGC GTAAACAAGG 550 

TATGCAAGGC AGAAGAACTG CAGGCAGCAA TCGAAGATGC AAGAAAATAT 600 

GACAGCAAGA TTTTGATTGA AGAGGCCGTT ACCGGGAGTG AGGTAGGCTG 650 

CGCCATACTG GGAAACGGAA ATGATCTCAT GGCTGGCGAG GTGGATCAGA 700 

10" TTGAGCTGAG ACACGGCTTT TTTAAGATTC A-TCAGGAAGC -ACAGCCGGAG - 750 

AAAGGATCTG AAAATGCAGT CATCCGAGTT CCAGCCGCCT TACCGGATGA 800 

GGTAATAGAA CGGATTCAGA AAACAGCAAT GAAGATTTAC CGGATACTTG 850 

GCTGCAGAGG ATTGGCCCGC ATTGACCTGT TTTTGCGGGA GGACGGCTGC 900 

ATTGTGTTGA ATGAAGTGAA TACCATGCCG GGTTTTACTT CCTACAGCCG 950 

15 TTATCCCCGC ATGATGACAG CAGCCGGTTT TACGCTTACT GAAATACTGG 1000 

ATCGCTTGAT TGAACTTTCA CTTAGGAGGT AA 1032 



20 2) INFORMATION FOR SEQ ID NO: 1595 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1595 

AAATAATGCT CCATCAATTT GCTGA 25 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1596 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1596 
ATAGTCGAAA AAGCCATCCA CAAG 24 

2) INFORMATION FOR SEQ ID NO: 1597 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
'(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1597 
GATGAATTTG CGAAAATACA TGGA 24 



15 2) INFORMATION FOR SEQ ID NO: 1598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1598 

CAGCCAATTT CTACCCCTTT CAC 23 



2) INFORMATION FOR SEQ ID NO: 1599 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: BM4405 

(C) ACCESSION NUMBER: AF136925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1599 

CAGGGAGTAT TTGAGTTATT AGATATTCCA TATGTAGGTT GTGGTATCGG 50 

AGCTGCAGCA ATTTCCATGA ATAAAATAAT GCTCCATCAA TTTGCTGAAA 100 

50 TAGTTGGTGT AAAAAGCACT CCTAGTATGA TTATAGAAAA GGGACAAGAC 150 

CTACAAAAAG TCGATGAATT TGCGAAAATA CATGGATTTC CTTTATATAT 200 

CAAACCGAAT GAGGCAGGCT CATCAAAAGG AATTAGCAAG GTAGAACAAA 250 

AAAGTGATTT ATATAAAGCA ATAGACGAAG CTTCAAAATA TGATAGCCGT 300 
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ATTTTAATTC AAAAGGAAGT 
AGGGAATGAA CAATTGGTCG 
ATGGCTTTTT CGACTATGAA 
TTGTTACCAG CTAAACTATC 
AGCAAAAAAA CTATACAGAC 
ACTTTTTCTT AACGGATGAC 
CTCC 



GAAAGGGGTA GAAATTGGCT GTGGGATTTT 
TTGGAGAATG TGATCAAATT AGTCTTGTGG 
GAGAAATACA ATTTAGTAAC AGCAGAAATT 
AATAGACAAA AAAGAAGACA TCCAGATAAA 
TATTAGGGTG CAAAGGATTA GCGAGAATCG 
GGAGAAATTT TATTAAATGA GATCAACACC 



350 
400 
450 
500 
550 
600 
604 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1600 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDE DNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campyi obac ter jejuni subsp. doylei 

(B) STRAIN: ATCC 49349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1600 

ACCTCAAATT AATGAAGCAA TTGTTGTAAA TTTTGAAAGC GAAGGAAAAA 50 

AACATAAACT TGTTTTAGAA GTAGCAGCTC ATTTAGGAGA TAATAGAGTT 100 

AGAACTATTG CTATGGATAT GACAGATGGT TTGGTAAGGG GCTTAAAAGC 150 

TGAGGCTTTA GGTGCTCCTA TTAGTGTTCC TGTTGGTGAG AAAGTTTTAG 200 

GAAGAATTTT CAATGTTACT GGAGATTTGA TCGATGAAGG TGAAGAAATT 250 

CCTTTTGATA AAAAATGGGC AATTCATAGA GATCCGCCAG CTTTTGAAGA 300 

TCAAAGCACA AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTAGATT 350 

TACTTGCTCC TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA 400 

GGTGTTGGTA AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT 450 

TAAGCATAGC GGCTATTCTG TATTTGCAGG TGTGGGTGAG AGAACTCGTG 500 

AAGGAAATGA CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA 550 

GTTGCTCTAT GTTATGGACA AATGAATGAA CCACCAGGAG CAAGAAATCG 600 

TATTGCTTTA ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG 650 

GTCTTGATGT GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA 700 

GGTTCTGAAA TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA 750 

TCAACCAACC CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT 800 

CAACT 805 



2) INFORMATION FOR SEQ ID NO: 1601 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Enterococcus svlfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1601 

10 CTCCTTACCA GATATCAACA- ATGCGCTGGT TGTATATAAA -AATGATGCCA - 50 

ATAAAACAAA AGTTGTTTTA GAAGCAGCGT TAGAATTAGG TGACGGAATC 100 

ATTCGTGCGA TTGCAATGGA ATCGACTGAT GGATTGCAAC GTGGGATGGA 150 

AGTTGTCGAT ATGGGAGAAT CTATTTCTGT, ACCAGTTGGA ACAGAAACTT 200 

TAGGACGTGT GTTTAATGTA TTAGGAGATA CGATTGACTT AGAAGCTCCT 2 50 

15 TTTCCTAAAG ATGCACCGCG TAGTGGAATC CATAAAAAAG CCCCTAATTT 3 00 

TGATGAATTA AGTACAAGCA CAGAAATTCT AGAAACAGGG ATCAAAGTCA 350 

TCGATCTATT AGCCCCTTAT TTAAAAGGTG GTAAAGTTGG ATTGTTTGGG 40 0 

GGAGCCGGTG TAGGAAAAAC GGTATTGATT CAAGAGTTGA TTCACAATAT 450 

CGCACAAGAA CACGGTGGGA TTTCGGTCTT TACTGGTGTT GGTGAGCGTA 500 

20 CACGTGAAGG AAACGATTTG TATTATGAAA TGAAAGATTC AGGAGTCATT 550 

GAAAAAACAG CGATGGTGTT TGGACAAATG AATGAGCCAC CTGGTGCACG 600 

GATGCGTGTA GCCTTGACTG GATTAACGAT TGCTGAATAT TTCCGTGATG 650 

TTGAAGGACA GGATGTACTG TTGTTTATCG ACAATATCTT CCGTTTTACA 700 

CAAGCTGGTT CTGAAGTGTC TGCCTTGTTA GGTCGTATGC CATCAGCTGT 750 

25 GGGATATCAA CCAACATTAG CTACCGAAAT GGGGCAATTG CAAGAGCGGA 800 

TCACGTCAAC GAAAAAAGGA TCGATC 826 



30 2) INFORMATION FOR SEQ ID NO: 1602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 49428 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1602 



TGATACTTTA CCAGATATTA ATAATGCATT AGTAGTATAT AAAAAGGACG 50 

AGGACAAGAC ACGCGTTGTC TTAGAAGCCA CCTTGGAACT TGGAGATGGC 10 0 

ATGATTCGTG CAATCTCTAT GGGATCGACT GATGGCTTGC AACGGGGAAT 150 

GGAAGTTGTG GACACACAAG CACCTATTTC TGTTCCAGTA GGAAATGAAA 200 

50 CCTTAGGACG TGTTTTTAAT GTCTTAGGAG AAACGATTGA TAAACAGGCA 250 

CCGTTTCCTG AAGATGCCAA AAAAAGTGGT ATTCATAAAA AAGCACCCGC 3 00 

TTTTGATGAA TTAAGTACCA GTTCTGAAAT ATTGGAAACC GGGATTAAAG 350 

TAATCGATTT GCTAGCTCCT TATTTACGAG GTGGTAAAGT TGGATTATTT 40 0 
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GGCGGTGCTG GCGTGGGTAA 
CGTTGCCCAA GAACATGGGG 
GTACTCGTGA AGGAAATGAC 
ATTGAAAAAA CGGCTATGGT 
5 ACGTATGCGT GTAGCGTTAA 
ATGTACAAGG TCAAGACGTA 
ACTCAAGCAG GAACAGAAGT 
CGTTGGTTAC CAACCAACTC 
GAATCACATC GACAGATAAA 

10 



AACGGTATTA ATTCAAGAAT 
GAATTTCTGT TTTTACGGGT 
CTATATTATG AAATGCAGGA 
ATTTGGACAA ATGAACGAAC 
CTGGTTTGAC ACTTGCTGAG 
TTATTATTTA TAGATAATAT 
ATCTGCTTTA TTAGGACGGA 
TAGCAACGGA AATGGGACAG 
GGATCAATTA CCT 



TAATTCATAA 


450 


GTCGGAGAGC 


500 


TTCAGGCGTT 


550 


CCCCTGGTGC 


600 


TACTTCCGTG 


650 


TTTCCGCTTT 


700 


TGCCGTCTGC 


750 


TTGCAAGAAC 


800 
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2) INFORMATION FOR SEQ ID NO: 1603 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter sputorum subsp. sputorum 
25 (B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1603 

ATCAGCCTAA AATTAACGAA GCAATAGAAG TTAATTATGA ATTAGATGGT 50 

30 AAAAAAATAA GACTTATTCT TGAAGTAGCT GGACATCTTG GCGATAATAG 100 

AGCAAGAACC ATTGCTATGG ATATGAGTGA TGGTTTACAA AGAGGATTAG 150 

AAGTTACGGC TCTTGGTGCT CCTATAACAG TTCCTGTTGG AGATAAAGTT 200 

TTAGGTAGAA TGTTTAATGT TGTAGGTGAC TTAATAGATG AAGGTGAAGT 250 

AACAGATTTT GATAAAAGAT GGGCTATCCA TAGAGATCCT CCTTCGTTTG 300 

35 AAGATCAAAG TACAAAAAGT GAAATTTTTG AAACAGGTAT AAAAGTAGTT 350 

GATCTTCTTG CACCTTATGC AAAAGGTGGT AAAGTTGGCT TATTTGGTGG 4 00 

TGCTGGCGTT GGAAAAACAG TTATCATAAT GGAGCTTATA CATAATGTTG 450 

CATTTAAACA CAGCGGTTAT TCAATTTTTG CCGGTGTTGG AGAGAGAACA 500 

AGAGAGGGAA ATGATCTTTA TAATGAGATG AAAGAGTCTG GTGTTTTGGA 550 

40 TAAAGTTGCC TTATGTTATG GACAAATGAA TGAACCACCA GGAGCAAGAA 600 

ACCGTATAGC ATTAACAGGT CTTACAATGG CTGAATATTT CCGTGATGAA 650 

ATGGGGCTTG ATGTGTTGAT GTTTATAGAT AATATTTTTA GATTTTCTCA 700 

ATCAGGTTCT GAAATGTCAG CGCTGCTTGG TAGAATTCCA TCTGCTGTTG 750 

GTTATCAACC AACATTAGCA AGTGAGATGG GAAAACTTCA AGAAAGAATT 800 

45 ACTTCC 806 



50 



2) INFORMATION FOR SEQ ID NO: 1604 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 4 9372 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1604 

GGTGTTATCC GCACAATCGC TATGGAATCT ACAGATGGAT TGCAACGGGG 50 

GATGGAAGTT GTCGATACCG GCAAACCAAT, CTCTGTTCCT GTAGGTAAAG 100 

AAACATTAGG TCGTGTGTTT AACGTATTAG GTGAAACGAT CGACAAAGAA 150 

15 GCACCTTTTC CAGAAGATGT AGAAAAGAGC GGTATTCACA AAAAGGCCCC 200 

CGCTTTTGAA GACCTTAGCA CCAGTAATGA GATTTTAGAA ACTGGGATCA 250 

AGGTTATCGA CTTATTAGCC CCTTACTTAA AAGGTGGTAA AGTTGGACTA 300 

TTCGGTGGTG CCGGTGTTGG TAAAACCGTC TTAATTCAAG AACTGATTCA 350 

TAATATCGCC CAAGAACACG GTGGGATTTC TGTCTTTACC GGGGTTGGGG 400 

20 AACGGACTCG TGAAGGGAAC GACCTTTATT ATGAAATGAA AGAATCCGGC 450 

GTTATTGAAA AAACAGCGAT GGTCTTCGGA CAAATGAATG AGCCACCAGG 500 

TGCGCGGATG CGCGTTGCCT TGACTGGTTT GACATTAGCT GAATATTTCC 550 

GTGATGAAGA AGGTCAAGAT GTGTTGCTAT TTATCGATAA CATTTTCCGC 600 

TTCACACAAG CCGGATCAGA AGTTTCGGCG CTATTAGGTC GGATGCCATC 650 

2 5 TGCCGTTGGT TATCAACCAA CCTTGGCAAC AGAAATGGGT CAATTACAAG 700 

AACGAATCAC TTCAACGAAA AAAGGCTCAA TTACATCG 738 



30 2) INFORMATION FOR SEQ ID NO: 1605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1605 

ATYATYGAAR TITAYGCICC 20 
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2) INFORMATION FOR SEQ ID NO: 1606 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



831 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1606 
5 CCRAACATIA YICCIACTTT TTC 23 



10 



2) INFORMATION FOR _ SEQ ID NO: 1607 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1607 



25 


CTGGATTATG 


CGATGTCGGT 


CATTGTTGGC 


CGTGCGCTGC 


CGGATGTCCG 


50 




AGATGGCCTG 


AAACCGGTAC 


ACCGTCGCGT 


ACTTTACGCC 


ATGAACGTAT 


100 




TGGGCAATGA 


CTGGAACAAA 


GCCTATAAAA 


AATCCGCCCG 


TGTCGTTGGT 


150 




GACGTAATCG 


GTAAATACCA 


CCCTCATGGT 


GATACCGCCG 


TTTATGACAC 


200 




CATTGTACGT 


ATGGCACAGC 


CATTCTCCTT 


GCGTTATATG 


CTGGTCGATG 


250 


30 


GCCAGGGTAA 


CTTCGGTTCT 


GTCGATGGCG 


ACTCCGCCGC 


AGCGATGCGT 


300 




TATACGGAAA 


TCCGTATGTC 


GAAAATCGCC 


CACGAG 




336 



35 2) INFORMATION FOR SEQ ID NO: 1608 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1608 

CTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG CTGCCGGATG 50 
TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTATA CGCCATGAAC 100 
GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCTG CCCGTGTCGT 150 
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GGGTGACGTC ATCGGTAAAT ACCACCCTCA TGGTGATACT GCCGTATACG 
ACACCATTGT ACGTATGGCG CAGCCATTCT CCCTGCGTTA CATGCTGGTA 
GATGGCCAGG GTAACTTTGG TTCGGTCGAC GGCGACTCCG CCGCAGCGAT 
GCGTTATACG GAAATCCGTA TGTCGAAGAT CGCCCATGAA C 



200 
250 
300 
341 



2) INFORMATION FOR SEQ ID NO: 1609 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1609 
GCCCTGATCC AAATAGCATA TA 22 



2) INFORMATION FOR SEQ ID NO: 1610 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1610 
CCTGGCATAA CAGTAACATT CTG 23 



2) INFORMATION FOR SEQ ID NO: 1611 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1611 
TGGGAAAAAG CAACTCCATC TC 22 
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2) INFORMATION FOR SEQ ID NO: 1612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

_(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1612 
ACAACTGAAT TCGCAGCAAC AAT 23 



2) INFORMATION FOR SEQ ID NO: 1613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: U19459 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1613 

ATGAAATATG GCCCTGATCC AAATAGCATA TATCCACATG AAGAAATAAA 50 

AAGTGTTTGT TTTATTAAAA ATACAATTAC CAATCCAAAT ATTATAGTTG 100 

GAGATTATAC TTACTATTCC GATGTTAACG GAGCTGAAAA ATTTGAAGAA 150 

CATGTGACAC ATCATTATGA ATTTAGGGGT GATAAACTTG TAATTGGCAA 200 

GTTTTGTGCA ATAGCTGAAG GTATAGAATT TATTATGAAT GGAGCAAACC 250 

ATAGAATGAA TTCAATAACA ACTTATCCTT TTAATATAAT GGGAAATGGT 300 

TGGGAAAAAG CAACTCCATC TCTTGAAGAT TTACCATTTA AGGGAGATAC 350 

TGTTGTTGGA AATGATGTGT GGATTGGTCA GAATGTTACT GTTATGCCAG 400 

GAATTCAAAT AGGAGATGGA GCAATTGTTG CTGCGAATTC AGTTGTTACA 450 

AAAGATGTAC CACCATATCG TATTATTGGT GGAAATCCGA GTAGAATTAT 500 

AAAGAAAAGG TTTGAAGATG AATTGATAGA TTATTTATTG CAAATAAAAT 550 

GGTGGGATTG GTCAGCACAA AAAATATTTT CTAATCTTGA AACACTTTGT 600 

AGCTCTGATT TAGAGAAAAT AAAATCTATT C GAG ATT AG 639 



2) INFORMATION FOR SEQ ID NO: 1614 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1614 
CCAATCCAGA AGAAATATAC CC 



2) INFORMATION FOR SEQ ID NO: 1615 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1615 
ATTAGTTTAT CCCCAATCAA TTCA 



2) INFORMATION FOR SEQ ID NO: 1616 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1616 
ATAATGAATG GGGCTAATCA TCGTAT 



2) INFORMATION FOR SEQ ID NO: 1617 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1617 
GCCAACAACT GAATAAGGAT CAAC 



PCT/CA00/01150 

24 



2 ) INFORMATION FOR SEQ ID NO: 1618 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH": 639 bases " 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
(C) ACCESSION NUMBER: AF015628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1618 

ATGAAATGGC AAAATCAGCA AGGCCCCAAT CCAGAAGAAA TATACCCTAT 50 

AGAAGGTAAT AAACATGTTC AATTTATTAA ACCATCTATA ACAAAGCCCA 100 

ATATTTTAGT TGGGGAATAT TCATATTACG ATAGTAAAGA TGGTGAATCT 150 

TTTGAAAGCC AAGTTCTTTA TCACTATGAA TTGATTGGGG ATAAACTAAT 200 

ATTAGGGAAG TTTTGTTCTA TTGGACCCGG AACGACATTT ATAATGAATG 250 

GGGCTAATCA TCGTATGGAT GGTTCAACAT TTCCATTCAA TCTTTTCGGA 300 

AATGGTTGGG AGAAGCATAC CCCTACATTG GAAGACCTTC CTTATAAGGG 350 

TAACACGGAA ATTGGGAACG ATGTTTGGAT TGGACGAGAT GTGACAATTA 400 

TGCCCGGTGT AAAAATAGGA AACGGGGCTA TTATTGCAGC AAAATCGGTT 450 

GTGACAAAGA ACGTTGATCC TTATTCAGTT GTTGGCGGTA ATCCTTCACG 500 

ATTAATTAAG ATAAGGTTTT CCAAGGAAAA AATCGCAGCA TTACTAAAAG 550 

TAAGGTGGTG GGACCTAGAG ATAGAGACGA TAAATGAAAA TATTGATTGC 600 

ATCCTGAATG GT GAT AT AAA AAAGGTTAAA AGAAGTTAG 639 



2) INFORMATION FOR SEQ ID NO: 1619 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:' 1619 
AAGGCAAAAT AAAAGGAGCA AAGC 24 
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10" 



15 



2) INFORMATION FOR SEQ ID NO: 1620 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1620 
TGTACCCGAG ACATCTTCAC CAC , 23 



2) INFORMATION FOR SEQ ID NO: 1621 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1621 
AATTGAAGGA CGGGTATTGT GGAAAG 26 

30 

2) INFORMATION FOR SEQ ID NO: 1622 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1622 
45 CGATTTTGAC AGATGGCGAT AATGAA 26 



2) INFORMATION FOR SEQ ID NO: 1623 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M90056 

10 (xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 1623 

ATGAAAATAA TGTTAGAGGG ACTTAATATA AAACATTATG TTCAAGATCG 
TTTATTGTTG AACATAAATC GCCTAAAGAT TTATCAGAAT GATCGTATTG 
GTTTAATTGG TAAAAATGGA AGTGGAAAAA CAACGTTACT TCACATATTA 

15 TATAAAAAAA TTGTGCCTGA AGAAGGTATT GTAAAACAAT TTTCACATTG 
TGAACTTATT CCTCAATTGA AGCTCATAGA ATCAACTAAA AGTGGTGGTG 
AAGTAACACG AAACTATATT CGGCAAGCGC TTGATAAAAA TCCAGAACTG 
CTATTAGCAG ATGAACCAAC AACTAACTTA GATAATAACT ATATAGAAAA 
ATTAGAACAG GATTTAAAAA ATTGGCATGG AGCATTTATT ATAGTTTCAC 

20 ATGATCGCGC TTTTTTAGAT AACTTGTGTA CTACTATATG GGAAATTGAC 
GAGGGAAGAA TAACTGAATA TAAGGGGAAT TATAGTAACT ATGTTGAACA 
AAAAGAATTA GAAAGACATC GAGAAGAATT AGAATATGAA AAATATGAAA 
AAGAAAAGAA ACGATTGGAA AAAGCTATAA ATATAAAAGA ACAGAAAGCT 
CAACGAGCAA CTAAAAAACC GAAAAACTTA AGTTTATCTG AAGGCAAAAT 

25 AAAAGGAGCA AAGCCATACT TTGCAGGTAA GCAAAAGAAG TTACGAAAAA 
CTGTAAAATC TCTAGAAACC AGACTAGAAA AACTTGAAAG CGTCGAAAAG 
AGAAACGAAC TTCCTCCACT TAAAATGGAT TTAGTGAACT TAGAAAGTGT 
AAAAAATAGA ACTATAATAC GTGGTGAAGA TGTCTCGGGT ACAATTGAAG 
GACGGGTATT GTGGAAAGCA AAAAGTTTTA GTATTCGCGG AGGAGACAAG 

30 ATGGCAATTA TCGGATCTAA TGGTACAGGA AAGACAACGT TTATTAAAAA 
AATTGTGCAT GGGAATCCTG GTATTTCATT ATCGCCATCT GTCAAAATCG 
GTTATTTTAG CCAAAAAATA GATACATTAG AATTAGATAA GAGCATTTTA 
GAAAATGTTC AATCTTCTTC ACAACAAAAT GAAACTCTTA TTCGAACTAT 
TCTAGCTAGA ATGCATTTTT TTAGAGATGA TGTTTATAAA CCAATAAGTG 

35 TCTTAAGTGG TGGAGAGCGA GTTAAAGTAG CACTAACTAA AGTATTCTTA 
AGTGAAGTTA ATACGTTGGT ACTAGATGAA CCAACAAACT TTCTTGATAT 
GGAAGCTATA GAGGCGTTTG AATCTTTGTT AAAGGAATAT AATGGCAGTA 
TAATCTTTGT ATCTCACGAT CGTAAATTTA TCGAAAAAGT AGCCACTCGA 
ATAATGACAA TTGATAATAA AGAAATAAAA ATATTTGATG GCACATATGA 

40 ACAATTTAAA CAAGCTGAAA AGCCAACAAG GAATATTAAA GAAGATAAAA 
AACTTTTACT TGAGACAAAA ATTACAGAAG TACTCAGTCG ATTGAGTATT 
GAACCTTCGG AAGAATTAGA ACAAGAGTTT CAAAACTTAA TAAATGAAAA 
AAGAAATTTG GATAAATAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1569 
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2) INFORMATION FOR SEQ 



ID NO: 1624 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1624 
TTCTTTAATG CTCGTAGATG AACCTA 



2) INFORMATION FOR SEQ ID" NO: 1625 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1625 
TTTTCGTATT CTTCTTGTTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1626 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1626 
AGGAATGATT AAGCCCCCTT CAAAAA 



2) INFORMATION FOR SEQ ID NO: 1627 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1627 
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2) INFORMATION FOR SEQ ID NO 
(i) 



1628 



10 



15 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: U82085 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1628 



20 ATGCTTAAAA 
AATTTTAAAT 
TTGTAGGTAA 
GGACTAATAG 
TATTAAATAT 

25 GAAAATATGC 
AGTGGTGGTG 
ATGTTCTTTA 
GAATAGAGTT 
GTAGTATCTC 

30 TGAAATTGAA 
ATATAGAGCA 
AAGTATAATT 
GAATAAGGCG 
CTGAATCTAG 

35 CATAGAAATA 
AGAAAAAATA 
AGCAATTTCA 
TTTAATAATC 
AGAAAAGGTT 

40 TGAAGCTGAT 
AAAATTGGAT 
TATCTTAGAA 
GAATTGTTCT 
ATAAATGTTT 

45 ATTTGTTAGC 
TGGATATCGA 
GGTGTTGTGT 
TGAACAATTG 
CATATATAGA 

50 GAAAAAGAAC 
ATTATTAAGT 
TAGATGAAAA 
AATATTTA 



TCGACATGAA 
ATAAAAGAAC 
GAATGGAGTT 
AGATTGACGA 
ATCTCTCAAT 
TTCAATATTT 
AAAAAACTAG 
ATGCTCGTAG 
GATAACAAAT 
ATGATAGAAT 
AATGGATATA 
AAAAGAAATG 
CTAAAAGAAA 
CAAGGAATGA 
AATATGGAAA 
CGAAATCGTT 
AAAGAGCTTC 
TGGTCGCAAT 
AATTTCTTTG 
GCTATAATTG 
TCTAGAAAAA 
ACGTCAGTCA 
AATGTTATGT 
AGCAAGATTA 
TGAGTGGTGG 
GATTGTAATA 
TGCTGTTGAG 
TATTTGCTTC 
TTAATAATAG 
ATATTTAAAA 
TCAAAGAAAA 
AAAATCTCAA 
GTACAAATTG 



GAATGTAAAA 
TAAAGATTTA 
GGCAAAACAA 
AGGAAATATA 
TAGAAGAACC 
CAAGTTGAAA 
ATTTAAACTA 
ATGAACCTAC 
ACTTTTAAAG 
TTTTTTAGAT 
TTAGAGAATT 
CTTCTACGAA 
GCAATTGGAG 
TTAAGCCCCC 
ATGCAACATG 
GGAAACACGA 
CTTCTATTAA 
GTAATTAGTT 
GAGAGATGCT 
GTAACAATGG 
GTAGAATCAG 
AAACTTAGAT 
CTACCTCCAT 
CATTTTTATC 
AGAACAAATA 
CATTAATTCT 
GCATTAGAAG 
CCATGATAAA 
AAAATAATAA 
ATTAAAGATA 
AAAGATGATA 
TGGAAGAAAA 
AAATTAAAAG 



AAATATTATG 


CAGATAAATT 


50 


TAGTGGGGAT 


AAAATAGGTA 


100 


CACTTTTAAA 


AATAATAAAA 


150 


ATTATAAGTG 


AAAAAACAAC 


200 


ACATAGTAAG 


ATAATTGATG 


250 


ATAAGTGGAA 


TGACAATATG 


300 


GCAGAGGGAT 


TTCAAGATCA 


350 


AAGTAATTTA 


GATATCGAAG 


400 


AGTACCGTGA 


TACTTTTTTG 


450 


CAAGTTTGTA 


CAAAAATTTT 


500 


CATCGGTAAT 


TATACAAACT 


550 


AGCAACAAGA 


AGAATACGAA 


600 


CAAGCTATAA 


AGCTAAAAGA 


650 


TTCAAAAACA 


ATGGGAACAT 


700 


CTACTAAACA 


AAAAAAGATG 


750 


ATAGATAAAT 


TAAATCATGT 


800 


AATGGATTTA 


CCTAATAGAG 


850 


TAAAAAACTT 


ATCTATAAAA 


900 


TCATTTGTCA 


TTAAAGGTGG 


950 


TGTAGGAAAA 


ACAACATTGT 


1000 


TAATAATATC 


ACCATCAGTT 


1050 


GTTCTACAAT 


CTCATAAATC 


1100 


TCAAGATGAA 


ACAATAGCAA 


1150 


GCAATGATGT 


TCATAAAGAA 


1200 


AAGGTTGCTT 


TTGCCAAGCT 


1250 


TGATGAACCA 


ACAAACTATT 


1300 


AATTGTTAAT 


TACCTATGAA 


1350 


AAATTTATAC 


AAAACCTAGC 


1400 


AGTGAAAAAA 


TTCGAAGGAA 


1450 


AACCAAAATT 


AAATACAAAT 


1500 


CTAGAAATGC 


AAATTTCATC 


1550 


TGAAGAAAAA 


AACAAAGAAT 


1600 


AATTGAAAAG 


CCTAAATAAA 


1650 






1658 
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2) INFORMATION FOR SEQ ID NO: 1629 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1629 
AAGGGGAAAG TTTGGATTAC ACAACA , 26 



2) INFORMATION FOR SEQ ID NO: 1630 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH : 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1630 
GAACCACAGG GCATTATCAG AACC 24 

30 

2) INFORMATION FOR SEQ ID NO: 1631 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1631 
45 CGACGATGCT TTATGGTTTG T 21 



2) INFORMATION FOR SEQ ID NO: 1632 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 26 bases 

(B) TYPE: Nucleic acid 

841 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1632 



GTTAATTTGC CTATCTTGTC ACACTC 



26 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1633 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M36022 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1633 

ATGGAATTTA AATTACAAGA ATTAAATCTT ACTAACCAAG ATACAGGACC 50 

ATATGGTATA ACCGTTTCAG ATAAGGGGAA AGTTTGGATT ACACAACATA 100 

AAGCAAATAT GATAAGTTGC ATCAATTTAG ATGGAAAAAT TACAGAGTAC 150 

CCACTACCGA CACCAGATGC AAAAGTCATG TGTTTAACTA TATCCTCAGA 200 

TGGGGAAGTT TGGTTTACTG AGAATGCAGC AAACAAAATA GGGAGGATTA 250 

CAAAAAAAGG GATTATTAAG GAATATACAT TGCCTAACCC AGATTCAGCA 300 

CCCTACGGTA TTACAGAAGG ACCAAATGGA GATATATGGT TTACAGAAAT 350 

GAATGGCAAC CGTATTGGAC GTATTACGGA CGACGGTAAA ATTCGTGAAT 400 

ACGAGCTGCC TAATAAAGGA TCTTACCCTT CTTTTATCAC TTTGGGTTCT 450 

GATAATGCCC TGTGGTTCAC AGAAAATCAA AATAATGCTA TTGGTAGAAT 500 

TACAGAAAGT GGGGATATTA CAGAGTTTAA AATTCCTACA CCTGCATCAG 550 

GACCAGTTGG TATTACAAAG GGGAACGACG ATGCTTTATG GTTTGTGGAA 600 

ATTATCGGTA ATAAGATAGG GCGAATAACT CCTCTGGGGG AAATTACCGA 650 

ATTCAAAATT CCAACGCCAA ACGCTCGACC TCATGCAATT ACTGCTGGAG 700 

CAGGAATTGA TTTATGGTTT ACTGAATGGG GGGCTAATAA AATAGGAAGG 750 

CTGACAAGCA ATAATATAAT TGAGGAATAC CCAATTCAAA TCAAAAGTGG 800 

TGAACCACAT GGCATTTGTT TCGATGGTGA AACAATTTGG TTTGCAATGG 850 

AGTGTGACAA GATAGGCAAA TTAACTCTCA TTAAGGATAA TATGGAGTGA 900 



2) INFORMATION FOR SEQ ID NO: 1634 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1634 

TTAACTTGTC TATTCCCGAT TCAGG 25 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1635 



(i) SEQUENCE CHARACTERISTICS:, 
(A) LENGTH: 23 bases 
15- (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1635 
GCTGTGGCAA TGGATATTCT GTA 23- 

2) INFORMATION FOR SEQ ID NO: 1636 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1636 
TTCCTACCCC TGATGCTAAA GTGA 24 

40 



2) INFORMATION FOR SEQ ID NO: 1637 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1637 

843 
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CAAAGTGCGT TATCCGAACC TAA 



2) INFORMATION FOR SEQ ID NO: 1638 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear. 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ 

GGTGGGGAAG ACTGTCTTCA TCCAGGAATT 
GCTCACCTAG TTTTCGCTCG ATCTTTTCAC 
AACATTGCCA AGGCCCACGG TGGTTACTCC 
GCGTACTCGT GAGGGTAACG ATCTGTACCA 
TCATTCAGCT CGAGGGTGAA TCCAAGGTCG 
AACGAGCCCC CCGGTGCCCG TGCCCGTGTC 
TGCCGAGTAC TTCCGTGACG AGGAGGGTCA 
ACAACATTTT CCGTTTCACC CAGGCCGGTT 
GGTCGTATCC CCTCTGCCGT CGGTTACCAG 
GGGTGGTATG CAGGAGCGTA TCACCACCAC 
CCGTCCARGC CGTCTACGTC CCCGCGA 



fumigatus 



ID NO: 1638 

GATTGTGAGT CGTTCCACAT 50 

TAACGCAAAC CATGTAGAAC 100 

GTCTTCACTG GTGTTGGTGA 150 

CGAAATGCAG GAGACTGGTG 200 

CACTGGTGTT CGGACAGATG 250 

GCCCTTACCG GTCTGACCAT 300 

GGACGTGCTG CTCTTCATTG 350 

CTGAGGTGTC TGCCCTTCTC 400 

CCCACCCTGG CCGTCGACAT 450 

CAAGAAGGGT TCTATTACCT 500 

527 



2) INFORMATION FOR SEQ ID NO: 1639 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1639 

ACTGTCTTGA TTCAAGAACT TATTAACAAC ATTGCTAAAG CCCATGGTGG 50 
TTACTCTATT TTCTGTGGTG TTGGTGAACG TACTCGTGAA GGTAACGATT 100 

844 
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TATACCACGA AATGATGGAA ACTGGTGTCA TTAAACTTGA AGGTGACTCC 150 

AAGTGTGCTC TTGTATTCGG TCAAATGAAC GAACCTCCTG GTGCTCGTGC 200 

CCGTGTTGCT TTAACTGGTT TAACCATTGC TGAATACTTC CGTGATGAAG 250 

AAGGTCAAGA TGTGTTACTT TTCATTGATA ACATTTTCCG TTTCACTCAA 300 

GCTGGTTCTG AAGTATCTGC CCTTTTAGGT CGTATTCCAT CTGCTGTAGG 350 

TTACCAACCC ACTTTATCTA CTGATATGGG TGGTATGCAA GAACGTATTA 4 00 

CTACTACCAA GAATGGTTCC ATTACCTCTG TACARGCCGT CTACGTCCCC 450 

GC 452 



2) INFORMATION FOR SEQ ID NO: 1640 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 
25 ' (C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1640 

ACGCCCTTAC GGTAAAACAA AGCAACGAAA ACGGAAGCAT GAACTTAACA 50 

30 TTTGAAGTTG CACTTCATTT AGGTGATGAT ACAGTTCGTA CAGTTGCGAT 100 

GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA GATACTGGTA 150 

AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG TGTATTCAAC 200 

GTATTAGGTG ATGCAATTGA CTTAGATGGT GAACTTCCTG CGGATGTACA 250 

CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA TTATCTACTA 300 

35 AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT ACTTGCTCCT 350 

TACATTAAGG GTGGTAAGAT CGGCCTATTC GGTGGTGCCG GCGTAGGTAA 400 

AACAGTATTA ATTCAAGAAT TAATTAACAA CATCGCACAA GAGCACGGTG 450 

GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA GGGTAATGAC 500 

TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA CTGCGATGGT 550 

40 ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT GTTGCATTAA 600 

CAGGTTTAAC AATGGCTGAG CATTTCCGTG ATGAGCAAGG ACAAGACGTA 650 

CTTCTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG GTTCTGAAGT 700 

ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC CAACCAACAC 750 

TTGCAACAGA AATGGGTCAA TTACAAGAGC GTA 783 

45 



2) INFORMATION FOR SEQ ID NO: 1641 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

845 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: NRRL NRS-319 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1641 



ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG , TTCGTGGCAC AGAAGTAGAA 150 

GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG 200 

15 TGTATTCAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA 300 

CTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 400 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

2 0 GAACACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

GGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT 600 

GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCAG 700 

25 GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 750 

CAACCAACAC TTGCAACAGA AATGGGTCAA TTACAAGAGC GTATTACATC 800 

TACAAATAAA GGATCTATCA CGT 823 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1642 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: NRRL BD-15 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1642 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAAAG 50 

CAACGAAAAC GGAGCAAACA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA TACAGTTCGT ACAGTTGCGA TGTCTTCCAC AGATGGACTT 150 

50 GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGTA ACACTTGGTC GTGTATTCAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAAGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 
CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC . 350 

846 
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TGGTATTAAA GTAGTAGACT 
TTGGTCTATT CGGTGGTGCC 
TTAATTAACA ACATCGCACA 
TGTAGGTGAG CGTACTCGTG 
5 ATTCTGGCGT AATTAAGAAA 
CCACCTGGAG CACGTCAACG 
GCATTTCCGT GATGAGCAAG 
TCTTCCGTTT CACGCAAGCA 
ATGCCATCTG CGGTAGGTTA 
10 ATTACAAGAG CGTATTACAT 



TACTTGCTCC TTACATTAAG 
GGTGTAGGTA AAACAGTATT 
AGAACACGGT GGTATCTCTG 
AAGGTAACGA CTTATACCAC 
ACTGCGATGG TATTCGGACA 
TGTTGCATTA ACAGGTTTAA 
GACAAGACGT ACTACTGTTC 
GGTTCTGAAG TATCTGCCCT 
CCAGCCAACA CTTGCAACAG 
CTACAAATA 



PCT/CA00/01150 

GGTGGTAAGA 400 

AATTCAGGAA 450 

TATTCGCCGG 500 

GAAATGAGCG 550 

AATGAACGAG 600 

CAATGGCTGA 650 

ATCGATAACA 700 

TCTTGGTCGT 750 

AAATGGGTCA 800 
829 



2) INFORMATION FOR SEQ ID NO: 1643 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1643 

30 AATCTACAAT GCCCTTACGG TAAAACAAAG CAACGAAAAC GGAGCAAGCA 50 

TTAACTTAAC ATTTGAAGTT GCACTTCATT TAGGTGATGA CACAGTTCGT 100 

ACAGTTGCGA TGTCTTCCAC AGATGGACTT GTTCGTGGCA CAGAAGTAGA 150 

AGATACTGGT AAACCAATCT CTGTACCGGT TGGTGATGCA ACACTTGGTC 200 

GTGTATTTAA CGTATTAGGT GATGCAATTG ACTTAGATGG TGAAGTTCCT 250 

35 GCGGATGTAC GCCGTGATCC AATTCACCGT CAAGCACCTG CGTTCGAAGA 300 

GTTATCTACG AAAGTAGAAA TTCTTGAAAC TGGTATTAAA GTAGTAGACT 350 

TACTTGCTCC TTACATTAAA GGTGGTAAAA TCGGTCTATT CGGTGGTGCC 400 

GGTGTAGGTA AAACAGTATT AATCCAGGAA TTAATTAACA ACATCGCACA 450 

AGAGCACGGT GGTATTTCTG TATTCGCTGG TGTAGGTGAG CGTACTCGTG 500 

40 AAGGTAATGA CTTATACCAC GAAATGAGCG ATTCTGGCGT AATCAAGAAA 550 

ACAGCGATGG TATTCGGACA AATGAACGAG CCACCTGGTG CACGTCAACG 600 

TGTTGCATTA ACAGGATTAA CAATGGCTGA ACATTTCCGT GATGAGCAAG 650 

GACAAGACGT ACTATTGTTC ATCGATAACA TCTTCCGTTT CACGCAAGCG 700 

GGTTCTGAAG TATCTGCCCT TCTTGGTCGT ATGCCATCTG CGGTAGGTTA 750 

45 CCAACCAACT CTTGCAACAG AAATGGGTCA ATTACAAGAG CGTATTACAT 800 

CTACAAATAA AGGATCTATC ACG 823 



50 2) INFORMATION FOR SEQ ID NO: 1644 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 708 bases 

847 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1644 



GGAAAGCTAC CTGAAATCTA CAATGCCCTT, AGAATTAAAC AAGACGCAGT 50 

TAACTTAACT TTAGAAGTTG CACTTCACTT AGGTGATGAT ACAGTTCGTA 100 

15 CAGTTGCGAT GTCTTCCACA GACGGACTTG TTCGTGGTAC TGCAGTAGAA 150 

GATACTGGCA AAGCGATTTC TGTTCCAGTT GGTGATGCAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GAGGTTCCAG 250 

CAGATGTACG CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAG 300 

TTATCTACAA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

20 ACTTGCTCCT TACATTAAAG GTGGTAAAAT CGGTCTATTC GGTGGTGCCG 400 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 450 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGTGC ACGTCAACGT 600 

25 GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 700 

GTTCTGAA 708 



30 

2) INFORMATION FOR SEQ ID NO: 1645 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 778 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1645 

GTACGATGCG CTTGAAGTTC AAAACGATGT CAAACTAGTG CTGGAAGTTC 50 

AACAGCAGCT CGGTGGTGGT GTCGTTCGTT GTATCGCTAT GGGTACCTCA 100 

GATGGCCTAA GCCGCGGTCT GAAAGTACTT GATTTAGAAC ACCCTATCGA 150 

50 AGTTCCTGTC GGTGTGGCAA CTCTGGGCCG TATTATGAAC GTGCTTGGTG 200 

AGCCTATCGA TATGAAGGGT GACATTGGTG AAGAAGAGCG TTGGGCTATC 250 

CATCGCGCTG CACCAAGCTA CGAAGATTTA TCCAGTGCTA ACGAACTGCT 300 

GGAAACGGGC ATCAAGGTTA TCGACCTGAT TTGTCCGTTT GCCAAAGGCG 350 

848 
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GTAAAGTTGG TCTGTTTGGT GGTGCCGGCG 

ATGGAGCTGA TTCGTAATAT TGCGACTGAG 

TGCCGGCGTT GGTGAGCGTA CTCGTGAGGG 

TGACAGAATC TAACGTATTA GACAAAGTAT 

5 AATGAGCCAC CGGGAAACCG TCTGCGCGTT 

GGCGGAAAAA TTCCGTGATG AAGGCCGTGA 

ACATTTATCG TTATACCTTA GCCGGTACAG 

CGTATGCCAT CAGCGGTAGG TTACCAACCA 

TACACTGCAA GAACGTATCA CYTCAACC 
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TAGGTAAAAC GGTAAACATG 400 

CACTCAGGTT ACTCTGTATT 450 

TAATGACTTC TACCACGAAA 500 

CTCTGGTTTA TGGCCAAATG 550 

GCGTTAACCG GCCTGACCAT 600 

CGTTCTGCTG TTTATCGATA 650 

AAGTATCAGC ACTGTTAGGT 700 

ACGCTGGCGG AAGAGATGGG 750 

778 



2) INFORMATION FOR SEQ ID NO: 164 6 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 
25 (B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1646 

GATGCCGTAC CAAAAGTGTA CGACGCTCTT GAGGTTACAA ATGGTAAAGA 50 

30 CCGTCTGGTG CTGGAAGTTC AGCAACAGTT AGGTGGTGGC GTAGTGCGTA 100 

CTATCGCCAT GGGTACTTCT GATGGTTTGC GTCGTGGTCT GGAAGTTTCT 150 

AACCTCGATC ACCCAATTGA AGTGCCAGTA GGTAAAGCAA CTCTGGGCCG 200 

TATCATGAAC GTCCTGGGCG AGCCTATCGA CATGAAAGGC GATATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTGCTG CTCCTAGCTA CGAAGAACTG 300 

35 TCTAGCTCCC AGGATCTGCT GGAAACCGGC ATCAAAGTAA TGGACCTGAT 350 

TTGCCCGTTC GCTAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 

TGGGTAAAAC TGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 4 50 

CACTCCGGTT ACTCAGTGTT TGCAGGCGTG GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCATGAAA TGACCGATTC CAACGTTCTG GACAAAGTAT 550 

40 CACTGGTTTA TGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGTTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATTTACCG TTATACCCTG GCCGGTACAG 700 

AAGTATCTGC GCTGCTGGGT CGTATGCCAT CTGCGGTAGG TTACCAGCCA 750 

ACTCTGGCAG AAGAGATGGG TGTTTTGCAG GAGCGTATTA CCTCCACCAA 800 

45 AACTGG 806 



2) INFORMATION FOR SEQ ID NO: 1647 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 bases 

(B) TYPE: Nucleic acid 

849 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

-10- - (-xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164-7 _ 

GAAACCGTTG TTGACACTGG TTCCCCAATC ACCGTCCCAG TTGGTCGTGA 
AACCTTGGGT CGTATCATTA ACGTTATCGG , TGAACCAATC GATGAACGTG 
GTCCAATCAA CACCAAGCAA AGAAACCCAA TCCATGCTGA TCCTCCTTCA 

15 TTTGCTGAAC AATCCACTTC TGCTGAAGTT TTAGAAACCG GTATTAAAGT 
TGTCGATTTA TTGGCTCCTT ATGCCAGAGG TGGTAAAATT GGTTTATTCG 
GTGGTGCCGG TGTCGGTAAA ACCGTCTTTA TCCAAGAATT GATTAACAAC 
ATTGCTAAAG CTCATGGTGG TTTCTCAGTC TTCACCGGTG TCGGTGAAAG 
AACCAGAGAA GGTAACGATT TATACCGTGA AATGAAAGAA ACCGGTGTCA 

20 TTAACTTGGA AGGTGACTCT AAAGTCGCTT TAGTCTTCGG TCAAATGAAC 
GAACCTCCAG GTGCTAGAGC CCGTGTTGCC TTAACCGGTC TAACCATTGC 
TGAATACTTC AGAGATGAAG AAGGTCAAGA TGTCTTGTTA TTCGTTGATA 
ACATTTTCAG ATTCACCCAA GCTGGTTCAG AAGTCTCTGC CTTATTAGGT 
CGTATTCCAT CTGCTGTCGG TTATCAACCA ACCTTGGCCA CTGATATGGG 

25 TTTGTTGCAA GAACGTATCA CCACCACCAG AAAAGGTTCC GTCACTTCTG 
TCCAAGCTGT TTATGTCCCA GCTGATGATT TGACTGATCC TGCCCCAGCC 
ACCACTTTCG CGCATTTGGA TGCTACTACT GTCTTGTCTC GTGGTATCTC 
AGAATTGGGT ATCTACCCAG CTGTCGATCC ATTGGATTCC AAATCAAGAT 
TATTAGATGC CGCTGTTGTT GGTCAAGAAC ATTACGATGT TGCTACTCAA 

30 GTTCAACAAA CTTTACAAGC CTACAAATCT TTACAAGATA TTATCGCTAT 
TTTAGGTATG GATGAATTAT CAGAACAAGA TAAATTGACC GTTGAAAGAG 
CTAGAAAGAT CCAACGTTTC TTATCTCAAC CATTTGCTGT CGCTGAAGTT 
TTCACTGGTA TTCCAGGTAG ATTAGTTAGA TTGAAAGAAA CCGTTGCTTC 
ATTCAGAGAC GTTTTAGCTG GT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1122 



35 



2) INFORMATION FOR SEQ 



ID NO: 1648 



40 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700677 



50 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1648 



850 
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TACGTCTCTT AGCCTGACGA TGATTTGCAA 
TGTTTCAAAT GGTAAAGTCA TCGCCCAACT 
GTAACGTTTC ATTTGGTACC AACCAAGCTG 
GGATCAACTA TGAAACCAAT CACAGACTAT 
5 TGTCTACGAT TCAACTGCTA CTATCGTTCA 
CTGGGACAGA TATCCCTCTC TATAACTGGG 
ATTACTCTGC AATATGCCCT TCAACAATCT 
AACACTAAAC AAGGTCGGTC TAGATAAGGC 
TTGGTATCGA CTATCCAAGC ATGCATTATG 

10 ACAACTGAAT CCAACAAAAA ATATGGTGCA 
CGCCTATGCC GCATTCGCAA ATGGTGGTAT 
TCAATAAAGT TGTCTTTAGC GATGGTAGCG 
GGCACAAGAG CCATGAAAGA AACGACTGCT 
GAAAACAGTC TGGACGTACG GAACTGGTCG 

15 TTCCTCAAGC TGGTAAAACA GGTACCTCTA 
GAAAAGTATA TCAAGAACAC TGGTTACGTA 
GGGTTATACC CGT 



GTCGCATCTA CGGTCGTAGA 50 

TGGAGCTCGT CACCAAGCAA 100 

TGGAAACCAA TCGTGACTGG 150 

GCTCCTGCCT TGGAGTACGG 200 

CGATGAGCCC TATAACTACC 250 

ATCGAGCATA TTTCGGTAAT 300 

CGTAACGTAC CTGCCGTTGA 350 

TAAAACCTTC CTTAATGGTC 400 

CAAACGCCAT TTCAAGTAAT 450 

AGTAGTGAAA AAATAGCTAC 500 

TTACCACAAA CCAATGTACA 550 

AAAAAGAATT TTCTGACCCT 600 

TACATGATGA CAGAAATGAT 650 

TGGTGCCTAC CTGCCTTGGC 700 

ACTATACTGA CGAAGAAATT 750 

GCTCCAGATG AAATGTTTGT 800 

813 



20 



2) INFORMATION FOR SEQ ID NO: 1649 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 813 bases 
25 (B) TYPE: Nucleic. acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1649 

ATTAATGAAG CTATTGTTGT TAATTATGAA TTAGAAGGAA AAGAATGCAA 50 

GCTAGTTCTT GAAGTAGCTG CACATTTAGG CGATAATAAA GTAAGAACCA 100 

TCGCTATGGA TATGACAGAT GGTCTTGTTA GAGGTTTAAC AGCCGTCGCA 150 

40 ACTGGAAATC CAATTAGTGT TCCAGTAGGC GAAAAAGTTC TTGGAAGAAT 200 

TTTTAATGTA ACGGGTGATT TGATTGATGA GGGCGAAGAA ATCAATTTTG 250 

ATAAGCACTG GTCAATTCAT AGAGATCCAC CTCCATTTGA AGAACAAAGT 3 00 

ACAAAAAGCG AAATCTTTGA AACAGGTATA AAGGTTGTTG ATTTGCTAGC 350 

TCCTTATGCT AAAGGTGGAA AAGTTGGTCT TTTTGGTGGT GCAGGTGTTG 4 00 

45 GTAAAACCGT TATTATTATG GAATTAATTC ACAATGTTGC ATTTAAACAT 450 

AGCGGATATT CTGTTTTTGC AGGTGTTGGC GAAAGAACTC GTGAGGGTAA 500 

TGACCTTTAC AATGAAATGA AAGAAAGTAA TGTATTAGAT AAAGTTGCAT 550 

TGTGTTATGG TCAAATGAAT GAACCACCAG GGGCAAGAAA TCGTATAGCT 600 

TTAACAGGTC TTACTATGGC TGAGTATTTT AGAGATGAAA TGGGACTTGA 650 

50 TGTTTTAATG ATTATTGATA ATATTTTCAG ATTTTCTCAA TCAGGCTCAG 700 

AAATGTCAGC ACTTTTAGGA AGAATTCCTT CAGCTGTTGG TTATCAACCA 750 

ACCTTAGCTA GTGAAATGGG TAAGTTCCAA GAAAGAATTA CTTCAACCAA 800 

GAAAGGATCT ATT 813 

851 
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2) INFORMATION FOR SEQ ID NO: 1650 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY:- Linear 



(ii) MOLECULE TYPE: Genomic DNA 

~10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1650 

GGTGGGGAAG ACTGTGTTCA TTCAGGAATT GATTGTAAGT TCTGTTATCA 50 

ACTAAAGCCG ACAGCGGTTG CTGATATGCT CTAGAACAAC ATTGCTAAGG 100 

CTCACGGTGG TTACTCCGTG TTCACTGGTG TCGGTGAGCG TACCCGTGAG 150 

20 GGTAACGATT TGTACCATGA AATGCAAGAG ACCCGTGTCA TTCAACTCGA 200 

CGGAGAGTCC AAGGTCGCTC TTGTCTTCGG TCAAATGAAC GAGCCCCCTG 250 

GTGCCCGTGC CCGTGTTGCC CTTACCGGTT TGACCATTGC TGAATACTTC 300 

CGTGACGAGG AAGGCCAAGA CGGTAGGCTT CATGCTTCTA TCGCTAGGGG 350 

CGTGTGATAC AGGAGGCTAA TCGCTTTTCT AGTGCTTCTC TTTATTGACA 4 00 

25 ACATTTTCCG TTTCACTCAA GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT 450 

CGTATTCCTT CCGCTGTCGG TTACCAACCT ACTCTCGCCG TCGACATGGG 500 

TGTTATGCAG GAACGTATCA CCACCACCAC CAAGGGATCC ATTACTTCAG 550 

TGCARGCCGT CTACGTCCCC 570 

30 



2) INFORMATION FOR SEQ ID NO: 1651 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsia parva 

(B) STRAIN: ATCC 10784 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1651 



TGTCTTCATC CAGGAGTTAA TTGTACGTTT TTTGTCTGCC TACTATAATA 50 

GACGAAGAAA TATTTTGATT TATTTTTCTA ATAATTCAAA TAGAACAACA 100 

50 TTGCCAAAGC CCACGGTGGT TACTCTGTCT TCACTGGTGT TGGCGAGCGG 150 

ACCCGTGAAG GAAACGATCT GTACCACGAA ATGCAGGAGA CCCGTGTCAT 200 

CCAGCTCGAT GGCGAGTCCA AGGTCGCCCT CGTCTTCGGT CAAATGAACG 250 

AACCCCCCGG AGCCCGTGCC CGTGTTGCCC TCACTGGTCT GACCGTTGCT 300 

852 
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GAATACTTCC GTGACGAGGA AGGTCAAGAT 
CCAGTAATTT GACTCGAAGC TCCACTCACA 
CATCGACAAC ATTTTCCGCT TCACCCAGGC 
TGCTCGGCCG TATCCCCTCC GCCGTCGGTT 
5 GACATGGGTA TGATGCAGGA ACGTATCACC 
CACCTCCGTG 



PCT/CA00/01150 

GGTGCGTATA TATATATTCG 350 
CATATATTAG TGCTCCTCTT 400 
AGGTTCCGAA GTGTCCGCCC 450 
ACCAGCCCAC CCTCGCTGTC 500 
ACCACCACCA AGGGCTCCAT 550 

560 



10 2) INFORMATION FOR" SEQ~ ID NO: 1652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY:' Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14976 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1652 



CCACAAGTGT ACAGCGCCCT TGAGGTTAAA AATGGTGATG CTCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TGGGCGGTGG CGTGGTTCGT ACCATCGCCA 100 

TGGGTTCTTC AGACGGCCTT AAGCGTGGTC TGGAAGCCGT TGACCTTCAG 150 

CACCCAATTG AAGTACCGGT AGGTACTGCC ACACTTGGCC GTATCATGAA 200 

30 CGTGCTGGGT GAGCCGATCG ATATGAAAGG CGACATTGGC GAAGAAGAGC 250 

GCTGGGCGAT TCACCGCTCT GCACCTTCTT ACGAAGATCA GTCGAACTCT 300 

CAGGATCTGC TGGAAACCGG CATCAAGGTG ATTGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCTTGTTCGG TGGTGCGGGC GTAGGTAAAA 400 

CCGTCAACAT GATGGAGCTT ATTCGTAACA TTGCGGCTGA GCACTCAGGT 450 

35 TTCTCGGTAT TTGCCGGTGT GGGTGAGCGT ACCCGTGAAG GTAACGACTT 500 

CTACCACGAA ATGACCGACT CCAACGTTAT CGACAAAGTT TCGCTGGTCT 550 

ATGGTCAGAT GAACGAGCCA CCGGGTAACC GTCTGCGCGT TGCGCTGACC 600 

GGTCTGACCA TGGCGGAGAA GTTCCGTGAC GAAGGTCGTG ACGTACTGCT 650 

GTTTATCGAT AACATCTATC GTTACACCCT GGCCGGTACT GAAGTCTCCG 700 

40 CTCTGCTGGG TCGTATGCCT TCTGCGGTAG GTTACCAGCC AACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAACGTATC 780 



45 2) INFORMATION FOR SEQ ID NO: 1653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

853 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1653 



TGTGTTCATT CAGGAGCTGA TTGTGAGTAC CCCGGAGATT TTCCTGCGAT 50 

TGCGCATGAA GCAAGCGCTG ACGTCCATCT AGAACAACAT CGCCAAGGCT 100 

10 CACGGTGGTT ACTCCGTGTT CTGCGGTGTC GGCGAGCGTA CTCGTGAGGG - 150 

TAACGATTTG TACCACGAAA TGCAGGAGAC CGGTGTCATC AACCTCGAGG 200 

GCGAGTCCAA GGTCGCCCTT GTCTTCGGTC AGATGAACGA GCCCCCGGGA 250 

GCCCGTGCCC GTGTCGCCCT TACTGGTCTT ,ACCGTCGCTG AGTAAGTTTT 300 

GACAACCAGA AGCGAGTATT GCCACAATTA CTGACTAAAA ATCAAGATAT 350 

15 TTCCGTGACG AGGAGGGCCA GGATGTGCTT CTCTTCATTG ACAACATTTT 400 

CCGTTTCACC CAGGCCGGTT CTGAGGTGTC CGCTCTTCTC GGCCGTATTC 450 

CCTCTGCCGT CGGTTACCAG CCCACTCTCG CCGTCGACAT GGGTATGATG 500 

CAGGAGCGTA TCACCACCAC CCAGAAGGGT TCCATCACTT CCGTC 545 

20 



2) INFORMATION FOR SEQ ID NO: 1654 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 564 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1654 



TGTGTTTATT 


CAGGAGCTCA 


TCAACAACAT 


CGCCAAGGCC 


CACGGTGGTT 


50 


ACTCTGTCTT 


CACTGGTGTC 


GGTGAGCGAA 


CCCGTGAGGG 


TAACGATCTG 


100 


TACCACGAAA 


TGCAGGAGAC 


TTCCGTTATT 


CAGCTTGATG 


GCGAGTCCAA 


150 


GGTTGCCCTG 


GTTTTCGGTC 


AGATGAACGA 


GCCCCCTGGA 


GCTCGTGCCC 


200 


GTGTCGCTCT 


TACCGGGTAA 


GTTGATAGAT 


AGTGCCTTCC 


CTTCCTGTTT 


250 


CAACACCTCA 


ACACCACCAC 


TCCCCAAGAC 


ATTGCTACTC 


ATACTGCACC 


300 


ATGATATTAT 


ATTTACGCCT 


CTTGGACGCT 


AGCTAATGTT 


GTATCGACAG 


350 


TTTGACTGTT 


GCTGAGTACT 


TCAGAGACGA 


GGAGGGTCAG 


GACGTGCTGC 


400 


TTTTCATTGA 


CAACATTTTC 


CGATTCACTC 


AGGCCGGTTC 


CGAGGTGTCT 


450 


GCCCTTCTCG 


GTCGTATCCC 


CTCTGCCGTC 


GGTTACCAGC 


CCACTCTGGC 


500 


CGTCGACATG 


GGTGGTATGC 


AGGAGCGTAT 


TACCACCACC 


ACCAAGGGTT 


550 


CCATTACCTC 


AGTC 








564 



50 

2) INFORMATION FOR SEQ ID NO: 1655 

854 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi)" ORIGINAL SOURCEr 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1655 



CGTACCGCGC GTGTACGAGG CTCTTGAGGT ACAAAATGGT AGTGAGAATC 50 

TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATTGT TCGTACCATC 100 

GCCATGGGTT CTTCCGACGG TCTGCGTCGC GGTCTGGAAG TCAAAGACCT 150 

C GAG CATC CG ATCGAAGTCC CGGTAGGTAA AGCAACGCTG GGTCGTATCA 200 

20 TGAACGTACT GGGCCAACCG GTAGACATGA AAGGCGACAT CGGCGAAGAA 250 

GAGCGTTGGG CGATTCACCG CGCAGCGCCT TCCTACGAAG AGTTGTCAAA 300 

CTCTCAGGAA CTGCTGGAAA CCGGCATCAA AGTTATCGAC CTGATGTGTC 350 

CGTTTGCGAA GGGCGGTAAA GTTGGTCTGT TCGGTGGTGC GGGTGTAGGT 400 

AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA TCGAGCACTC 450 

25 CGGTTACTCC GTGTTTGCGG GCGTAGGTGA ACGTACTCGT GAGGGTAACG 500 

ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGATAA AGTATCCCTG 550 

GTGTATGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC GCGTTGCGCT 600 

GACCGGCCTG ACCATGGCTG AGAAGTTCCG TGACGAAGGT CGTGACGTTC 650 

TGCTGTTCGT CGATAACATC TATCGTTACA CCCTGGCCGG TACTGAAGTA 700 

30 TCCGCACTGC TGGGTCGTAT GCCTTCAGCG GTAGGTTACC AGCCGACTCT 750 

GGCGGAAGAG ATGGGCGTTC TGCAGG 776 



35 2) INFORMATION FOR SEQ ID NO: 1656 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporia audouinii 

(B) STRAIN: ATCC 11347 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1656 

GTGTTCATCC AGGAGTTGAT TGTAAGTGAT TATATTCCCC TAGAAAGAAA 50 
TGTTTTGAAC AAAAGTCTCG AATTAGAAAA TTCTTTTCAG ATACTAATTT 100 
ACTATAGAAC AACATTGCCA AGGCTCACGG TGGTTACTCC GTCTTCACCG 150 

855 
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GTGTCGGAGA GCGAACCCGT GAAGGAAACG 
GAAACTCGTG TCATCCAACT TGATGGCGAG 
CGGTCAGATG AACGAGCCCC CAGGTGCCCG 
GTTTGACCAT TGCTGAGTAC TTCCGTGATG 
5 TTCTTTAAAT TAGATATCTT CTGGAGAAAC 
AGTGCTTCTC TTCATCGACA ACATCTTCCG 
AAGTGTCTGC CCTGCTTGGT CGTATTCCAT 
ACTCTTGCCG TCGACATGGG TGGTATGCAG 
GAAGGGATCC ATTACCTCCG TC 



PCT/CAOO/01150 

ATCTGTACCA TGAAATGCAG 200 

TCCAAGGTCG CCCTGGTCTT 250 

TGCCCGTGTT GCTCTTACTG 300 

AGGAAGGTCA AGACGGTATG 350 

AGCGTCTAAC AAATTCTTCC 400 

TTTCACTCAG GCTGGTTCCG 450 

CTGCCGTCGG TTACCAACCC 500 

GAACGTATTA CCACCACCAA 550 

572 



2) INFORMATION FOR SEQ ID NO: 1657 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 
25 (B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1657 

GCCTAAAGTG TATAACGCAC TTGAGGTGAA AGGCGGTGCC ACTAAACTGG 50 

30 TACTGGAAGT TCAGCAGCAG CTAGGCGGCG GCGTTGTACG CTGTATCGCT 100 

ATGGGTACTT CTGACGGTCT GCGTCGCGGA CTGGACGTTG TTGACCTGGA 150 

GCACCCGATT GAAGTCCCAG TAGGTAAAGC GACCTTAGGC CGCATTATGA 200 

ACGTACTGGG TGAGCCAATT GATATGAAGG GTGATATCGG CGAAGAAGAT 250 

CGCTGGGCTA TTCACCGTGA AGCTCCAAGC TACGAAGAAC TGTCTAACTC 300 

35 GCAAGAACTG CTGGAAACCG GTATCAAGGT AATGGACTTG ATTTGTCCGT 350 

TCGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTTGGTAAA 400 

ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCAGG 450 

TTACTCTGTA TTTGCCGGCG TGGGTGAACG TACTCGTGAG GGTAACGACT 500 

TCTACCACGA AATGACCGAC TCCAACGTAT TGGACAAAGT ATCACTGGTT 550 

40 TATGGCCAGA TGAACGAGCC ACCAGGAAAC CGTCTGCGCG TTGCGCTGAC 600 

CGGTCTGACT ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT GACGTACTGC 650 

TGTTCATCGA TAACATCTAC CGTTATACCT TGGCCGGTAC CGAAGTATCT 700 

GCACTGCTGG GTCGTATGCC TTCTGCGGTA GGTTATCAGC CAACGCTGGC 750 

GGAAGAGATG GGTGTTCTGC AAGAACGTAT CACCTCTACC 790 

45 



2) INFORMATION FOR SEQ ID NO: 1658 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

856 
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10 



25 



40 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 1658 



TGTCTTCATT CAGGAGCTTA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACTGGTGTG GGAGAGCGCA CTCGTGAGGG AAACGACTTG 100 

TATCACGAGA TGCAAGAGAC TTCCGTTATC CAGCTCGAAG GCGAATCCAA 150 

GGTTGCCCTC GTCTTCGGTC AAATGAACGA GCCTCCGGGT GCTCGTGCTC 200 

15 GTGTTGCTCT CACCGGTCGT AAGTGCTCCT TCCCAGATTT CTCTTCCCCA 250 

GTTTCTGGAC CCACTTTTTC CTTCCACCAC CATTCTACTG GGTAGGACCA 300 

AGATAGCACT GCCTATTCTG GTGCCTTCCT ACCGCCTACT CTACTGCCTA 350 

TTCCACCACC TTTTCTACCG CCTCTTCTAC TTGCTATTGT ATACTAACTT 400 

ACTCAAACAG TTACTATTGC TGAGTACTTC CGTGACGCTG AGGGCCAGGA 450 

20 TGTGCTTCTC TTCATCGACA ACATTTTCCG TTTCACCCAG GCCGGTTCCG 500 

AGGTGTCCGC TCTTCTCGGT CGTATCCCCT CCGCCGTCGG TTACCAGCCC 550 

ACCCTTGCCG TCGACATGGG TGGTATGCAG GAGCGTATCA CCACCACCAA 600 

GAAGGGATCC ATTACCTCCG TC 622 



2) INFORMATION FOR SEQ ID NO: 1659 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1659 



GACGCTGTAC CTCAGGTGTA CGATGCACTG ACAGTTGAGG GTGCTGAGCT 50 

GGTACTGGAA GTGCAGCAGC AGCTGGGTGG TGGTGTTGTT CGCTGTATCG 100 

45 CGATGGGTGC CTCTGATGGC CTCAAGCGCG GTCTGAAAGC GCACAATACT 150 

GGTGCTCCTA TCACTGTACC GGTGGGTGTG GAAACACTGG GCCGGATCAT 200 

GGATGTGTTG GGTAACCCGA TTGACCAGAA AGGTCCAATC GGTGAACAAG 250 

ATCGCTGGGT GATCCACCGT GAAGCACCAA GCTACGAAGA TCAGGCTAAC 300 

AGCACTGAAC TGCTGGAAAC CGGTATCAAG GTTATCGACC TGGTATGCCC 350 

50 GTTTGCGAAA GGCGGTAAAG TCGGTCTGTT CGGTGGTGCC GGTGTAGGTA 400 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTATTCCG TGTTTGCGGG CGTGGGTGAG CGTACCCGTG AAGGTAACGA 500 

CTTCTACCAC GAAATGACAG ACTCCAACGT ACTGGACAAA GTATCCCTGG 550 

857 
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TGTACGGTCA GATGAACGAG CCGCCAGGTA ACCGTCTGCG CGTAGCACTG 600 

ACCGGCCTGA CCATTGCGGA GAAATTCCGT GATGAAGGTC GTGACGTACT 650 

GCTGTTCATC GATAACATCT ACCGTTATAC CCTGGCGGGG ACCGAAGTAT 700 

CGGCACTGCT GGGCCGTATG CCTTCTGCGG TAGGTTATCA GCCAACGCTG 750 

5 GCGGAAGAGA TGGGTGTACT GCAAGAGCGT ATTACCTCTA CCCG 794 



. 2) INFORMATION FOR SEQ ID NO: 1660 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1660 

25 AGGTATATGA CGCTSTGAAG ATCACAGGTG AAGGCGCCTG TAATGGTTTG 50 

GTGCTGGAAG TTCAGCAACA GCTAGGCGGT GGTGTAGTTC GTACTATCGC 100 

TATGGGTTCT TCTGATGGTC TGCGTCGTGG TCTTGAGGTT GTTAACTCAG 150 

GTTCACCTAT TTCTGTTCCT GTTGGTACCG CCACGCTTGG CCGTATCATG 200 

AACGTATTAG GTGAGCCTAT TGATGAAGCG GGTCCAATCG GTGAAGAAGA 250 

30 GCGTTATGTT ATTCACCGTG CAGCACCTTC ATATGAAGAT CAATCGAACA 300 

CTACTGAACT GTTAGAGACA GGTATCAAGG TTATTGACCT TGTTTGTCCA 350 

TTCGCTAAGG GTGGTAAAGT AGGTCTGTTC GGTGGTGCGG GTGTTGGTAA 400 

AACAGTTAAC ATGATGGAAC TGATTAACAA CATCGCTAAA GCTCACTCGG 450 

GTCTTTCGGT GTTCGCCGGT GTGGGTGAAC GTACTCGTGA AGGTAACGAC 500 

35 TTCTACTACG AGATGAAAGA TTCTGGCGTT CTCGACAAAG TGGCCATGGT 550 

TTATGGTCAG ATGAACGAGC CACCAGGAAA CCGTTTACGC GTAGCACTGT 600 

CAGGTCTGAC AATGGCTGAG AAGTTCCGTG ACGAAGGTCG TGACGTATTG 650 

TTGTTCGTTG ACAACATCTA CCGTTATACC TTAGCCGGTA CTGAAGTATC 700 

TGCACTGTTA GGCCGTATGC CTTCTGCGGT AGGTTATCAA CCAACATTGG 750 

40 CTGAAGAAAT GGGCGTTCTG CAAGAGCGTA TTACTTCAAC TAAGACGGG 799 



45 



2) INFORMATION FOR SEQ ID NO: 1661 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



858 



SUBSTITUTE SHEET (RULE 26) 



5 



20 



10 



WO 01/23604 PCT/CAOO/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1661 

TGGGAAGCGA AAATCCTG 18 
2) INFORMATION FOR SEQ ID NO: 1662 



(i) SEQUENCE CHARACTERISTICS: 
10 - - (A) LENGTH: 774 -bases- 

IB) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvus 

(B) STRAIN: ATCC 35224 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1662 



CTATGCCTCA AACTAGAGAG CATATCTTGC TATCTCGCCA AGTAGGCGTT 50 

CCATATATCG TTGTATTTAT GAACAAAGCC GATATGGTCG ATGACGCTGA 100 

25. GCTTCTTGAG CTAGTCGAGA TGGAAATTCG CGAGCTTCTT AACGAGTACA 150 

ACTTCCCTGG CGATGATACT CCTATCATAT CAGGTTCTGC TCTTAAAGCC 200 

CTCGAAGAGG CTAAAGCAGG CGTTGATGGC GAGTGGTCAG CAAAAGTTCT 250 

TGAGCTTATG GATAAAGTCG ATGAGTATAT CCCAACTCCA GTTCGTGCTA 300 

CCGATAAAGA CTTCCTGATG CCTATCGAAG ACGTTTTCTC TATCTCAGGT 350 

30 CGTGGAACGG TCGTTACTGG TAGGATCGAA AAAGGTGTCG TAAAAGTTGG 400 

CGATACTATC GAGATCGTTG GTATCAAACC TACTCAAACT ACGACAGTTA 450 

CTGGCGTTGA GATGTTTAGG AAAGAGATGG AACAAGGCGA GGCCGGTGAT 500 

AACGTAGGTG TTCTTTTAAG AGGTACTAAA AAAGAAGACG TCGAGCGCGG 550 

CATGGTTCTT TGTAAGCCAA AATCAATCAC TCCTCATACA AAATTTGAGG 600 

35 GTGAGGTTTA TATCCTAACA AAAGAGGAAG GCGGACGCCA CACTCCATTC 650 

TTTAACAACT ATAGACCACA ATTTTATGTA AGAACAACAG ACGTTACAGG 700 

TTCTATCACA CTTCCAGAAG GAACTGAGAT GGTTATGCCT GGAGATAATG 750 

TCAGAATTTC CGTTGAACTC ATCG 774 



2) INFORMATION FOR SEQ ID NO: 1663 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter rectus 

859 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1663 

5 TTCTGCGGCT GACGGCCCAA TGCCACAAAC TAGAGAGCAC ATCTTGCTAT 50 

CTCGCCAAGT AGGCGTTCCG TATATCGTTG TTTTTATGAA CAAAGCCGAT 100 

ATGGTCGATG ATGCCGAGCT TCTTGAGCTG GTTGAGATGG AGATTCGCGA 150 

GCTTCTAAAC GAGTATGATT TCCCTGGTGA CGATACTCCA ATCGTAGCAG 200 

GCTCTGCTCT TCAAGCTCTT AATGAAGCCA AAGCCGGAAC AGAAGGCGAG 250 

- 10- TGGTCTGCAA AAATTCT-TGA GCTTATGGCT AAAGTTGACG AGTATATCCC 300 

GACTCCGGTT CGTGCAACGG ATAAAGACTT CTTGATGCCT ATTGAGGACG 350 

TTTTCTCTAT CTCCGGTCGC GGCACCGTCG TTACCGGCAG AATCGAAAAA 4 00 

GGTATCGTAA AAGTCGGTGA TACTATCGAG , ATCGTAGGTA TCCGCGATAC 4 50 

TCAAACAACT ACCGTTACCG GCGTTGAGAT GTTCAGAAAA GAGATGGATC 500 

15 AAGGCGAAGC GGGCGATAAC GTAGGCGTTC TTCTAAGAGG CACTAAAAAA 550 

GAAGACGTTG AGCGCGGTAT GGTTCTTTGC AAACCTAAAT CAATCACTCC 600 

TCACACTAAA TTTGAGGGAG AGGTTTATAT CTTAACTAAA GAGGAAGGCG 650 

GACGCCATAC TCCATTCTTT AATAACTATA GACCGCAGTT TTATGTAAGA 700 

ACTACCGACG TTACCGGTTC TATCACTCTT CCGGAAGGAA CAGAGATGGT 750 

20 TATGCCTGGC GATAACTTAA AGATAAGCGT TGAGCTTATC G 791 



2) INFORMATION FOR SEQ ID NO: 1664 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1664 

40 CGACGGACAG ATGCCCCAGA CCAGGGAGCA 
TCGGTGTCAA GCGCATTGTC GTCTTCGTCA 
GACAAGGAGA TGTTGGAGCT CGTCGAGATG 
CAGCTACGGC TTCGAGGGTG ACGACACTCC 
TTTGCGCCAT TGAGGGCCGC GAGCCCGACA 

45 GAGCTCCTCG AGCACGTCGA CACCTGGATC 
CGCCAAGCCT TTCCTCATGT CCGTTGAGGA 
GTGGTACCGT CGCTTCTGGC CGTGTCGAGC 
TCCGAAGTCG AGCTTGTCGG CAAGAACAAG 
TACCGACATC GAGACCTTCA AGAAGTCTTG 

50 ACAACTCCGG TCTCCTTCTC CGTGGTGTCA 
GGCATGGTCG TTGTCCAGCC CGGCACCACC 
TGCCTCCATG TACGTCCTCA CCAAGGAGGA 
TCGCCAACAA CTACAAGCCC CAGATGTTCA 

860 



CTTGCTCCTC 


GCCCGCCAGG 


50 


ACAAGGTCGA 


TGCCATTGAG 


100 


GAGATGCGTG 


AGCTTCTCTC 


150 


CATCGTCATG 


GGTTCCGCCC 


200 


TTGGTGTCGA 


GAAGATTGAC 


250 


CCCACCCCCG 


AGCGTGACAT 


300 


CGTCTTCTCC 


ATTCCCGGCC 


350 


GTGGTGTCCT 


GAAGAAGGAT 


400 


AACCCCATCA 


AGACCAAGGT 


450 


CGACGAGTCC 


CGCGCTGGTG 


500 


AGCGTGACGA 


TGTCCTCCGT 


550 


AAGGCCCACA 


AGAAGTTCCT 


600 


GGGTGGCCGC 


CACACTGGTT 


650 


TCCGTACCGC 


CGATGAGGCC 


700 
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GCCACTCTTA CCTGGCCCGA GGGTACCGAG GAGGACAAGA TGGTCATGCC 
CGGTGACAAT GTCGAGATGA TCTGCGAGAT CCACAAGCCC ATTGCCGTCG 
AGCAAGGCCA 



750 
800 
810 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1665 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases - 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1665 
CAGTACAGGT AGACTTCTG 19 



2) INFORMATION FOR SEQ ID NO: 1666 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1666 

ATGATTGCGA AACCTACTTG CTGTGGAAGA ATTTGGATAT TCTAACATTT 50 

CTCTAGGCCT CAAACCAGAG AGCATCTGCT CCTTGCCCGC CAGGTCGGTG 100 

TTCAGAAGCT CGTCGTTTTC GTTAACAAGG TCGACGCTGT TGAGGACCCA 150 

GAGATGTTGG AACTTGTCGA GCTAGAGATG CGTGAGCTGC TCAGCCACTA 200 

TGGTTTCGAG GGTGAGGAGA CCCCAATCAT TTTTGGCTCT GCTCTCTGCG 250 

CCCTTGAATC TCGACGACCA GAATTGGGTG TTGAGAAGAT CGATGAGCTA 300 

TTGAACGCTG TGGATACCTG GATTCCCACC CCAGAGCGTG CCACTGATAA 350 

GCCTTTCCTT ATGTCCATTG AGGAAGTTTT CTCCATCTCT GGTCGTGGTA 400 

CCGTCGTTTC CGGTCGTGTC GAGCGTGGTA TCCTCAAGAA GGACTCTGAT 450 

GTCGAAATTG TGGGTGGATC TGATACACCC ATCAAGACGA AGGTCACCGA 500 

CATTGAAACC TTCAAGAAGT CTTGTGACGA ATCCCGAGCT GGTGACAACT 550 

CCGGTCTACT TCTCCGAGGT GTCAAGCGTG AGGACTTGAG ACGTGGAATG 600 

GTTGTTGCTG CTCCCGGATC GACCAAGGCT CATACCGACT TCATGGTCTC 650 

CCTTTATGTT CTGACCGAGG CTGAGGGTGG CCGTTCCAAT GGATTCACCC 700 

ACAAGTACCG CCCACAGATG TTCATCCGTA CTGCCGGTAT GTAAACCCTT 750 
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TTTCTACCAT TCACTTTGTT TCACCACTGA CTTGTATACT TTACCGCAGA 
CGAAGCCGCA TCTTTCAGCT GGCCTGGAGA GGATCAAGAC AAGAAGGCCA 
TGCCTGGTGA CAATGTCGAG ATGATTTGCA AGACCCTC 



800 
850 
888 



10 



15 



20 



25 



30 



35 



40 
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2) INFORMATION FOR SEQ ID NO: 1667 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 793 bases " - 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1667 

ATGCCGCAGA CCCGCGAGCA CTTGCTGCTC GCCCGTCAGG TCGGTGTCCA 50 

GAAGATCGTT GTTTTCGTCA ACAAGGTTGA TGCTATCGAC GACCCGGAGA 100 

TGCTGGAGCT TGTCGAGATG GAGATGCGTG AACTTCTCAG CACATACGGT 150 

TTCGAGGGTG ACGAGACCCC TGTTATTATG GGCTCCGCGC TCATGGCTCT 200 

CAACAACCAG CGCCCCGAGA TTGGTCAACA GAAGATTGAT GAACTCATGG 250 

CCGCTGTCGA CGAGTGGATC CCTACTCCCC AGCGTGACCT CGACAAGCCT 300 

TTCCTGATGT CTGTTGAGGA TGTCTTCTCC ATTGCTGGCC GTGGTACCGT 350 

TGTGTCCGGC CGTGTGGAGC GCGGTACCCT CAAGCGTGAT GAGGAAGTCG 400 

AGCTTGTCGG CAAGGGTGTC GACCCCATCA AGACCAAGGT CACCGATATC 450 

GAGACTTTCA AGAAGTCCTG CGAGGAGGCT CAGGCTGGTG ACAACTCTGG 500 

TCTTCTGATC CGTGGTGTCC GCCGCGAGGA TGTTCGTCGC GGTATGGTTG 550 

TCTCCAAGCC CGGCACCGTC AAGTCTCACA CTCAGTTCCT GGCCTCGCTT 600 

TACGTTCTCA CCAAGGAGGA GGGTGGTCGC CACACTGGTT TCGGCGAGCA 650 

CTACCGTCCC CAGCTCTACC TCCGTACCTC AGACGAGTCT GTCGATCTGA 700 

CCTTCCCCGA GGGAACTGAG GATCACCACT CCAAGATCGT CATGCCTGGT 750 

GACAACATCG AGATGGTCGT CACGATGACT CACGCCAACG CTA 793 



2) INFORMATION FOR SEQ ID NO: 1668 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
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(B) STRAIN: K-12 KL1699 

(C) ACCESSION NUMBER: J01717 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1668 

5 

AACATGATCA CCGGTGCTGC GCAGATGGAC GGCGCGATCC TGGTAGTTGC 50 

TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC CTGCTGGGTC 100 

GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA ATGCGACATG 150 

GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG TTCGTGAACT 200 

10 TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC GTTCGTGGTT 250 

CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC GAAAATCCTG 300 

GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG AGCGTGCGAT 350 

TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC ATCTCCGGTC 400 

GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT CAAAGTTGGT 450 

15 GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT CTACCTGTAC 500 

TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT GCTGGTGAGA 550 

ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT CGAACGTGGT 600 

CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA AGTTCGAATC 650 

TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT ACTCCGTTCT 700 

20 TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA CGTGACTGGT 750 

ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG GCGACAACAT 800 

CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC GACGGTCTGC 850 

GTTTCGCAAT CCGTGAAGGC GGCCGTACCG TTGGCGCGGG C 891 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1669 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasiformis 

(B) STRAIN: ATCC 60625 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1669 



TCCTCGTGGT CGCCGCCACC GACGGCCCGA TGCCGCAGAC CAAGGAGCAC 50 

GTGCTCCTGG CCCGCCAGGT CGGCGTTCCG TACATCGTCG TCGCCCTCAA 100 

45 CAAGGCCGAC ATGGTGGACG ACGAGGAGAT CCTGGAGCTC GTCGAGCTCG 150 

AGGTCCGTGA GCTCCTCTCC GAGTACGAGT TCCCGGGCGA CGACGTTCCC 200 

GTCGTCAAGG TCTCCGCTCT GAAGGCCCTC GAGGGCGACA AGGAGTGGGG 250 

CAACTCGGTT CTCGAGCTCA TGAACGCCGT CGACACGGCG ATCCCCGAGC 300 

CCGAGCGTGA CGTCGACAAG CCGTTCCTCA TGCCGATCGA GGACGTCTTC 350 

50 ACGATCACCG GTCGCGGTAC GGTCGTCACC GGCCGTATCG AGCGTGGTGT 400 

CCTGAAGGTC AACGAGACCG TCGACATCAT CGGCATCAAG ACCGAGAAGA 450 

CCACCACCAC GGTCACCGGC ATCGAGATGT TCCGGAAGCT CCTCGACGAG 500 

GGCCAGGCCG GTGAGAACGT CGGTCTGCTC CTCCGTGGCA TCAAGCGCGA 550 

863 
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CGCCACACGC CGTTCTTCAA CAACTACCGC 
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5 TGCCGGGTGA CAACACCGAG ATGAAGGTGG 
ATGGA 
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GCCGGGCTCG GTCACGCCGC 600 
TGTCCAAGGA CGAGGGTGGC 650 
CCGCAGTTCT ACTTCCGTAC 700 
CGAGGGCACC GAGATGGTCA 750 
AGCTCATCCA GCCCGTCGCC 800 

805 



10" 2) INFORMATION FOR SEQ ID "NOT 1670 ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 935 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1670 



GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACA GATGAAAGGA 50 

TTTGACGTTT CTAACATCAG TCTAGGCCTC AGACCAGAGA ACATTTGCTC 100 

CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG TTAACAAGGT 150 

CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTGTCGAA CTTGAAATGC 200 

3 0 GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC CCCCATCATT 250 

TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG AGCTTGGTGT 300 

CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG ATCCCCACCC 350 

CAGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA GGAAGTGTTC 400 

TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG AGCGTGGTAT 450 

35 CCTCAAGAAG GATTCGGACG TCGAAATTGT TGGTGGCTCT ACCACCCCTA 500 

TCAAGACCAA GGTCACCGAT ATCGAAACCT TCAAGAAGTC CTGCGATGAA 550 

TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA TCAAGCGTGA 600 

GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC ACCAAGGCTC 650 

ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC TGAGGGTGGT 700 

40 CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT TCATCCGTAC 750 

TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG ATCATTGCTA 800 

ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC TGGCCTGGAG 850 

AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA GATGATTTGC 900 

AAGACCCTCC ACCCCATTGC TGCCGAGGCT GGCCA 935 

45 



2) INFORMATION FOR SEQ ID NO: 1671 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

864 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter aerogenes 

(B) STRAIN: ATCC 13048 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1671 



ACGATGCCCT TGAGGTACAG AATGGTAATG AGAGCCTGGT GCTGGAAGTT 50 

CAGCAGCAGC TCGGCGGTGG CGTAGTCCGT GCTATCGCCA TGGGTTCTTC 100 

CGACGGTCTG CGTCGTGGTC TGGAAGTTAA AGACCTTGAG CACCCGATCG 150 

AAGTCCCGGT AGGTAAAGCG ACTCTGGGCC GTATCATGAA CGTCCTGGGT 200 

15 CAGCCGATCG ACATGAAAGG CGACATCGGC GAAGAAGAAC GTTGGGCTAT 250 

CCACCGCGCG GCGCCTTCCT ATGAAGAGCT GTCCAGCTCT CAGGAACTGC 300 

TGGAAACCGG CATCAAAGTT ATCGACTTGA TGTGTCCGTT CGCTAAGGGC 350 

GGTAAAGTTG GTCTGTTCGG TGGTGCGGGT GTAGGTAAAA CCGTAAACAT 400 

GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT TACTCCGTGT 4 50 

2 0 TTGCGGGCGT TGGTGAGCGT ACTCGTGAGG GTAACGACTT CTATCACGAA 500 

ATGACCGACT CCAACGTTCT GGATAAAGTA TCCCTGGTTT ACGGCCAGAT 550 

GAACGAGCCG CCGGGAAACC GTCTGCGCGT TGCGCTGACC GGCCTGACCA 600 

TGGCTGAGAA ATTCCGTGAC GAAGGTCGTG ACGTTCTGCT GTTCGTCGAT 650 

AACATCTATC GTTACACCCT GGCCGGTACT GAAGTATCTG CACTGCTGGG 700 

25 CCGTATGCCT TCAGCGGTAG GTTATCAGCC GACTCTGGCG GAAGAGATGG 750 

GCGTTCTGCA GGAACGTATC AC 772 



30 2) INFORMATION FOR SEQ ID NO: 1672 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: Tohama 1 

(C) ACCESSION NUMBER: Genome project 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1672 

ATGAGCAACG GAACCATCGT TCAGTGCATC GGCGCCGTGG TGGATATTCA 50 

GTTCCCCCGC GATAACATGC CCAAGATCTA CGAAGCGCTC ACCCTGGTCG 100 

ACGAGGGTTC CTCGTTCGCC GAGAAGGGCT TGACGCTGGA AGTGCAACAA 150 

50 CAGCTGGGCG ACGGCGTGGT . GCGTACCATC GCGCTGGGTT CCAGCGACGG 200 

GCTGCGCCGC GGCATGCAAG TGGCCGGCAC CGGCGCACCG ATCTCGGTGC 250 

CCGTGGGCCA CGGCACCCTG GGCCGCATCA TGGACGTGCT GGGCCGTCCC 300 

ATCGACGAAG CCGGTCCCAT CGCCTCCGAC GAGAAGCGCG CCATCCACCA 3 50 

865 
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GCCCGCGCCC CGTTTCGACG AGCTGTCGCC GTCGGTCGAG CTGCTGGAAA 400 

CCGGCATCAA GGTTATCGAC CTGGTGTGCC CGTTCGCCAA GGGCGGCAAG 450 

GTCGGCCTGT TCGGCGGCGC CGGCGTGGGC AAGACCGTCA ACATGATGGA 500 

ACTGATCAAC AACATCGCCA AGCAGCACAG CGGCTTGTCG GTGTTCGCCG 550 

5 GCGTGGGCGA GCGTACCCGC GAAGGCAACG ACTTCTACCA CGAAATGGAA 600 

GAGTCGAACG TTCTGGACAA GGTGGCCATG GTGTTCGGCC AGATGAACGA 650 

GCCCCCGGGC AACCGCCTGC . GCGTGGCGCT GACCGGCCTG ACCATGGCCG 700 

AGAAGTTCCG CGACGAAGGC CGTGACATCC TGTTCTTCGT CGACAACATC 750 

TACCGCTACA CCCTGGCCGG TACCGAAGTG TCGGCGCTGC TGGGCCGTAT 800 

10 GCCGTCGGCG GTGGGCTACC AGCCTACGCT GGCCGAGGAA ATGGGCGTGC -850 

TGCAAGAGCG CATCACCTCG ACCAAGACCG GTTCGATCAC CTCGATCCAG 900 

GCCGTGTACG TGCCTGCCGA CGACTTGACC GACCCGTCGC CCGCCACGAC 950 

CTTCCAGCAC TTGGACTCGA CCGTCGTGCT, GTCGCGTGAC ATCGCTGCGC 1000 

TGGGCATCTA TCCCGCCGTG GACCCGCTGG ATTCCTCCAG CCGCCAGCTC 1050 

15 GACCCGCAAG TCGTGGGCGA AGAGCACTAC CAGGTGGCCC GTGGCGTGCA 1100 

GCAGACGCTG CAGCGCTACA AGGAACTGCG CGACATCATC GCGATTCTGG 1150 

GCATGGACGA ACTGTCGCCG GAAGACAAGC AGGCCGTGGC CCGCGCGCGC 1200 

AAGATCCAGC GCTTCCTGTC GCAGCCCTTC TACGTGGCCG AAGTGTTCAC 1250 

CGGCTCGCCG GGCAAGTACG TGTCGCTGGC CGAAACGATC CGTGGCTTCA 1300 

20 AGATGATCGT CGACGGCGAG TGCGACGCGC TGCCCGAGCA GGCGTTCTAC 1350 

ATGGTCGGCA CGATCGACGA GGCCTTCGAG AAGGCCAAGA AACTCCAATA 1400 

A 1401 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1673 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 97 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 

(B) STRAIN: ATCC 9345 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1673 

CAGCTACCGA CGGTCCAATG GCTCAGACCC GCGAGCACGT TCTTCTTGCT 50 

CGCCAGGTTG GCGTTCCACA GATCATCGTT GCTCTCAACA AGGCTGACAT 100 

GGTTGACGAC GAGGAAATCC TCGAACTCGT CGAAATGGAA GTTCGTGAGC 150 

45 TTCTCTCTTC CCAGGAGTAC CCAGGTGACG ACCTCCCAGT CGTCAAGATC 200 

TCGGCACTCA AGGCTCTCGA AGGCGATGCC GAATGGAGCA AGGCAATCGA 250 

AGATCTCATG GAAGCTGTCG ATACCTACTT CGACGATCCA GTGCGTGACC 300 

TCGATAAGCC ATTCCTCATG CCAATCGAAG ACGTCTTCAC CATCACCGGT 350 

CGTGGCACCG TTGTTACCGG CCGTGCAGAG CGCGGTATGC TCAACTTGAA 400 

50 CGAAGAAGTT GAAATCCTCG GTATCCGTGC ACCACAGAAG ACAACCGTTA 4 50 

CCGGTATCGA AATGTTCCAC AAGTCGATGG ATCACGCAGA TGCAGGCGAA 500 

AACTGTGGTC TTCTCCTCCG TGGCACCAAG CGCGAAGATG TTGAACGTGG 550 

TCAGGTTGTT GCCAAGCCAG GCACCATCAC CCCACACACC AACTTCGAAG 600 

866 
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CTCAGGTCTA CGTGCTCGGT AAGGAAGAAG GTGGCCGTCA CAACCCATTC 
TTCTCCAACT ACCGTCCACA GTTCTACTTC CGTACCACGG ATGTTACCGG 
CGTGATCACC CTTCCAGAGG GCACCGAAAT GGTTATGCCA GGCGACAACA 
CCGACATGAC AGTTGAGCTC ATCCAGCCAA TCGCTATGGA AGAGGGC 
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2) INFORMATION FOR SEQ ID NO: 1674 

~(i) SEQUENCE CHARACTERISTICS: ------ 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:- Genomic DNA. 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Butyrivibrio fibrisolvens 

(B) STRAIN: ATCC 19171 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1674 

CTGATGGTCC TATGCCACAG ACCCGTGAGC ACATCCTATT AGCACGTCAG 50 

GTAGGCGTAC CATACATCAT CGTATTCCTA AACAAGTGCG ATATGGTTGA 100 

CGACGAGGAA TTATTAGAGT TAGTTGAGAT GGACGTACGT GATCTATTAA 150 

ATCAGTACCA GTTCCCAGGC GACGACACTC CAATCATCCG TGGTTCAGCA 200 

CTAGGTGCAT TAAACGGCGA AGAGAAGTGG AAAGAGGCAA TCTATCAGTT 250 

AGCAGACACT CTAGATTCAT ACATTCCAGA GCCAAAGCGT GATATCGATG 300 

ATCCATTCCT ATTACCAATC GAAGATATCT TCTCAATCTC AGGTCGTGGT 350 

ACTGTAGTAA CCGGCCGTGT AGAGCGTGGT ATTGTACACG TAGGTGACGA 400 

AGTTGAAATC GTTGGTATTC GTCCAACCAC CAAGACCACT GTAACTGGCG 450 

TTGAAATGTT CCGTAAGTTA CTAGACGAAG GTCGTGCAGG TGATAACGTT 500 

GGTGTTCTAC TACGTGGTAC CAAGCGTGAT GAGGTTGAGC GTGGTCAGGT 550 

TCTAGCTGCT CCAGGCACAA TCACTCCACA CACCAAGTTC ACTGGTCAGG 600 

TTTACGTACT AAGCAAGGAT GAAGGTGGTC GTCACACTCC ATTCTTCAAG 650 

GGCTACCGTC CACAGTTCTT CTTCCGTACA ACCGATATTA CCGGTTCTAT 700 

CGATCTGAAA GAGGGCGTAG AGATGGTAAT GCCAGGTGAT AACACCGACA 750 

TGACCGTAAC CCTAATCCAC CCAGTAGCTA TGGCT 785 



2) INFORMATION FOR SEQ ID NO: 1675 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN : ATCC 49349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1675 

5 

TGGGGCGATC TTGGTTGTTT CTGCTGCAGA TGGTCCTATG CCACAAACTA 50 

GAGAGCACAT TCTTCTTTCT CGTCAAGTAG GCGTTCCATA TATTGTTGTT 100 

TTTATGAATA AAGCAGATAT GGTTGATGAT GCTGAGCTTT TAGAGTTAGT 150 

TGAAATGGAA ATTAGAGAAT TATTAAGCTC TTATGATTTC CCAGGCGATG 200 

10 ATACACCTAT TATTTCTGGT" TCTGCTTTAA AAGCTCTTGA AGAAGCTAAA - 250 

GCTGGACAAG ATGGTGAATG GTCAGCAAAA ATTATGGATC TTATGGCTGC 300 

AGTTGATAGT TATATTCCAA CTCCAACTCG TGATACTGAA AAAGACTTCT 350 

TGATGCCAAT TGAAGACGTT TTCTCAATTT , CAGGTCGTGG TACTGTTGTT 400 

ACAGGTAGAA TTGAAAAAGG TGTTGTAAAA GTAGGTGATA CTATCGAAAT 450 

15 CGTTGGTATT AAAGACACTC AAACAACAAC TGTAACAGGT GTTGAAATGT 500 

TCAGAAAAGA AATGGATCAA GGCGAGGCAG GAGATAACGT AGGTGTTCTT 550 

CTTCGTGGTA CTAAAAAAGA AGAAGTTATT CGCGGTATGG TTCTTGCTAA 600 

ACCAAAATCA ATTACTCCAC ACACTGACTT CGAAGCTGAA GTTTATATCT 650 

TAAATAAAGA TGAAGGTGGT AGACATACTC CATTCTTTAA CAACTATAGA 700 

20 CCACAGTTTT ATGTAAGAAC AACTGATGTT ACAGGTTCGA TTAAATTAGC 750 

TGATGGTGTT GAAATGGTTA TGCCAGGTGA AAATGTGAGA ATTACTGTAA 800 

GCTTGATCGC TCCAGTAGCA CTTGAAGAAG GAACT 835 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1676 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1676 

TTCTGCAGCA GACGGCCCTA TGCCACAAAC TAGAGAGCAT ATCTTACTTT 50 

CTCGTCAAGT AGGTGTACCA TATATTGTTG TTTTCATGAA CAAAGCTGAT 100 

ATGGTTGATG ATGCAGAATT ATTAGAATTA GTTGAAATGG AAATTAGAGA 150 

4 5 ATTACTAAGC TCTTATGATT TCCCAGGAGA TGATACTCCA ATTATTTCAG 2 00 

GTTCAGCATT ACAAGCTCTT GAAGAAGCAA AAGCTGGTCA AGATGGTGAA 250 

TGGTCTAAAA AAATCTTAGA TCTTATGGCT GCAGTTGATG ATTATATTCC 300 

AACTCCGGCT CGTGATACAG ATAAAGATTT CTTGATGCCA ATCGAAGATG 350 

TTTTCTCAAT CTCAGGTCGT GGAACTGTTG TTACCGGTAG AATTGAAAAA 4 00 

50 GGTGTTGTAA AAGTTGGTGA TACTATAGAA ATCGTTGGTA TTAGAGACAC 450 

TCAAACAACC ACAGTTACTG GTGTTGAAAT GTTTAGAAAA GAAATGGATC 500 

AAGGTGAAGC TGGTGATAAT GTTGGTGTAT TACTTCGTGG AACTAAAAAA 550 

GAAGATGTTG AACGTGGTAT GGTTCTTGCT AAACCAAAAT CAATCACTCC 600 

868 
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ACATACTGAT TTTGAAGCAG AAGTTTATAT 
GTCGTCATAC TCCATTCTTT AATAATTATA 
ACAACTGATG TTACAGGTGC TATTAAACTT 
TATGCCAGGC GATAATGATA GAATTACTGT 
5 CACTTGAGGA AG 



PCT/CA00/01150 

CTTAAATAAA GATGAAGGTG 650 
GACCGCAATT TTATGTAAGA 700 
GCAGAAGGCG TTGAGATGGT 750 
AAGTCTTATT GCTCCAGTTG 800 

812 



2) INFORMATION FOR SEQ ID NO: 1677 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Campylobacter sputorum subsp. sputorum 

(B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1677 

25 GCTATTCTTG TATGTTCAGC TGCAGATGGT CCAATGCCAC AGACTAGAGA 50 

GCATATTCTA CTATCAAGAC AAGTTGGTGT TCCATACATA GTTGTTTTCT 100 

TAAATAAAGA AGATATGGTT GATGATGCTG AGCTTATAGA GTTGGTTGAA 150 

GTTGAGGTTA GAGATTTATT AAATGAATAT GATTTCCCTG GAGATGATAC 2 00 

TCCAATCGTA ATAGGTTCTG CTCTTAAAGC TTTAGAAGAA GCAAAAGCTG 250 

30 GAACAGAGGG TGAATGGTCT GCTAAAATTA TGAAACTTAT GGATGCTGTT 300 

GATAGCTATA TCCCAACTCC AACAAGAGAT ACAGATAAAG ATTTCCTTAT 350 

GCCAATCGAA GATATCTTCT CAATTTCTGG TCGTGGTACA GTTGTAACAG 400 

GTAGAATTGA AAAAGGTGTA GTAAAAGTTG GCGAGACTAT TGAGATAGTT 4 50 

GGTATTAGAC CTACTCAAAC AACAACAGTT ACTGGTGTTG AAATGTTTAG 500 

35 AAAAGAGCTA GATCAAGGTG AAGCTGGAGA TAATGTTGGT ATCTTGTTAA 550 

GAGGTACAAA AAAAGAAGAT GTTGAAAGAG GTATGGTTTT ATGTAAACCA 600 

AAATCAATCA CTCCTCACAA GAAATTTGAA GGCGAAGTTT ATATTCTTAC 650 

AAAAGATGAA GGTGGTAGAC ATACTCCTTT CTTTAGTAAC TATAGACCAC 700 

AATTTTATGT TAGAACAACA GATGTAACAG GTTCTATATC TCTTCCTGAG 750 

40 GGAACAGAGA TGGATATGCC TGGTGATAAT GTAAAACTTA CAGTTGAACT 800 

TATAAACCCA ATTGCTCTTG AGCAAGGA 828 



45 2) INFORMATION FOR SEQ ID NO: 1678 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

869 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter upsaliensis 

(B) STRAIN: ATCC 49815 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1678 

CGATTTTGGT TGTTTCTGCT GCTGATGGTC CTATGCCACA AACTAGAGAG 50 

CACATTTTGC TTTCTCGTCA AGTGGGTGTT CCTTATATCG TAGTTTTTAT 100 

10 GAATAAGGCT GATATGGTTG ATGATGCAGA GCTTT-TAGAA TTGGTTGAAA 150 

TGGAAATTAG AGAACTTTTA AGCTCTTATG ATTTCCCGGG CGATGACACT 2 00 

CCTATCATTT CAGGCTCTGC TCTTCAAGCC TTAGAAGAGG CTAAGGCGGG 250 

ACAAGATGGC GAGTGGTCAG CTAAGATTTT, AGAGCTTATG AAGGCAGTTG 300 

ATGAGTATAT CCCAACTCCT GTTCGCGATA CTGAAAAAGA TTTCTTGATG 350 

15 CCTATTGAAG ATGTTTTTTC AATTTCTGGT CGTGGAACTG TTGTAACAGG 4 00 

TAGAATTGAA AAAGGTGTGG TTAAAGTCGG CGATACTATT GAGATAGTAG 4 50 

GTATCAAAGA TACTCAAACT ACAACAGTTA CAGGCGTTGA GATGTTTAGA 500 

AAAGAAATGG ATCAAGGTGA GGCTGGCGAT AATGTCGGTG TGCTTTTAAG 550 

AGGAACAAAA AAAGAAGATG TTCTTCGTGG TATGGTTCTT GCAAAGCCTA 600 

20 AATCTATCAC TCCTCATACT GATTTTGAAG CAGAAGTTTA TATTCTAAAT 650 

AAAGATGAGG GCGGTCGCCA TACTCCTTTC TTTAACAATT ATCGTCCGCA 700 

GTTTTATGTA AGAACGACTG ATGTAACTGG TTCTATTAAA TTAGCTGATG 750 

GTGTTGAGAT GGTTATGCCG GGTGAAAATG TAAGAATTAC AGTTAGCCTT 800 

ATCGCTCCAG TTGCACTTGA 820 

25 



2) INFORMATION FOR SEQ ID NO: 1679 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 
40 (B) STRAIN: ATCC 51173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1679 

CTGCAGCTGA TGGTCCAATG CCTCAAACAC GTGAACATAT CTTATTATCA 50 

45 CGTCAAGTAG GTGTTCCTTA CATGGTTGTC TTCTTAAACA AAGTTGACAT 100 

GGTTGACGAT GAAGAATTAT TAGAATTAGT TGAAATGGAA GTTCGTGATT 150 

TATTATCTGA ATACGAATTC CCTGGAGACG ACGTTCCAGT AATCGCTGGT 200 

TCAGCTTTAA AAGCTTTAGA AGGCGAAGAA CAATATGAAG CAAAAGTATT 250 

AGAATTAATG GAAGCTGTAG ATACATACAT TCCAGAACCA GTTCGTGATA 300 

50 CTGAAAAACC ATTCATGATG CCAGTTGAAG ATGTGTTCTC AATCACAGGT 350 

CGTGGTACAG TTGCTACTGG ACGTGTTGAA CGTGGACAAG TTAAAGTTGG 400 

TGACGAAGTT GAAATCGTTG GTATTGCTGA AGAAATTAGC AAAACAACTG 450 

TAACTGGTGT TGAAATGTTC CGTAAATTAT TAGATTACGC TGAAGCTGGA 500 

870 
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GATAACATTG GTGCGTTATT ACGTGGTGTT 
TGGTCAAGTA TTAGCAAAAC CAGGAACAAT 
AGGCGGAAGT TTACGTATTA TCAAAAGAAG 
TTCTTCGCTA ACTACCGTCC TCAATTCTAC 
5 AGGTGTTGTA GAATTACCAG AAGGTACAGA 
ACGTATCAAT GACAGTTGAA TTAATTCACC 



PCT/CA00/01150 

ACACGTGAAC AAATCCAACG 550 

TACACCTCAT ACTAAATTCG 600 

AAGGTGGACG TCATACTCCA 650 

TTCCGTACAA CTGACGTTAC 700 

AATGGTAATG CCTGGAGATA 750 

CAA 783 



10 2) INFORMATION FOR SEQ ID NO:" 1680 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: ' Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 



25 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1680 



GCTATCTTAG TTGTTGCTGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

ACACATTTTG CTTGCTCGTC AAGTTGGTGT TAACTACATC GTAGTATTCT 100 

TGAACAAGTG CGATTTAGTT GACGACCCAG AATTGATCGA CTTGGTTGAA 150 

ATGGAAGTTC GTGACTTGTT GACTGAATAC GATTACCCTG GTGATGATAT 200 

30 TCCAGTTGTT CGTGGTTCAG CATTAAAGGC TTTACAAGGT GACAAGGAAG 250 

CTCAAGACCA AATCATGAAG TTGATGGACA TTGTTGATGA ATACATCCCA 300 

ACTCCAGAAC GTCAAACTGA CAAGCCATTC TTGATGCCAG TTGAAGACGT 350 

ATTCACTATC ACTGGTCGTG GTACTGTTGC TTCAGGTCGT ATCGACCGTG 400 

GTACTGTTAA GGTCGGTGAC GAAGTTGAAA TCGTTGGTTT GGTAGATAAA 450 

35 GTTCTTAAGT CAGTTGTTAC TGGTTTGGAA ATGTTCCACA AGACTTTGGA 500 

CTTAGGTGAA GCCGGCGATA ACGTTGGTGT ATTGCTTCGT GGTGTTGACC 550 

GTGATCAAGT TGTTCGTGGT CAAGTATTGG CTGCACCCGG CTCAATCCAA 600 

ACTCATAAGA AGTTTAAGGC ACAAGTTTAT GTTTTGAAGA AGGACGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCTCAGACTA CCGTCCACAA TTCTACTTCC 700 

40 ACACCACTGA TATTACTGGT GAAATTGAAT TGCCAGAAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACAC TGAATTCACT GTTACTTTGA TCAAGCCAGC 800 

TGCCATCGAA AAGGGTACTA AGT 823 



2) INFORMATION FOR SEQ ID NO: 1681 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 793 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



871 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leuconostoc mesenteroides subsp. 
5 dextranicum 

(B) STRAIN: ATCC 19255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1681 

10- GCTGCAACTG ATGGTCCTAT GCCACAAACA CGTGAAGACA -TCTTGTTGGC 50 

ACGTCAAGTT GGTGTTGACT ACTTGGTTGT CTTCTTGAAC AAGACTGATT 100 

TGGTTGATGA TGAAGAATTA GTTGAATTGG TTGAAATGGA AGTTCGTGAA 150 

TTGTTGTCAG AATATGACTT CCCAGGTGAC GATATTCCTG TACTTAAGGG 2 00 

TTCAGCTTTG AAGGCTTTGG AAGGTGATCC TGAACAAGTT AAGGTTATCG 250 

15 AAGAATTGAT GGATACTGTT GATTCATACA TTCCAGAACC AGCACGTGAA 3 00 

ACAGACAAGC CATTCTTGAT GCCTGTCGAA GACGTCTTCA CAATCACTGG 3 50 

TCGTGGTACA GTTGCTTCTG GTCGTGTTGA CCGTGGTGTA TTGACTACAG 4 00 

GAACTGAAAT TGAAATCGTT GGTTTGAAGG ACGAAGTTCA AAAGACTACT 4 50 

GTTACAGGTA TCGAAATGTT CCGTAAGACT TTGGAAGAAG CTCAAGCGGG 500 

20 TGATAACATT GGTGCATTGT TGCGTGGTGT TGATCGTAGC AACATTGAAC 550 

GTGGTCAAGT TTTGGCAAAG CCAGGTTCAA TTAAGACACA CAAGAAGTTC 600 

AAGGCTGAAG TTTATGTCTT GACAAAGGAA GAAGGTGGTC GTCATACACC 650 

ATTCTTCACT AACTACCGTC CACAATTCTA CTTCCACACA ACTGATGTTA 700 

CAGGTGTTGT TGAATTGCCA GCCGGTGTTG AAATGGTTAT GCCTGGTGAC 750 

25 CAAGTGACAT TCGAAATCGA ATTGATCTCA CCAGTTGCCA TCG 793 



2) INFORMATION FOR SEQ ID NO: 1682 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1682 

45 TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 200 

TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

50 GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 3 00 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 350 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 400 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 4 50 

872 
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GTCTGTTGTA ACAGGCGTTG 
AAGCAGGTGA TAACGTAGGA 
GTAAAGCGTG GTATGGTTGT 
TCACTTCAAG GCTTCCATCT 
ATACTCCATT CGGAAACAAG 
GACTGTACAG GTGAGATCAC 
TGGTGACAAC GTAGAGATTG 



AAATGTTCCG TAAGATTCTT 
CTTTTGCTCC GCGGTATCGA 
TGTACACCCA GGTGCCATCA 
ATGTATTGAA GAAGGAAGAG 
TATCGTCCTC AGTTCTATCT 
TTTGCCAGAA GGCGTAGAGA 
AGGTTACCTT GATTTACAAG 



TCTGAAGGTG 
TAAGGATGAA 
CTCCTCACGA 
GGTGGACGTC 
CCGTACAATG 
TGGTGATGCC 
GTTGCC 
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2) INFORMATION FOR SEQ ID NO: 1683 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ruminococcus bromii 

(B) STRAIN: ATCC 27255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1683 

GGTTGCTGCT ACTGACGGCC CGATGCCTCA GACTCGTGAG CACGTTCTGC 50 

TCGCTCGTCA GGTGGGTGTG CCCGCCATCG TCGTCGCCCT CAACAAGTGC 100 

GACATGGTTG ACGATGAGGA GCTCATTGAG CTTGTCGAGA TGGAGGTCCG 150 

CGAGCTGCTG ACCTCGCAGG AGTTCGACGG CGACAACTGC CCTGTCGTTC 200 

GCATCTCCGC CTTCCAGGCC CTCCAGGGTG ACGAGAAGTG GACCCAGTCG 250 

ATCCTCGACC TCATGGACGC CGTGGACGAG TACATCCCGC AGCCTGAGCG 300 

CGATCTCGAC AAGCCCTTCC TTATGCCGAT CGAGGACGTC TTCACCATCA 350 

CCGGCCGTGG CACCGTTGTC ACCGGTCGTG TCGAGCGTGG TGTCGTCAAG 400 

ACTGGCGAAG AGGTCGAGAT CGTCGGTATC CACGAGAAGA CCCAGAAGAC 450 

CACCGTTACC GGTGTCGAGA TGTTCCGTAA GATCCTCGAC GAGGGCCGCG 500 

CTGGTGAGAA CGTCGGCGTT CTGCTCCGTG GCACCAAGAA GGAGGACGTC 550 

GTTCGCGGCA TGGTCCTCTC CAAGCCTGGT TCCACCACCC CCCACACCGA 600 

CTTCGAGGGC CAGGTCTACG TCCTCAAGAA GGATGAGGGT GGCCGCCACA 650 

AGCCGTTCTT CTCCCATTAC AGCCCCCAGT TCTACTTCCG TACCACGGAC 700 

GTGACTGGCA CTGTTGAGCT CCCCGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACACC GACATGACTG TGCACCTGAT TCACCCGGTT GCCATGGAGG 800 



2) INFORMATION FOR SEQ ID NO: 1684 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
5 (B) STRAIN: ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1684 

TGTCTTTATT CAGGAACTGA TTGTATGTTT CTTCTCGTTT ATATATAACA 50 
10 TACCTTCTAT ATTTCATGTG TTTCTAACGA -ACTCATAGAA CAACATTGCC . _ 100 

AAGGCTCACG GTGGTTACTC CGTGTTCACC GGTGTCGGTG AGCGTACCCG 150 

TGAAGGAAAC GATCTGTACC ACGAAATGCA GGAAACCCGC GTCATCCAGC 200 

TGGACGGCGA GTCCAAAGTC GCCCTCGTCT TCGGCCAGAT GAACGAGCCC 250 

CCCGGAGCCC GTGCCCGTGT TGCCCTGACC GGTCTGACCA TCGCTGAATA 300 

15 CTTCCGTGAC GAAGAAGGCC AAGATGGTAC GTTCCCCCAT TCCATATATG 350 

TTTCTTGTGC GCTTTGCCAA CTAAACACCA CCTAGTGCTC CTCTTCATCG 4 00 

ACAATATCTT CCGCTTCACC CAAGCCGGTT CCGAAGTGTC CGCCCTGCTA 4 50 

GGCCGCATCC CCTCCGCCGT CGGCTATCAA CCCACCCTCG CCGTCGACAT 500 

GGGTGGTATG CAGGAGCGCA TCACAACTAC AACAAAAGGC TCCAT 545 

20 



2) INFORMATION FOR SEQ ID NO: 1685 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 
35 (B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1685 

GATATCGCTT TATGGAAATT CGAAACTCCA AAATTCCACG TTACYGTTAT 50 

40 CGATGCTCCA GGTCACAGAG ATTTCATCAA GAATATGATT ACYGGTACCT 100 

CCCAAGCTGA TTGTGCTATT TTAATCATTG CTGGTGGTGT TGGTGAATTC 150 

GAAGCTGGTA TCTCCAAAGA TGGTCAAACC AGAGAACACG CTTTGTTAGC 200 

TTTCACCTTA GGTGTYAAAC AATTGATTGT TGCCGTTAAC AAAATGGACT 250 

CTGTCAAATG GGATCAATCC CGTTTCGAAG AAATCGTCAA GGAAGCTTCC 300 

45 GGTTTCATCA AGAAAGTTGG TTACAACCCA AAGACTGTTC CATTCGTTCC 350 

AATCTCTGGT TGGAATGGTG ACAACATGAT TGAAGTYTCW GCTAACGCYC 400 

CATGGTACAA AGGTTGGGAA AAGGAAACCA AAGCYGGTGT CGTTAAAGGT 450 

AAAACTTTAT TAGAAGCCAT TGATGCTATT GAACCACCTT CAAGACCAAC 500 

TGAAAAACCA TTGAGATTGC CATTGCAAGA TGTCTACAAG ATTGGTGGTA 550 

50 TCGGAACCGT ACCAGTCGGT ARAGTTGAAA CCGGTGTCAT TAAACCAGGT 600 

ATGATTGTTA CTTTCGCCCC AGCCGGTGTT ACTACTGAAG TCAAATCTGT 650 

TGAAATGCAT CACGAACAAT TAGAAGCTGG TTACCCAGGT GACAATGTTG 700 

GTTTCAACGT CAAGAATGTT TCAGTTAAAG AAATCAGAAG AGGTAAHGTT 750 

874 
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GCTGGTGACT CCAAGAACGA TCCACCAAAA 

TCAAGTTATT GTCTTGAACC ATCCAGGTCA 

CAGTTTTGGA TTGTCACACT GCCCACATTG 

TTGGAAAAGA TTGACAGAAG ATCCGGTAAG 

5 ATTCATCAAA TCTGGTGACG CTGCTAWTGT 

CATTRTGTGT TGAAGCTTTC 



PCT/CA00/01150 

GGTGCTGAAT CTTTCAACGC 800 
AATCTYTGCT GGTTACTYTC 850 
CTTGTAAATT CGATGAAATC 900 
AAATTGGAAG AAAATCCAAA 950 
CAAATTTGTT CCATWTAAAC 1000 

1020 



"10 2) INFORMATION FOR SEQ ID NO: 1686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 929 bases 

(B) TYPE:' Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 



25 



30 



35 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1686 



TGCTGCTTCC GATGGTCAAA TGTACGATTG ATATTCCTTC 
GGATAACAGC TGATACCAGT TGCAAATAGG CCCCAGACTC 
GTTGCTTGCC CGTCAGGTTG GTGTCCAGAA GATCGTTGTC 
AGGTTGACGC TGTCGATGAC CCTGAGATGT TGGAGCTTGT 
ATGCGTGAGC TCCTCAACAC TTACGGTTTC GAGGGAGAGG 
CATCTTCGGT TCCGCCCTGT GCGCTCTCGA AGGCCGCCGC 
GTACTCAGCG TATTGACTCC CTCCTCGAGG CCGTTGACAC 
ACCCCCCAGC GTGACTTGGA CAAGCCCTTC CTGATGTCCA 
TTTCTCCATT GGTGGTCGTG GTACCGTCGC CTCTGGTCGT 
GTCTCCTCAA GAAGGATACC GAAGTTGAAA TTCACGGTGC 
CTGAAGACCA AGGTCACCGA CATTGAGACC TTCAAGAAGA 
GTCTCGTGCT GGTGACAACT CCGGTCTTCT CCTCCGTGGT 
AGGATGTTCG TCGTGGTATG GTCATCGCTG CCCCTGGCTC 
TCCAAGAAGT TCATGGTCTC CATGTACGTC TTGACTGAGG 
CCGCAAGAAC GGCTTCGGTG CCAACTACCG CCCCCAGGCT 
CTGCTGGTAA GTTTCGAACT ATTTGATTCA TTGATCACGT 
TACTTTAGAC GAGGCTTGCG ACCTTCATTT CCCTGATGAG 
ACCGCCACGT CATGCCCGGT GACAACGTCG AAATGGTCCT 
AACCCCGTTG CTGCTGAGGC TGGACAGCG 



CAGCCAGTCA 50 

GTGAGCACTT 100 

TTCGTCAACA 150 

TGAGCTCGAG 200 

AGACCCCTAT 250 

GAGGACATTG 300 

TTGGATCCCT 350 

TTGAGGAAGT 400 

GTCGAGCGTG 450 

TGATGGTATT 500 

GCTGCGATGA 550 

ATCCGTCGTG 600 

CATCAAGGCC 650 

CTGAAGGTGG 700 

TTCATCCGCA 750 

CCCTAACTGT 800 

GCCGACAAGG 850 

CAACCTCAAC 900 
929 



45 



2) INFORMATION FOR SEQ ID NO: 1687 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

875 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 

(B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1687 



TGCCGCTTCC GATGGTCAGA TGTACGCTCA AGCCCCAGTT TCCATATAAA 50 

CATAAACGAT CTATCATCAG CACAACGCTG ACTTCTTCGC TTCCAGGCCC 100 

CAGACCCGTG AGCACTTGCT GTTGGCCCGT CAGGTCGGTG TCCAGAAGAT 150 

CGTGGTCTTC GTCAACAAGG TCGATGCCGT TGATGACCCG GAGATGTTGG 200 

15 AGCTCGTTGA GCTGGAAATG CGCGAGCTCC TGACCAGCTA CGGATTCGAG 250 

GGTGAAGAGA CCCCCATCAT CTTCGGTTCT GCTCTCTGCG CTCTTGAGGG 300 

CCGCCGTCCT GAGATTGGTA CTGAGAAGAT TGACGAGCTG ATGCACGCCG 350 

TCGACACCTG GATCCCCACC CCCCAGCGTG ACCTCGACAA GCCCTTCCTG 400 

ATGTCCGTCG AGGAAGTCTT CTCCATTGCT GGTCGTGGTA CCGTCGCTTC 450 

20 CGGCCGTGTC GAGCGTGGTA TTCTGAAGAA GGATAGCGAA GTCGAGATCA 500 

TCGGTGGTGC TTTCGACGCC ACGAAGACCA AGGTCACTGA CATCGAGACC 550 

TTCAAGAAGT CTTGCGACGA GTCTCGCGCT GGTGACAACT CTGGTCTCCT 600 

CCTCCGTGGT ATCCGTCGTG AGGATGTTCG GCGTGGTATG GTCATTGCTG 650 

CTCCTGGCAG CACCAAGGCC CACGACAAGT TCCTTGTCTC TATGTACGTC 700 

25 CTCACTGAGG CTGAGGGTGG CCGTCGTACC GGATTCGGTA CCAACTACCG 750 

CCCCCAGGTC TTCATCCGTA CTGCCGGTAA GTGTTCCTGG AAGAGGCTTT 800 

GAGCCTATAT AGGATCTCGG ATAATTTACT AATCCACCAT ATAGATGAGG 850 

CCGCTGACCT CAGCTTCCCC GACAACGATG ACTCCCGCCG TGTCATGCCC 900 

GGTGACAACG TTGAGATGGT CCTGAAGACC CACCGCCCCG TGGCTGCTGA 950 

30 G 951 



2) INFORMATION FOR SEQ ID NO: 1688 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1688 

50 CGTTGCCGCT ACCGATGGTC AAATGCCTCA AACTAGAGAA CATTTGCTAT 50 

TGGCTAGACA GGTTGGTGTT CAACACATTG TCGTGTTTGT TAACAAGGTT 100 

GACACTATTG ATGATCCAGA AATGTTGGAA TTGGTTGAAA TGGAAATGAG 150 

AGAGTTGATT GCCACTTATG GTTTCGATGG TGATAACACC CCAGTTATCA 200 

876 
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TGGGTTCTGC TCTATGTGCT TTGGAAGGTC GTGAACCTGA AATCGGTGCT 250 

CAATCAATCG ACAGATTGTT GGAAGCCGTT GATGAATACA TTCCAACTCC 300 

AACTAGAGAT TTGGAAAAAC CATTCTTGAT GGGTGTTGAA GATGTCTTCT 350 

CCATTTCTGG TAGAGGTACC GTCTGTACCG GTCGTGTTGA AAGAGGTAAC 400 

5 TTGAAGAAAG GTGATGAAAT CGAAATTGTC GGCTACAACA AGACTCCAAT 4 50 

CAAAACCACC GTCACCGGTA TTGAGATGTT CAAAAAGGAA TTAGACCAAG 500 

CTATGGCTGG TGATAACTGT GGTATCTTAT TACGTGGTGT TAAGAGAGAT 550 

GATATCAAGA GAGGTATGGT TATCTCTAAA GTCAACACCG TTTCCGCACA 600 

CACCAAATTC TTGGCCTCTT TATACGTCTT GACTAAAGAA GAAGGTGGTC 650 

10 GTCATTCAGG TTTTGCTGAA AACTACAGAC CTCAATTGTT CATCAGAACC - 700 

GGTGATGTCA CTGTTACTTT AACCTTCCCA GAAGATGCTG ATCACTCTCA 750 

GCAAGTCTTA CCAGGTGACA ACGTTGAAAT GGAATGTACC TTGGTTCATC 800 

CAACTGCTCT TGAAACCGGT CAA , 823 

15 

2) INFORMATION FOR SEQ ID NO: 1689 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 201076 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1689 



GCTGCTACTG ACGGTCAAAT GCCTCAAACT AGGGAACATA TGTTGTTGGC 50 

GAGACAAGTT GGTATCCAAA ACTTGGTTGT TTTTGTTAAC AAAGTTGATA 100 

35 CCATTGATGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGGGAA 150 

TTATTGAGCT CTTATGGGTT TGATGGTGAA AACACTCCAG TTATCATGGG 200 

ATCAGCCTTG TGTGCTTTAG AAGGTAAACA ACCAGAAATC GGTGTTCAAG 250 

CCATTCAAAA ATTATTGGAT GCTGTTGATG AATATATTCC AACTCCAGAA 300 

AGAGATGCTG ACCAACCATT TTTGATGCCA GTGGAAGATG TGTTTTCTAT 350 

4 0 TTCAGGTAGA GGAACCGTTG TCACCGGAAG AGTTGAAAGA GGTATGTTGA 400 

AGAAAGGTGA AGAAGTAKAA GTCATTGGTG AAAACTCATT TAAGGCTACT 4 50 

TCCACGGGTA TTGAGATGTT CAAAAAGGAA TTGGATGCCG CTATGGCCGG 500 

TGACAACTGT GGTATTTTGT TGAGAGGTGT CAAGAGAGAC GAAGTCAAGA 550 

GGGGTATGGT TTTGGCCAAA CCAGGTACCA CCACCCCACA CCAAAAGTTT 600 

45 TTGGCTTCCA TTTATATCTT GACTGCTGAA GAAGGTGGAC GTAGTACCCC 650 

TTTCAGTGAA GGATACAAAC CACAATGTTT CTTTAGAACT AGTGATGTTA 700 

CCACGACATT TACTTTCCCA GAAGGTGAAG GTGTTGACCA CTCACAAATG 750 

GTTATGCCAG GAGRCAATGT TGAAATGGTG GGAACTTTAA TCAAGAAAGC 800 

TCC 803 

50 



2) INFORMATION FOR SEQ ID NO: 1690 

877 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1690 " 
CAGGTCCTGT TGCGACTGAA GAA 



2) INFORMATION FOR SEQ ID NO: 1691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1691 
CACAGATAAA CCTGAGTGTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1692 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1692 
GGTGAGAACT GTGGTATCTT ACTT 



2) INFORMATION FOR SEQ ID NO: 1693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

878 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1693 
CATTTCAACG CCTTCTTTCA ACTG 

2) INFORMATION FOR SEQ ID NO: 1694 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1694 
AAGGCAAGGA TGACAACGGC 



2) INFORMATION FOR SEQ ID NO: 1695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1695 
ACGATTTCCA CTTCTTCCTG G 



2) INFORMATION FOR SEQ ID NO: 1696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1696 

879 
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ATGTTCCTGT AGTTGCTGGA 



2) INFORMATION FOR SEQ ID NO: 1697 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ~ 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1697 
TTTCTTCAGC AATACCAACA AC 



2) INFORMATION FOR SEQ ID NO: 1698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1698 
GGAATCAACA GATGGTTTAC AAA 



2) INFORMATION FOR SEQ ID NO: 1699 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1699 
GCATCTTCTG GGAAAGGTGT 



880 
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2) INFORMATION FOR SEQ ID NO: 1700 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1700 
AAGATGCGGA AAGAAGCGAA , 20 

15 

2) INFORMATION FOR SEQ ID NO: 1701 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1701 
3 0 ATTATGGATC AGTTCTTGGA TCA 23 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1702 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: Challis V288 

(C) ACCESSION NUMBER: L20574 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1702 

TTCATAGACG CTGAGCACGC TTTGGATCCA TCTTACGCGG CTGCTCTAGG 50 
TGTAAATATT GATGAGCTGT TGCTATCTCA ACCAGATTCT GGTGAGCAAG 100 
GTTTAGAAAT TGCAGGAAAA TTGATTGACT CTGGGGCAGT TGATTTAGTT 150 

881 
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GTCATCGACT CTGTTGCAGC TCTTGTACCA CGTGCGGAAA 
TATCGGTGAT AGC 



TCGATGGAGA 



200 
213 



-10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1703 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 692 bases 
-(B) TYPE: Nucleic acid . . . . . 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: strain GS-5 

(C) ACCESSION NUMBER: M61897 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1703 



GGGCCGGAAT 


CTTCTGGTAA 


GACAACTGTC 


GCTCTTCATG 


CTGCTGCTCA 


50 


GGCGCAAAAA 


GATGGCGGTA 


TTGCCGCTTT 


CATTGATGCA 


GAACATGCCC 


100 


TTGATCCAGC 


CTATGCTGCT 


GCTCTTGGCG 


TTAATATTGA 


TGAGCTTTTG 


150 


CTTTCACAAC 


CAGATTCAGG 


AGAACAGGGT 


CTTGAAATTG 


CAGGGAAATT 


200 


GATTGATTCT 


GGCGCTGTTG 


ATTTAGTTGT 


TGTTGACTCA 


GTGGCAGCTT 


250 


TAGTACCACG 


TGCGGAGATT 


GACGGAGATA 


TTGGTAATAG 


TCATGTTGGC 


300 


TTACAAGCAC 


GCATGATGAG 


TCAAGCGATG 


CGTAAATTAT 


CAGCTTCAAT 


350 


CAATAAAACA 


AAAACCATTG 


CTATTTTTAT 


TAATCAATTG 


CGGGAAAAAG 


400 


TTGGTATTAT 


GTTTGGTAAT 


CCAGAAACAA 


CCCCTGGCGG 


GCGTGCCTTG 


450 


AAGTTTTATT 


CTTCTGTGCG 


TCTTGATGTC 


CGCGGCAATA 


CTCAAATTAA 


500 


AGGAACCGGG 


GAACAAAAAG 


ACAGCAATAT 


TGGTAAAGAG 


ACCAAAATTA 


550 


AAGTTGTTAA 


AAATAAAGTT 


GCTCCACCAT 


TTAAGGAAGC 


TTTTGTAGAA 


600 


ATTATATATG 


GTGAAGGCAT 


TTCTCGTACA 


GGTGAATTAG 


TTAAGATTGC 


650 


CAGTGATTTG 


GGAATTATCC 


AAAAAGCTGG 


AGCTTGGTAC 


TC 


692 



2) INFORMATION FOR SEQ ID NO: 1704 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
(C) ACCESSION NUMBER: Z17307 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1704 





ATGGCGAAAA 


AACCAAAAAA 


ATTAGAAGAA 


ATTTCAAAAA 


AATTTGGGGC 


50 




AGAACGTGAA 


AAGGCCTTGA 


ATGACGCTCT 


TAAATTGATT 


GAGAAAGACT 


100 


5 


TTGGTAAAGG 


ATCAATCATG 


CGTTTGGGTG 


AACGTGCGGA 


GCAAAAGGTG 


150 




CAAGTGATGA 


GCTCAGGTTC 


TTTAGCTCTT 


GACATTGCCC 


TTGGCTCAGG 


200 




TGGTTATCCT 


AAGGGACGTA 


TCATCGAAAT 


CTATGGCCCA 


GAGTCATCTG 


250 




GTAAGACAAC 


GGTTGCCCTT 


CATGCAGTTG 


CACAAGCGCA 


AAAAGAAGGT 


300 




GGGATTGCTG 


CCTTTATCGA 


TGCGGAACAT 


GCCCTTGATC 


CAGCTTATGC 


350 


10 


■ TGCGGCCCTT - 


GGTGTCAA-TA 


- TTGACGAAT-T 


GCTCTTGTCT 


CAACCAGAGT 


400 




CAGGAGAGCA 


AGGTCTTGAG 


ATTGCGGGAA 


AATTGATTGA 


CTCAGGTGCA 


450 




GTTGATCTTG 


TCGTAGTCGA 


CTCAGTTGCT 


GCCCTTGTTC 


CTCGTGCGGA 


500 




AATTGATGGA 


GATATCGGAG 


ATAGCCATGT 


TGGTTTGCAG 


GCTCGTATGA 


550 




TGAGCCAGGC 


CATGCGTAAA 


CTTGGCGCCT 


CTATCAATAA 


AACCAAAACA 


600 


15 


ATTGCCATTT 


TTATCAACCA 


ATTGCGTGAA 


AAAGTTGGAG 


TGATGTTTGG 


650 




AAATCCAGAA 


ACAACACCGG 


GCGGACGTGC 


TTTGAAATTC 


TATGCTTCAG 


700 




TCCGCTTGGA 


TGTTCGTGGT 


AATACACAAA 


TTAAGGGAAC 


TGGTGATCAA 


750 




AAAGAAACCA 


ATGTCGGTAA 


AGAAACTAAG 


ATTAAGGTTG 


TAAAAAATAA 


800 




GGTAGCTCCA 


CCGTTTAAGG 


AAGCCGTAGT 


TGAAATTATG 


TACGGAGAAG 


850 


20 


GAATTTCTAA 


GACTGGTGAG 


CTTTTGAAGA 


TTGCAAGCGA 


TTTGGATATT 


900 




ATCAAAAAAG 


CAGGGGCTTG 


GTATTCTTAC 


AAAGATGAAA 


AAATTGGGCA 


950 




AGGTTCTGAG 


AATGCTAAGA 


AATACTTGGC 


AGAGCACCCA 


GAAATCTTTG 


1000 




ATGAAATTGA 


TAAGCAAGTC 


CGTTCTAAAT 


TTGGCTTGAT 


TGATGGAGAA 


1050 




GAAGTTTCAG 


AACAAGATAC 


TGAAAACAAA 


AAAGATGAGC 


CAAAGAAAGA 


1100 


25 


AGAAGCAGTG 


AATGAAGAAG 


TTCCGCTTGA 


CTTAGGCGAT 


GAACTTGAAA 


1150 




TCGAAATTGA 


AGAATAAGCT 


GTTAAAGCAG 


TGGAGAAATC 


CGCTACTTTT 


1200 




TCGA 










1204 



30. 

2) INFORMATION FOR SEQ ID NO: 1705 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 981 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: NZ131 

(C) ACCESSION NUMBER: U21934 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1705 



ATGCGTTCAG GAAGTCTAGC TCTTGATATT 

TCCTAAAGGA CGTATCATCG AAATCTATGG 

50 CGACTGTGGC TTTACATGCT GTAGCACAAG 

GCAGCCTTTA TCGATGCCGA GCATGCGCTT 

GCTTGGGGTT AATATTGATG AACTTCTCTT 

AACAAGGACT TGAAATTGCA GGTAAATTGA 

883 



GCTTGGATAG 


CTGGTGGTTA 


50 


TCCAGAGTCT 


TCCGGTAAAA 


100 


CTCAAAAAGA 


AGGTGGAATC 


150 


GATCCAGCTT 


ATGCTGCTGC 


200 


GTCTCAACCA 


GATTCTGGAG 


250 


TTGATTCTGG 


TGCGGTTGAC 


300 
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CTGGTTGTTG 
TGGTGATATT 
AGGCCATGCG 
ATCTTTATCA 
5 TGAAACAACA 
TGGATGTGCG 
GCCAGCATTG 
TCCGCCATTT 
CTCGTACAGG 
"lO" " AAAGCAGGTG 
TGAAAATGCT 
TCGACCGTAA 
GAATCTGCTA 
TTTAGATAAT 

15 



TCGATTCAGT 
GGCGATAGCC 
TAAATTATCA 
ACCAATTGCG 
CCAGGTGGTC 
TGGAAACAAC 
GTAAGGAGAC 
AAGGTAGCAG 
GGAGCTTGTG 
CTTGGTTCTC 
AAGCGTTATT 
AGTACGTGTT 
TGGCAGTAGC 
GGTATTGAAA 



AGCAGCTTTA 
ATGTCGGATT 
GCTTCTATTA 
TGAAAAAGTT 
GAGCTTTGAA 
CAAATTAAAG 
CAAAATCAAG 
AAGTTGAAAT 
AAAATTGCTT 
"TTATAATGGT 
TGGCCGATCA 
AAATTTGGTT 
ATCAGAAGAA 
TTGAAGATTA 



GTGCCACGTG 
GCAAGCACGT 
ATAAAACAAA 
GGTGTGATGT 
ATTCTATGCT 
GAACTGGTGA 
GTTGTTAAAA 
CATGTATGGG 
CTGATTTGGA 
GAGAAGATTG 
TCCACAATTG 
TGCTTGAAGA 
ACCGATGATC 
A 



CTGAAATTGA 350 

ATGATGAGTC 400 

AACTATCGCA 450 

TTGGAAATCC 500 

TCTGTTCGGC 550 

CCAAAAGATA 600 

ACAAGGTCGC 650 

GAAGGTATTT 700 

CATTATCCAA 750 

GCCAAGGTTC 800 

TTTGATGAAA 850 

AAGCGAAGAA 900 

TTGCTTTAGA 950 
981 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
'(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE 
(vi) 



Genomic DNA 



ORIGINAL SOURCE: 
(A) ORGANISM: Streptococcus salivarius subsp. 
thermophilus 

(C) ACCESSION NUMBER: M94062 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1706 



35 


GCGTATGCAC 


GAGCTCTAGG 


TGTTAATATC 


GATGAGCTTC 


TTTTGTCGCA 


50 




GCCTGATTCT 


GGTGAGCAAG 


GTCTCGAAAT 


TGCAGGTAAG 


CTGATTGACT 


100 




CTGGTGCAGT 


GGATTTAGTT 


GTTGTTGACT 


CAGTTGCGGC 


CTTCGTACCA 


150 




CGTGCAGAAA 


TTGATGGAGA 


TAGTGGTGAC 


AGTCATGTAG 


GACTTCAAGC 


200 




GCGTATGATG 


AGTCAAGCCA 


TGCGTAAACT 


TTCTGCATCT 


ATTAATAAAA 


250 


40 


CAAAAACGAT 


TGCTATCTTT 


ATTAACCAGT 


TGCGTGAAAA 


AGTTGGTATC 


300 




ATGTTTGGTA 


AC 








312 



45 2) INFORMATION FOR SEQ ID NO: 1707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

884 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: J02967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1707 



ATGAAAAACA 


CAATACATAT 


• CAACTTCGCT 


ATTTTTTTAA 


TAATTGCAAA 


50 


TATTATCTAC 


AGCAGCGCCA 


GTGCATCAAC 


AGATATCTCT 


ACTGTTGCAT 


100 


XTCCATTATT 


TGAAGGAACT 


GAAGGTTGTT 


TTTTACTTTA 


CGATGCATCC 


150 


ACAAACGCTG 


AAATTGCTCA 


ATTCAATAAA 


GCAAAGTGTG 


CAACGCAAAT 


200 


GGCACCAGAT 


TCAACTTTCA 


AGATCGCATT 


ATCACTTATG 


GCATTTGATG 


250 


CGGAAATAAT 


AGATCAGAAA 


ACCATATTCA 


AATGGGATAA 


AACCCCCAAA 


300 


GGAATGGAGA 


TCTGGAACAG 


CAATCATACA 


CCAAAGACGT 


GGATGCAATT 


350 


TTCTGTTGTT 


TGGGTTTCGC 


AAGAAATAAC 


CCAAAAAATT 


AGATTAAATA 


400 


AAATCAAGAA 


TTATCTCAAA 


GATTTTGATT 


ATGGAAATCA 


AGACTTCTCT 


450 


GGAGATAAAG 


AAAGAAACAA 


CGGATTAACA 


GAAGCATGGC 


TCGAAAGTAG 


500 


CTTAAAAATT 


TCACCAGAAG 


AACAAATTCA 


ATTCCTGCGT 


AAAATTATTA 


550 


ATCACAATCT 


CCCAGTTAAA 


AACTCAGCCA 


TAGAAAACAC 


CATAGAGAAC 


600 


ATGTATCTAC 


AAGATCTGGA 


TAATAGTACA 


AAACTGTATG 


GGAAAACTGG 


650 


TGCAGGATTC 


ACAGCAAATA 


GAACCTTACA 


AAACGGATGG 


TTTGAAGGGT 


700 


TTATTATAAG 


CAAATCAGGA 


CATAAATATG 


TTTTTGTGTC 


CGCACTTACA 


750 


GGAAACTTGG 


GGTCGAATTT 


AACATCAAGC 


ATAAAAGCCA 


AGAAAAATGC 


800 


GATCACCATT 


CTAAACACAC 


TAAATTTATA 


A 




831 



25 



2) INFORMATION FOR SEQ ID NO: 1708 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
40 (B) STRAIN: HH22 

(C) ACCESSION NUMBER: M60253 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1708 



45 TTGAAAAAGT TAATATTTTT AATTGTAATT 

TAATTCAAAC AGTTCACATG CCAAAGAGTT 

ATAATGCTCA TATTGGTGTT TATGCTTTAG 

GTAAAATTTA ATTCAGATAA GAGATTTGCC 

GATAAATAGT GCTATTTTGT TAGAACAAGT 

50 AAAAAGTACA TATTAACAAA GATGATATAG 

GAAAAATATG TAGGAAAAGA TATCACTTTA 

AATGACATAT AGTGATAATA CAGCAAACAA 

GTGGAATCAA AAAAGTTAAA CAACGTCTAA 

885 



GCTTTAGTTT 


TAAGTGCATG 


50 


AAATGATTTA 


GAAAAAAAAT 


100 


ATACTAAAAG 


TGGTAAGGAA 


150 


TATGCTTCAA 


CTTCAAAAGC 


200 


ACCTTATAAT 


AAGTTAAATA 


250 


TTGCTTATTC 


TCCTATTTTA 


300 


AAAGCACTTA 


TTGAGGCTTC 


350 


TAAAATTATA 


AAAGAAATCG 


400 


AAGAACTAGG 


AGATAAAGTA 


450 
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ACAAATCCAG 
CAAAAAAGAT 
TTATCGCAAA 
TTAATGTTAA 
5 AAAAGACTAT 
CTAGAAATGA 
GTTTTAGTCA 
TAAGTTGATA 



TTAGATATGA 
ACTTCAACAC 
TGGAAAATTA 
ATAATAAAAG 
AAGGTTGCTG 
TGTTGCTTTT 
TTTTTACGAA 
AGTGAAACCG 



GATAGAATTA 
CTGCTGCTTT 
AGCAAAGAAA 
CGGAGATACT 
ATAAAAGTGG 
GTTTATCCTA 
TAAAGACAAT 
CCAAGAGTGT 



AATTACTATT 
CGGTAAGACT 
ACAAAAAATT 
TTAATTAAAG 
TCAAGCAATA 
AGGGCCAATC 
AAAAGTGATA 
AATGAAGGAA 



CACCAAAGAG 500 

TTAAATAAAC 550 

CTTACTTGAT 600 

ACGGTGTTCC 650 

ACATATGCTT 700 

TGAACCTATT 750 

AGCCAAATGA 800 

TTTTAA 846 



10 



2) INFORMATION FOR SEQ ID NO: , 1709 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 555 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: M29695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1709 



ATGTCCGCGA 
CCTGCCGATG 

30 GGGGTGGCGA 
CTGCCCAGAG 
GGGCGAGGAA 
GTGATGGCTG 
CAGTCTCTGG 

35 TGTCCGCGCT 
TTCAGACCGA 
AAGGCAGGAT 
GGTTTACATG 
CCTAA 

40 



GCACCCCCCC 
CTCCATGACT 
CGAAGAGCGA 
CGATGGCGGA 
CCGATCGGCT 
GTGGGAAGAT 
CTGACCCGAC 
CTCGTTGAAC 
CCCGACTCCG 
TCGTGCGGGA 
GTTCAAACAC 



CATAACTCTT 
GGCTCAACCG 
CCGACTCTTG 
AGAGTCCGTA 
ATGCTCAGTC 
GAAACTGATC 
ACAGTTGAAC 
TACTGTTCTC 
AACAACCATC 
GAAGATCATC 
GACAAGCCTT 



CGCCTCATGA 
GCCGCACATC 
ATGAAGTGCT 
ACACCGTACA 
GTACGTCGCG 
CAGGAGTGCG 
AAAGGCCTAG 
GGACCCCACC 
GAGCCATACG 
ACCACGCCTG 
CGAGAGAAAG 



CCGAGCGCGA 50 

GTTGAGTGGT 100 

GGAACACTAC 150 

TCGCAATGCT 200 

CTCGGAAGCG 250 

AGGAATAGAC 300 

GAACAAGGCT 350 

GTGACGAAGA 400 

CTGCTATGAG 450 

ACGGGCCGGC 500 

CGCGGTGTTG 550 
555 



2) INFORMATION FOR SEQ ID NO: 1710 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

886 



SUBSTITUTE SHEET (RULE 26) 
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(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: K02987 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1710 

5 





ATGAACCAGA 


AAAACCCTAA 


AGACACGCAA 


AATTTTATTA 


CTTCTAAAAA 


50 




GCATGTAAAA 


GAAATATTGA 


ATCACACGAA 


TATCAGTAAA 


CAAGACAACG 


100 




TAATAGAAAT 


CGGATCAGGA 


AAAGGACATT 


TTACCAAAGA 


GCTAGTCAAA 


150 




ATGAGTCGAT 


CAGTTACTGC 


TATAGAAATT 


GATGGAGGCT 


TATGTCAAGT 


200 


10 


GACTAAAGAA 


GCGGTAAACC 


CCTCTGAGAA 


- TATAAAAGTG 


ATTCAAACGG 


250 




ATATTCTAAA 


ATTTTCCTTC 


CCAAAACATA 


TAAACTATAA 


GATATATGGT 


300 




AATATTCCTT 


ATAACATCAG 


TACGGATATT 


GTCAAAAGAA 

V> X X AX AJ IX X X X« A 


TTACCTTTGA 


350 




AAGTCAGGCT 


AAATATAGCT 


ATCTTATCGT 


TGAGAAGGGA 


TTTGCGAAAA 


400 




GATTGCAAAA 


TCTGCAACGA 


GCTTTGGGTT 


TACTATTAAT 


GGTGGAGATG 


450 


15 


GATATAAAAA 


TGCTCAAAAA 


AGTACCACCA 


CTATATTTTC 


ATCCTAAGCC 


500 




AAGTGTAGAC 


TCTGTATTGA 


TTGTTCTTGA 


ACGACATCAA 


CCATTGATTT 


550 




CAAAGAAGGA 


CTACAAAAAG 


TATCGATCTT 


TTGTTTATAA 


GTGGGTAAAC 


600 




CGTGAATATC 


GTGTTCTTTT 


CACTAAAAAC 


CAATTCCGAC 


AGGCTTTGAA 


650 




GCATGCAAAT 


GTCACTAATA 


TTAATAAACT 


ATCGAAGGAA 


CAATTTCTTT 


700 


20 


CTATTTTCAA 


TAGTTACAAA 


TTGTTTCACT 


AA 




732 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1711 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2570 

(C) ACCESSION NUMBER: M19270 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1711 



ATGAACAAAA 


ATATAAAATA 


TTCTCAAAAC 


TTTTTAACGA 


GTGAAAAAGT 


50 


ACTCAACCAA 


ATAATAAAAC 


AATTGAATTT 


AAAAGAAACC 


GATACCGTTT 


100 


ACGAAATTGG 


AACAGGTAAA 


GGGCATTTAA 


CGACGAAACT 


GGCTAAAATA 


150 


AGTAAACAGG 


TAACGTCTAT 


TGAATTAGAC 


AGTCATCTAT 


TCAACTTATC 


200 


GTCAGAAAAA 


TTAAAATCGA 


ATACTCGTGT 


CACTTTAATT 


CACCAAGATA 


250 


TTCTACAGTT 


TCAATTCCCT 


AACAAACAGA 


GGTATAAAAT 


TGTTGGGAAT 


300 


ATTCCTTACC 


ATTTAAGCAC 


ACAAATTATT 


AAAAAAGTGG 


TTTTTGAAAG 


350 


CCATGCGTCT 


GACATCTATC 


TGATTGTTGA 


AGAAGGATTC 


TACAAGCGTA 


400 


CCTTGGATAT 


TCACCGAACA 


CTAGGGTTGC 


TCTTGCACAC 


TCAAGTCTCG 


450 


ATTCAGCAAT 


TGCTTAAGCT 


GCCAGCGGAA 


TGCTTTCATC 


CTAAACCAAG 


500 


AGTAAACAGT 


GTCTTAATAA 


AACTTACCCG 


CCATACCACA 


GATGTTCCAG 


550 


ATAAATATTG 


GAAGCTATAT 


ACGTACTTTG 


TTTCAAAATG 


GGTCAATCGA 


600 


GAATATCGTC 


AACTGTTTAC 


TAAAAATCAG 
887 


TTTCATCAAG 


CAATGAAACA 


650 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CGCCAAAGTA AACAATTTAA GTACCGTTAC TTATGAGCAA GTATTGTCTA 700 
TTTTTAATAG TTATCTATTA TTTAACGGGA GGAAATAA 738 



2) INFORMATION FOR SEQ ID NO: 1712 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 735 bases 

10 (B) TYPE: "Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear. 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN451 

(C) ACCESSION NUMBER: M17990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1712 



ATGAACGAGA 
TAATATAGAT 

25 TCTTTGAAAT 
AGGTGTAATT 
TACAGAAAAT 
ATATATTGCA 
AATATACCTT 

30 TAGTATAGCT 
GATTATTAAA 
GATATTTCTA 
TAGAGTGAAT 
CACACAAAGA 

35 AAAGAATACA 
ACATGCAGGA 
CTCTTTTCAA 



AAAATATAAA 
AAAATAATGA 
CGGCTCAGGA 
TCGTAACTGC 
AAACTTGTTG 
GTTTAAATTT 
ATAACATAAG 
GATGAGATTT 
TACAAAACGC 
TATTAAGTAT 
AGCTCACTTA 
TAAACAGAAG 
AGAAAATATT 
ATTGACGATT 
TAGCTATAAA 



ACACAGTCAA 
CAAATATAAG 
AAAGGGCATT 
CATTGAAATA 
ATCACGATAA 
CCTAAAAACC 
TACGGATATA 
ATTTAATCGT 
TCATTGGCAT 
GGTTCCAAGA 
TCAGATTAAA 
TATAATTATT 
TACAAAAAAT 
TAAACAATAT 
TTATTTAATA 



AACTTTATTA 
ATTAAATGAA 
TTACCCTTGA 
GACCATAAAT 
TTTCCAAGTT 
AATCCTATAA 
ATACGCAAAA 
GGAATACGGG 
TATTTTTAAT 
GAATATTTTC 
TAGAAAAAAA 
TCGTTATGAA 
CAATTTAACA 
TAGCTTTGAA 
AGTAA 



CTTCAAAACA 50 

CATGATAATA 100 

ATTAGTACAG 150 

TATGCAAAAC 200 

TTAAACAAGG 250 

AATATTTGGT 300 

TTGTTTTTGA 350 

TTTGCTAAAA 400 

GGCAGAAGTT 4 50 

ATCCTAAACC 500 

TCAAGAATAT 550 

ATGGGTTAAC 600 

ATTCCTTAAA 650 

CAATTCTTAT 700 
735 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1713 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1029 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

888 
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(C) ACCESSION NUMBER: U00456 . 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1713 



5 


ATGAATAAAA 


TAAAAGTCGC 


AATTATCTTC 


GGCGGTTGCT 


CGGAGGAACA 


50 




TGATGTGTCG 


GTAAAATCCG 


CAATAGAAAT 


TGCTGCGAAC 


ATTAATACTG 


100 




AAAAATTCGA 


TCCGCACTAC 


ATCGGAATTA 


CAAAAAACGG 


CGTATGGAAG 


150 




CTATGCAAGA 


AGCCATGTAC 


GGAATGGGAA 


GCCGATAGTC 


TCCCCGCCAT 


200 




ATTCTCCCCG 


GATAGGAAAA 


CGCATGGTCT 


GCTTGTCATG 


AAAGAAAGAG 


250 


10 


AATACGAAAC 


TCGGCGTATT 


GACGTGGCTT 


TCCCGGTTTT 


TV ^1 TV TV TV 

GGATGGCAAA 


300 




TGCGGGGAGG 


ATGGTGCGAT 


ACAGGGTCTG 


TTTGAATTGT 


CTGGTATCCC 


350 




CTATGTAGGC 


TGCGATATTC 


AAAGCTCCGC 


AGCTTGCATG 


GACAAATCAC 


400 




TGGCCTACAT 


TCTTACAAAA 


AATGCGGGCA 


TCGCCGTCCC 


CGAATTTCAA 


450 




ATGATTGAAA 


AAGGTGACAA 


ACCGGAGGCG 


AGGACGCTTA 


CCTACCCTGT 


500 


15 


CTTTGTGAAG 


CCGGCACGGT 


CAGGTTCGTC 


CTTTGGCGTA 


ACCAAAGTAA 


550 




ACAGTACGGA 


AGAACTAAAC 


GCTGCGATAG 


AAGCAGCAGG 


' ACAATATGAT 


600 




GGAAAAATCT 


TAATTGAGCA 


AGCGATTTCG 


GGCTGTGAGG 


TCGGCTGCGC 


650 




GGTCATGGGA 


AACGAGGATG 


ATTTGATTGT 


CGGCGAAGTG 


GATCAAATCC 


700 




GGTTGAGCCA 


CGGTATCTTC 


CGCATCCATC 


AGGAAAACGA 


GCCGGAAAAA 


750 


20 


GGCTCAGAGA 


ATGCGATGAT 


TATCGTTCCA 


GCAGACATTC 


CGGTCGAGGA 


800 




ACGAAATCGG 


GTGCAAGAAA 


CGGCAAAGAA 


AGTATATCGG 


GTGCTTGGAT 


850 




GCAGAGGGCT 


TGCTCGTGTT 


GATCTTTTTT 


TGCAGGAGGA 


TGGCGGCATC 


900 




GTTCTAAACG 


AGGTCAATAC 


CCTGCCCGGT 


TTTACATCGT 


ACAGCCGCTA 


950 




TCCACGCATG 


GCGGCTGCCG 


CAGGAATCAC 


GCTTCCCGCA 


CTAATTGACA 


1000 


25 


GCCTGATTAC 


ATTGGCGATA 


GAGAGGTGA 






1029 



2) INFORMATION FOR SEQ ID NO: 1714 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1714 

45 GCACAGGTTC AGTTGGGCTT GATCTTGCTT TAGGTATAGG CGGTGTTCCA 50 

AAAGGAAGAA TTATAGAAAT TTATGGRCCT GAAAGTTCAG GTAAAACCAC 100 

TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT GGRGTTTGTG 150 

CTTTTATCGA TGCAGAACAT GCRCTTGATG TRAAATATGC TAAAAATTTG 200 

GGTGTAAATA CAGATGATTT GTATGTTTCT CAGCCTGATT TTGGAGARCA 250 

50 AGCCTTAGAA ATT GT AG AAA CTATAGCAAG AAGTGGCGCA GTAGATCTTA 300 

TCGTAGTAGA TAGCGTTGCA GCGCTTACCC CAAAAGCAGA AATTGAAGGC 350 

GATATGGGTG ATCAACATGT AGGACTTCAA GCAAGACTTA TGTCTCAAGC 4 00 

TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC ACAGTAATTT 4 50 

889 
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TYATCAACCA 
GAAACCACAA 
AGATGTTAGA 
ACCGCGTTAA 
5 CAAGCTGAAT 
ATTGATCGAT 
GGTTTTCTTA 
GCTTTCTTAA 



AATTCGTATG 
CAGGTGGAAA 
AAAGTAGCAA 
AGTAAAAGTA 
TTGATGTGAT 
TATGGTGTAA 
TAAAGATAAA 
AAGAAAAC 



AAAATCGGTG 
TGCATTAAAA 
CCTTAAAACA 
GTTAAAAATA 
GTTTGGAGAG 
AACTTGATAT 
AAACTTGGAC 



CTATGGGTTA 
TTTTATGCTT 
AAACGAAGAA 
AAGTTGCTCC 
GGTTTAAGCC 
CGTAGATAAA 
AAGGTAGAGA 



TGGTACTCCT 500 

CTGTGCGTTT 550 

CCTATAGGAA 600 

TCCATTCAGA 650 

GTGAAGGTGA 700 

AGTGGTGCGT 750 

AAATTCAAAA 800 
818 



10 



2) INFORMATION FOR SEQ ID NO:, 1715 



. (i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(C) ACCESSION NUMBER: AF124224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1715 



TGGTGCTATC 

30 GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ACACTCCAGT 
TCATACRAAG 

35 TCCAACTCCA 
ACGTGTTCTC 
CGTGGACAAG 
AGAAACTTCA 
TAGACTACGC 

40 ACACGTGACA 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 

45 CAGTAGCGA 



TTAGTAGTAT 
CTTATTATCA 
AAGTTGACAT 
GTTCGTGACT 
TGTTGCAGGT 
AAAAAATCTT 
GAACGYGACG 
AATCACAGGT 
TTCGTGTTGG 
AAAACAACTG 
TGAAGCAGGG 
ACATCGAACG 
ACTAAATTCA 
TCATACTCCA 
CAGACATCAC 
CCTGGTGATA 



CTGCAGCTGA 
CGTCAAGTAG 
GGTTGACGAT 
TATTATCAGA 
TCTGCTTTAC 
AGAATTAATG 
TTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GATAACATTG 
TGGACAAGTT 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTTGT 
ACGTAACTAT 



TGGTCCAATG 
GTGTTCCTTA 
GAAGAATTAT 
ATACGATTTC 
GCGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GT AC ATT ATT 
CTTGCTAAAC 
TTACGTATTA 
ACTACCGTCC 
GTGTTACCAG 
GGAAGTTGAA 



CCTCAAACAC 


50 


CATCGTTGTA 


100 


TAGAATTAGT 


150 


CCAGGCGATG 


200 


AGGCGACGCT 


250 


ACGAATACAT 


300 


CCAGTTGAAG 


350 


TCGTGTTGAA 


400 


GTATTTCAGA 


450 


CGTAAATTGT 


500 


ACGTGGTGTT 


550 


CAGGAACAAT 


600 


ACTAAAGAAG 


650 


TCAATTCTAC 


700 


AAGGCGTTGA 


750 


TTAATTCACC 


800 




809 



2) INFORMATION FOR SEQ ID NO: 1716 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

890 



SUBSTITUTE SHEET (RULE 26) 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia defectiva 

(B) STRAIN: ATCC 4 9176 

(C) ACCESSION NUMBER: AF124225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1716 



CGGCGCGATC 
GTGAACACAT 

15 TTCTTGAACA 
TGAAATGGAA 
ACACTCCAGT 
AACTACGAAG 
TCCAGAACCA 

20 ACGTATTCTC 
CGTGGTCAAG 
AGAAACTTCT 
TGGATTACGC 
ACTCGTGACC 

25 CACTCCGYAC 
AAGGTGGTCG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCGAT 

30 



CTCGTTGTAT 
CCTCTTGTCT 
AAGTTGACAT 
GTTCGTGACC 
TATCGCTGGT 
CTAAAGTTTT 
GAACGTGACA 
TATCACTGGT 
TTCGCGTTGG 
AAGACTACCG 
TGAAGCTGGG 
AAATCCAACG 
ACTAAGTTCG 
TCACACTCCA 
CTGACGTAAC 
CCAGGCGACA 
CGAAGAA 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAC 
TCTTGTCTGA 
TCAGCTTTGA 
AGAATTGATG 
CTGACAAGCC 
CGTGGTACTG 
TGACGAAGTT 
TTACCGGTGT 
GACAACGTTG 
TGGTCAAGTA 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTGTT 
ACGTACAAAT 



CGGCCCAATG 
GTGTTCCTTA 
GAAGAATTGC 
ATACGACTTC 
AAGCTTTAGA 
GAACAAGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTTG 
TGAAATGTTC 
GTACCTTGTT 
TTATCTAAAC 
GTACGTATTG 
ACTACCGTCC 
ACTTTACCAG 
GGTTGTTGAA 



CCACAAACTC 50 

CATCGTAGTA 100 

TCGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACGCT 250 

ATGCTTACAT 300 

CCAGTCGAAG 350 

TCGTGTTGAA 400 

GTATCGAAGA 450 

CGTAAGTTAT 500 

ACGTGGTGTA 550 

CAGGTTCAAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 750 

TTGATCCACC 800 
817 



2) INFORMATION FOR SEQ ID NO: 1717 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium accolens 
45 (B) STRAIN: ATCC 49725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1717 

CGGCGCTATC CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC 50 

50 GCGAGCACGT TCTGCTTGCT CGCCAGGTTG GCGTTCCTTA CATCCTCGTT 100 

GCACTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

GGAGATGGAG ATCTCCGAGC TGCTCGCAGA GCAGGACTAC GATGAGGAAG 200 

CTCCTATCGT TCACATCTCC GCTCTGAAGG CACTCGAGGG TGACGAGAAG 250 

891 
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TGGGTACAGT CCATCGTTGA CCTGATGGAT 

TGATCCGGAG CGCGCTACCG ATCAGCCGTT 

TCTTCACCAT TACCGGCCGC GGTACCGTTG 

GGTCGTCTGA ACGTCAACGA GGACGTTGAG 

5 GTCCCAGAAC ACCACCGTTA CCGGTATCGA 

ACTACACCGA GGCTGGCGAC AACTGTGGTC 

CGTGAGGACG TTGAGCGTGG CCAGGTTGTT 

CCCTCACACC AAGTTCGAGG GTTCCGTCTA 

GCGGCCGCCA CACCCCGYTC ATGAACAACT 

10 CGCACCACCG ACGTTACCGG TGTTGTGAAC 

GGTTATGCCT GGCGACAACG TTGAGATGTC 

TTGCTATGGA CGAG 



GCCTGCGACA 


ACTCCATCCC 


300 


CTTGATGCCT 


ATCGAGGACA 


350 


TTACCGGCCG 


TGTTGAGCGT 


400 


ATCATCGGTA 


TCCAGGAGAA 


450 


GATGTTCCGC 


AAGATGATGG 


500 


TGCTTCTGCG 


TGGTACCAAG 


550 


ATCAAGCCGG 


GCGCTTACAC 


600 


CGTCCTGAAG 


AAGGAAGAGG 


650 


ACCGTCCTCA 


GTTCTACTTC 


700 


CTGCCTGAGG " 


GCACCGAGAT- 


750 


TGTTGAGCTC 


ATCCAGCCTG 


800 






814 



15 



2) INFORMATION FOR SEQ ID NO: 1718 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 814 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coryn ebacter iiun genitalium 

(B) STRAIN: ATCC 33031 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1718 



CGGCGCCATC 
GTGAGCACGT 
GCACTGAACA 

35 CGAGATGGAG 
CACCTGTTGT 
TGGGCTAAGC 
GGATCCGGAG 
TCTTCACCAT 

40 GGCGTCCTGA 
GTCCACCAAG 
ACACCGCAGA 
CGCGAAGATG 
CCCGCACACC 

45 GTGGCCGCCA 
CGCACCACCG 
GGTTATGCCG 
TTGCTATGGA 



CTGGTTGTTG 
TCTGCTGGCT 
AGTGCGACAT 
GTCCGCGAGC 
TCACATCTCC 
AGATCCTGGA 
CGCGAGACCG 
TACCGGCCGC 
ACCTGAACGA 
ACCACCGTTA 
GGCTGGCGAC 
TTGAGCGTGG 
GAGTTCGAGG 
CACCCCGTTC 
ACGTTACCGG 
GGCGACAACG 
CGAG 



CTGCAACCGA 
CGCCAGGTTG 
GGTTGATGAT 
TGCTGGCTGA 
GCACTGAAGG 
GCTCATGGAG 
ACAAGCCGTT 
GGTACCGTTG 
CGAGGTCGAG 
CCTCCATCGA 
AACGCCGCAC 
TCAGATCGTT 
GCTCCGTCTA 
TTCGACAACT 
TGTTGTGAAG 
TTGACATGTC 



TGGCCCGATG 
GCGTTCCGTA 
GAGGAGCTGC 
GCAGGACTTC 
CCCTGGAGGG 
GCTTGCGACA 
CCTGATGCCG 
TTACCGGCCG 
ATCCTGGGCA 
GATGTTCAAC 
TGCTGCTGCG 
GCTAAGCCGG 
CGTTCTGTCC 
ACCGTCCGCA 
CTGCCGGAGG 
CGTCACCCTG 



CCGCAGACCC 50 

CATCCTAGTT 100 

TGGAGCTCGT 150 

GACGAGGAAG 200 

CGACGAGAAG 250 

ACTCCATCCC 300 

GTTGRGGACA 350 

TGTTGAGCGT 400 

TCCGCGAGAA 450 

AAGCTGCTGG 500 

TGGCCTGAAG 550 

GCGAGTACAC 600 

AAGGACGAGG 650 

GTTCTATTTC 700 

GCACCGAGAT 750 

ATCCAGCCGG 800 
814 



50 



2) INFORMATION FOR SEQ ID NO: 1719 



892 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Corynebacterium jeikeium 

(B) STRAIN: ATCC 43216 



(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1719 



15 


CGGCGCCATC 


CTGGTTGTTG 


CCGCAACCGA 


TGGCCCGATG 


CCGCAGACCC 


50 




GCGAGCACGT 


TCTGCTGGCY 


CGCCAGGTTG 


GCGTTCCGTA 


CATCCTGGTT 


100 




GCACTGAACA 


AGTGTGACAT 


GGTTGACGAT 


GAGGAGCTGC 


TGGAGCTCGT 


150 




CGAGATGGAG 


GTCCGCGAGC 


TGCTGGCTGA 


GCAGGACTTC 


GACGAGGAAG 


200 




CTCCGGTTGT 


TCACATCTCC 


GCACTGAAGG 


CCCTGGAGGG 


CGACGAGAAG 


250 


20 


TGGGCTAACC 


AGATTCTCGA 


GCTGATGCAG 


GCTTGCGACG 


AGTCTATCCC 


300 




GGATCCGGAG 


CGCGAGACCG 


ACAAGCCGTT 


CCTGATGCCG 


GTTGWGGACA 


350 




TCTTCACCAT 


TACCGGTCGC 


GGTACCGTTG 


TTACCGGCCG 


TGTTGAGCGT 


400 




GGCATCCTGA 


ACCTGAACGA 


CGAGGTTGAG 


ATCCTGGGTA 


TCCGCGAGAA 


450 




GTCCCAGAAG 


ACCACCGTTA 


CCTCCATCGA 


GATGTTCAAC 


AAGCTGCTGG 


500 


25 


ACACCGCAGA 


GGCTGGCRAC 


AACGCTGCAC 


TGCTGCTGCG 


TGGTCTGAAG 


550 




CGCGAGGACG 


TTGAGCGTGG 


CCAGATCATC 


GCTAAGCCGG 


GCGAGTACAC 


600 




CCCGCACACC 


GAGTTCGAGG 


GCTCCGTCTA 


CGTTCTGTCC 


AAGGACGAGG 


650 




GCGGCCGCCA 


CACCCCGTTC 


TTCGACAACT 


ACCGTCCGCA 


GTTCTACTTC 


700 




CGCACCACCG 


ACGTTACCGG 


TGTTGTGAAG 


CTGCCTGAGG 


GCACCGAGAT 


750 


30 


GGTTATGCCG 


GGCGACAACG 


TYGACATGTC 


CGTCACCCTG 


ATCCAGCCGG 


800 




TTGCTATGGA 


CGAG 








814 



35 2) INFORMATION FOR SEQ ID NO: 1720 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1720 

CGGCGCTATC TTGGTTGTTG CAGCTACCGA CGGCCCAATG CCACAGACTC 50 
GCGAGCACGT TCTGCTGGCT CGCCAGGTTG GCGTTCCTTA CATCCTGGTT 100 
GCACTAAACA AGTGCGACAT GGTTGACGAC GAGGAAATCC TCGAGCTCGT 150 

893 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CGAGATGGAG ATCCGCGAAT TGCTGGCTGA 

CTCCAATCGT TCACATCTCC GCAGTCGGCG 

TGGGTTAACG CCATCGTTGA ACTGATGGAT 

TGATCCAGAC CGTGCTACCG ACAAGCCATT 

5 TCTTCACCAT TACCGGTCGT GGCACCGTTG 

GGTTCCCTGA AGGTCAACGA AGAAGTCGAG 

GTCCCAGAAG ACCACCATCA CCGGTATCGA 

ACTACACCGA GGCCGGCGAC AACGCTGGTC 

CGTGAAGACG TTGAGCGTGG ACAGGTTATC 

10 CACCCACAAG AAGTTCGAAG GTTCCGTCTA 

GCGGCCGCCA CACCCCGTTC TTCGACAACT 

CGCACCACCG ACGTTACCGG TGTTGTTACC 



LLAbbAAl lb 


GACGAAGAAG 


zUU 


LL1 ibbAAbb 


bGAAGAGAGG 




i 1 b 1 oHto 


AblbbAlLLL 




bb 1 bn J. uLl 1 


A 1 Lb Abb AC A 


OCA 


i i ALbbb 1 Lb 


Ibl I bAbbbl 


/An 


HI CHI LobLA 


1 L AAbbAAAA 




AA1 bl ILLbb 


AAbATGCTbb 




lbblbbi lbb 


LGGTACCAAG 




GTTGCTCCAG 


GTGCTTACAG 


600 


CGTTCTTTCC 


AAGGACGAGG . 


650 


ACCGTCCTCA 


GTTCTACTTC 


700 


CTGCCTGAGG 


GCACCGAG 


748 



15 



2) INFORMATION FOR SEQ ID NO: 1721 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 813 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium striatum 

(B) STRAIN: ATCC 6940 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1721 



GGCGCTATCT 
CGAGCACGTT 
CACTGAACAA 

35 GAGATGGAGA 
TCCGATCGTT 
GGGTACAGGC 
GATCCGGAGC 
CTTCACCATC 

4 0 GCTCCCTGAA 
TCCATCTCCA 
CTACACCGAG 
GTGAAGAGGT 
CCTCACACCC 

45 CGGCCGCCAC 
GCACCACCGA 
GTTATGCCTG 
CGCTATGGAC 



TGGTTGTTGC 
CTTCTGGCTC 
GTGCGACATG 
TCCGCGAACT 
CACATCTCTG 
TATCGTTGAC 
GCGAGCTGGA 
ACCGGCCGCG 
CGTCAACGAG 
CCACCGTTAC 
GCTGGCGACA 
TGAGCGCGGC 
AGTTCGAGGG 
ACCCCGTTCA 
CGTTACCGGC 
GCGACAACGT 
GAG 



TGCAACCGAT 
GCCAGGTTGG 
GTTGACGACG 
GCTCGCAGAG 
CTCTGAAGGC 
CTGATGCAGG 
CAAGCCGTTC 
GTACCGTTGT 
GACGTTGAGA 
CGGTATCGAG 
ACTGTGGTCT 
CAGGTTGTTA 
TTCCGTCTAC 
TGGACAACTA 
GTCATCAAGC 
CGAGATGTCY 



GGCCCGRTGC 
CGTTCCTTAC 
AGGAAATTAT 
CAGGACTACG 
TCTTGAGGGC 
CTTGCGATGA 
CTGATGCCAA 
TACTGGCCGT 
TCATCGGTAT 
ATGYTCCGCA 
GCTTCTGCGT 
TTAAGCCGGG 
GTCCTGAAGA 
CCGTCCGCAG 
TGCCTGAGGG 
GTCGAGCTGA 



CGCAGACCCG 50 

ATCCTCGTTG 100 

CGAGCTCGTC 150 

ATGAGGAAGC 200 

GRCGAGAAGT 250 

CTCCATCCCG 300 

TCGAGGACAT 350 

GTTGAGCGTG 400 

CCAGGACARG 450 

AGATGATGGA 500 

GGTACCAAGC 550 

CGCTTACACC 600 

AGGAAGAGGG 650 

TTCTACTTCC 700 

CACCGAGATG 750 

TCCAGCCGGT 800 
813 



50 



2) INFORMATION FOR SEQ ID NO: 1722 



894 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 



15 



20 



25 



30 



(vi) ORIGINAL SOURCE: 

"(A) ORGANISM: Enterococcus avium _ 

(B) STRAIN: ATCC 14025 

(C) ACCESSION NUMBER: AF124220 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 1722 



CGGAGCTATC TTAGTAGTAT 

GTGAACACAT CTTGTTATCT 

TTCTTAAACA AAATGGATAT 

TGAAATGGAA GTTCGTGACT 

ACACTCCAGT TATCGCAGGT 

TCATACGAAG AAAAAATCTT 

CCCAACACCA GTTCGTGATA 

ACGTATTCTC AATCACTGGT 

CGTGGACAAG TTCGCGTTGG 

CGAAACTGCT AAAACAACTG 

TAGACTACGC TGAAGCAGGT 

GCACGTGAAG ATATCCAACG 

CACTCCACAT ACAAAATTCT 

AAGGTGGACG TCATACTCCA 

TTCCGTACAA CTGACGTAAC 

AATGGTWATG CCTGGGGATA 

CAATYGCGGT AGAAGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCGTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
CTGCAGAAGT 
TTCTTCACTA 
TGGTGTAGTT 
ACGTAACTAT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGACTTC 
AAGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCTTTGTT 
TTGGCTAAAC 
TTATGTTCTA 
ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 



CCTCAAACTC 50 

CATCGTTGTA 100 

TTGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACGCT 250 

ACGAATATAT 300 

CCAGTCGAAG 350 

TCGTGTTGAA 4 00 

GTATCGCTGA 450 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGCTTCAAT 600 

ACTAAAGAAG 650 

TCAGTTCTAC 700 

AAGGTACTGA 750 

TTGATYCACC 800 
817 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1723 



(i) SEQUENCE CHARACTERISTICS: 
(A) . LENGTH: 823 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 

(B) STRAIN: ATCC 14018 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1723 

TGGCGCAATC CTCGTGGTTG CTGCTACCGA CGGTCCAATG GCTCAGACCC 50 
GTGAACACGT CTTGCTTGCT AAGCAGGTCG GCGTTCCAAA AATTCTTGTT 100 

895 
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GCTTTGAACA 


AGTGCGATAT 


GGTTGACGAC 


GAAGAGCTTA 


TCGATCTCGT 


150 




TGAAGAAGAG 


GTCCGTGACC 


TCCTCGAAGA 


AAACGGCTTC 


GATCGCGATT 


200 




GCCCAGTCYT 


CCGTACTTCC 


GCTTACGGCG 


CTTTGCATGA 


TGACGCTCCA 


250 




GACCACGACA 


AGTGGGTAGA 


GACCGTCAAG 


GAACTCATGA 


AGGCTGTTGA 


300 


5 


CGAGTACATC 


CCAACCCCAA 


CTCACGATCT 


TGACAAGCCA 


TTCTTGATGC 


350 




CAATCGAAGA 


TGTGTTCACC 


ATCTCCGGTC 


GTGGTYCCGT 


TGTCACCGGT 


400 




CGTGTTGAGC 


GTGGTAAGCT 


CCCAATCAAC 


ACCCCAGTTG 


AGATCGTTGG 


450 




TTTGCGCGAT 


ACCCAGACCA 


CCACCGTCAC 


CTCTATCGAG 


ACCTTCCACA 


500 




AGCAGATGGA 


TGAGGCAGAG 


GCTGGCGATA 


ACACTGGTCT 


TCTTCTCCGC 


550 


10 


GGTATCAACC 


GTACCGACGT 


TGAGOGTGGT 


CAGGTTGTGG 


CTGCTCCAGG 


600 




TTCTGTGACT 


CCACACACCA 


AGTTCGAAGG 


CGAAGTTTAC 


GTCTTGACCA 


650 




AGGACGAAGG 


TGGCCGTCAC 


TCGCCATTCT 


TCTCCAACTA 


CCGTCCACAG 


700 




TTCTACTTCC 


GTACCACCGA 


TGTTACTGGC 


GTTATCACCT 


TGCCAGACGG 


750 




CATCGAAATG 


GTTCAGCCAG 


GCGATCACGC 


AACCTTCACT 


GTTGAGTTGA 


800 


i c; 












O C. O 




2) INFORMATION FOR SEQ ID NO: 1724 








20 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 817 


bases 










(B) 


TYPE: Nucleic acid 










(C) 


STRANDEDNESS: Double 








25 


(D) 


TOPOLOGY: Linear 










(ii) MOLECULE TYPE: Genomic DNA 










(vi) ORIGINAL SOURCE: 










•J V 


(A) 


ORGANISM: Listeria innocua 








<B) 


STRAIN: ATCC 33090 










(Xi) SEQUENCE DESCRIPTION: SEQ ID 


NO: 1724 






35 


CGGAGCTATC 


TTAGTAGTAT 


CTGCTGCTGA 


TGGCCCAATG 


CCACAAACTC 


50 




GTGAACATAT 


CTTACTTTCA 


CGTCAAGTTG 


GTGTTCCATA 


CATCGTTGTA 


100 




TTCATGAACA 


AATGTGACAT 


GGTTGACGAT 


GAAGAATTAC 


TAGAATTAGT 


150 




TGAAATGGAA 


ATTCGTGATC 


TATTAACTGA 


ATATGAATTC 


CCTGGCGATG 


200 




ACATTCCTGT 


AATCAAAGGT 


TCAGCTCTTA 


AAGCACTTCA 


AGGTGAAGCT 


250 


40 


GACTGGGAAG 


CTAAAATTGA 


CGAGTTAATG 


GAAGCTGTAG 


ATTCTTACAT 


300 




TCCAACTCCA 


GAACGTGATA 


CTGACAAACC 


ATTCATGATG 


CCAGTTGAGG 


350 




ATGTATTCTC 


AATCACTGGT 


CGTGGAACAG 


TTGCAACTGG 


ACGTGTTGAA 


400 




CGTGGACAAG 


TTAAAGTTGG 


TGACGAAGTA 


GAAGTTATCG 


GTATTGAAGA 


450 




AGAAAGCAAA 


AAAGTAGTAG 


TAACTGGAGT 


AGAAATGTTC 


CGTAAATTAC 


500 


45 


TAGACTACGC 


TGAAGCTGGC 


GACAACATTG 


GCGCACTTCT 


ACGTGGTGTT 


550 




GCTCGTGAAG 


ATATCCAACG 


TGGTCAAGTA 


TTAGCTAAAC 


CAGGTTCGAT 


600 




TACTCCACAC 


ACTAACTTCA 


AAGCTGAAAC 


TTATGTTTTA 


ACTAAAGAAG 


650 




AAGGTGGACG 


TCACACTCCA 


TTCTTCAACA 


ACTACCGCCC 


ACAATTCTAT 


700 




TTCCGTACTA 


CTGACGTAAC 


TGGTATTGTT 


ACACTTCCAG 


AAGGTACTGA 


750 


50 


AATGGTAATG 


CCTGGTGATA 


ACATTGAGCT 


TGCAGTTGAA 


CTAATTGCAC 


800 




CAATCGCTAT 


CGAAGAC 








817 



896 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1725 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

. (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1725 



PCT/CAOO/01150 



10 



CGGAGCTATC 
GTGAACATAT 
ATTCATGAAC 

20 TTGAAATGGA 
GACATTCCTG 
TGATTGGGAA 
TTCCAACTCC 
GATGTATTCT 

25 ACGTGGACAA 
AAGAAAGCAA 
CTAGACTACG 
TGCTCGTGAA 
TTACTCCACA 

30 GAAGGTGGAC 
TTTCCGTACT 
AAATGGTAAT 
CCAATCGCTA 



TTAGTAGTAT 
TCTTACTTTC 
AAATGTGACA 
AATTCGTGAT 
TAATCAAAGG 
GCTAAAATTG 
AGAACGTGAT 
CAATCACTGG 
GTTAAAGTTG 
AAAAGTAGTA 
CTGAAGCTGG 
GATATCCAAC 
TACTAACTTC 
GTCATACTCC 
ACTGACGTAA 
GCCTGGTGAT 
TCGAAGAC 



CTGCTGCTGA 
ACGTCAAGTT 
TGGTTGACGA 
CTATTAACTG 
TTCAGCTCTT 
ACGAGTTAAT 
ACTGACAAAC 
TCGTGGAACA 
GTGACGAAGT 
GTAACTGGAG 
CGACAACATT 
GTGGTCAAGT 
AAAGCTGAAA 
ATTCTTCAAC 
CTGGTATTGT 
AACATTGAGC 



TGGTCCAATG 
GGTGTTCCAT 
TGAAGAATTA 
AATATGAATT 
AAAGCACTTC 
GGAAGCTGTA 
CATTCATGAT 
GTTGCAACTG 
AGAAGTTATC 
TAGAAATGTT 
GGCGCACTTC 
ATTAGCTAAA 
CTTATGTTTT 
AACTACCGCC 
TACACTTCCA 
TTGCAGTTGA 



CCACAAACTC 50 

ACATCGTTGT 100 

CTTGAATTAG 150 

CCCTGGCGAC 200 

AAGGTGAAGC 250 

GATTCTTACA 300 

GCCAGTTGAG 350 

GACGTGTTGA 400 

GGTATTGAAG 450 

CCGTAAATTA 500 

TACGTGGTGT 550 

CCAGGTTCGA 600 

AACTAAAGAA 650 

CACAATTCTA 700 

GAAGGTACTG 750 

ACTAATTGCA 800 
818 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1726 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: LSPQ 5093202 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1726 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 

897 



50 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GTGAACATAT CTTACTTTCA 

TTCATGAACA AATGTGACAT 

TGAAATGGAA ATTCGTGATC 

ACATTCCTGT AATCAAAGGT 

5 GACTGGGAAG CTAAAATTGA 

TCCAACTCCW GAACGTGATA 

ATGTATTCTC AATCACTGGT 

CGTGGACAAG TTAAAGTTGG 

AGAAAGCAAA AAAGTAGTAG 

- 1-0 TAGACTACGC TGAAGCTGGC 

GCTCGTGAAG ATATCCAACR 

TACTCCACAC ACTAACTTCA 

AAGGTGGACG TCACACTCCA 

TTCCGTACTA CTGACGTAAC 

15 AATGGTAAYG CCTGGTGATA 

CAATCGCTAT CGAAGAC 



CGTCAAGTTG GTGTTCCATA 

GGTTGACGAT GAAGAATTAC 

TATTAACTGA ATATGAATTC 

TCAGCTCTTA AAGCACTTCA 

CGAGTTAATG GAAGCTGTAG' 

CTGACAAACC ATTCATGATG 

CGTGGAACAG TTGCAACTGG 

TGACGAAGTA GAAGTTATCG 

TAACTGGAGT AGAAATGTTC 

GACAACATTG GCGCACTTCT- 

TGGTCAAGTA TTAGCTAAAC 

AAGCTGAAAC TTATGTTTTA 

TTCTTCAACA ACTACCGCCC 

TGGTATTGTT ACACTTCCAG 

ACATTGAGCT TGCAGTTGAA 



CATCGTTGTA 100 

TAGAATTAGT 150 

CCTGGCGATG 200 

AGGTGAAGCT 250 

ATTCTTACAT 300 

CCAGTTGAGG 350 

ACGTGTTGAA 400 

GTATCGAAGA 450 

CGTAAATTAC 500 

ACGTGGTGTT 550 - 

CAGGTTCGAT 600 

ACTAAAGAAG 650 

ACAATTCTAT 700 

AAGGTACTGA 750 

CTAATTGCAC 800 
817 



20 2) INFORMATION FOR SEQ ID NO: 1727 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria seeligeri 

(B) STRAIN: ATCC 35967 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1727 



35 



40 



45 



50 



CGGAGCTATC 


TTAGTAGTAT 


CTGCTGCTGA 


TGGCCCAATG 


CCACAAACTC 


50 


GTGAACATAT 


CTTACTTTCA 


CGTCAAGTTG 


GTGTTCCATA 


CATCGTTGTA 


100 


TTCATGAACA 


AATGTGACAT 


GGTTGACGAT 


GAAGAATTAC 


TTGAATTAGT 


150 


TGAAATGGAA 


ATTCGTGATC 


TATTAACTGA 


ATATGAATTC 


CCTGGTGATG 


200 


ACATTCCTGT 


AATCAAAGGT 


TCAGCTCTTA 


AAGCACTTCA 


AGGTGAAGCT 


250 


GACTGGGAAG 


CTAAAATTGA 


CGAGTTAATG 


GAAGCTGTAG 


ATTCTTACAT 


300 


TCCAACTCCA 


GAACGTGATA 


CTGACAAACC 


ATTCATGATG 


CCAGTTGAGG 


350 


ATGTATTCTC 


AATCACTGGT 


CGTGGAACTG 


TTGCAACTGG 


ACGTGTTGAA 


400 


CGTGGACAAG 


TTAAAGTTGG 


TGACGAAGTA 


GAAGTTATCG 


GTATTGAAGA 


450 


AGAAAGCAAA 


AAAGTAATAG 


TAACTGGAGT 


AGAAATGTTC 


CGTAAATTAC 


500 


TAGACTACGC 


TGAAGCTGGC 


GACAACATTG 


GCGCACTTCT 


ACGTGGTGTT 


550 


GCTCGTGAAG 


ATATCCAACG 


TGGTCAAGTA 


TTAGCTAAAC 


CAGGTTCGAT 


600 


TACTCCACAT 


ACTAACTTCA 


AAGCTGAAAC 


TTATGTTTTA 


ACTAAAGAAG 


650 


AAGGTGGACG 


TCACACTCCA 


TTCTTCAACA 


ACTACCGCCC 


ACAATTCTAT 


700 


TTCCGTACTA 


CTGACGTAAC 


TGGTATTGTT 


ACACTTCCAG 


AAGGTACTGA 


750 


AATGGTAATG 


CCTGGTGATA 


ACATTGAGCT 


TGCAGTTGAA 


CTAATTGCAC 


800 


CAATCGCTAT 


CGAAGAC 








817 



898 
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15 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1728 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 25923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1728 



PCT/CAOO/01150 



20 



25 



30 



CGGTGGTATC 


TTAGTAGTAT 


CTGCTGCTGA 


CGGTCCAATG 


CCACAAACTC 


50 


GTGAACACAT 


TCTTTTATCA 


CGTAACGTTG 


GTGTACCAGC 


ATTAGTAGTA 


100 


TTCTTAAACA 


AAGTTGACAT 


GGTTGACGAT 


GAAGAATTAT 


TAGAATTAGT 


150 


AGAAATGGAA 


GTTCGTGACT 


TATTAAGCGA 


ATATGACTTC 


CCAGGTGACG 


200 


ATGTACCTGT 


AATCGCTGGT 


TCAGCATTAR 


AAGCTTTAGA 


AGGCGATGCT 


250 


CAATACGAAG 


AAAAAATCTT 


AGAATTARTG 


GAAGCTGTAG 


ATACTTACAT 


300 


TCCAACTCCA 


GAACGTGATT 


CTGACAAACC 


ATTCATGATG 


CCAGTTGAGG 


350 


ACGTATTCTC 


AATCACTGGT 


CGTGGTACTG 


TTGCTACAGG 


CCGTGTTGAA 


4 00 


CGTGGTCAAA 


TCAAAGTTGG 


TGAAGAAGTT 


GAAATCATCG 


GTTTACATGA 


450 


CACATCTAAA 


ACAACTGTTA 


CAGGTGTTGA 


AATGTTCCGT 


AAATTATTAG 


500 


ACTACGCTGA 


AGCTGGTGAC 


AACATTGGTG 


CATTATTACG 


TGGTGTTGCT 


550 


CGTGAAGACG 


TACAACGTGG 


TCAAGTATTA 


GCTGCTCCTG 


GTTCAATTAC 


600 


ACCACATACT 


GAATTCAAAG 


CAGAAGTATA 


CGTATTATCA 


AAAGACGAAG 


650 


GTGGACGTCA 


CACTCCATTC 


TTCTCAAACT 


ATCGTCCACA 


ATTCTATTTC 


700 


CGTACTACTG 


ACGTAACTGG 


TGTTGTTCAC 


TTACCAGAAG 


GTACTGAAAT 


750 


GGTAATGCCT 


GGTGATAACG 


TTGAAATGAC 


AGTAGAATTA 


ATCGCTCCAA 


800 


TCGCGATTGA 


AGAC 








814 
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2) INFORMATION FOR SEQ ID NO: 1729 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1729 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 

899 
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GTGAACACAT TCTTTTATCA 

TTCTTAAACA AAGTTGACAT 

AGAAATGGAA GTTCGTGRCT 

ATGTACCTGT AATCTCTGGT 

5 GACTATGAGC AAAAAATCTT 

TCCAACACCA GAACGTGATT 

ACGTATTCTC AATCACTGGT 

CGTGGTCAAA TCAAAGTCGG 

AGAATCAAGC AAAACAACTG 

10 TAGACTACGC -TGAAGCTGGT 

TCACGTGATG ATGTACAACG 

CACACCACAT ACAAAATTCA 

AAGGTGGTCG TCATACGCCA 

TTCCGTACTA CTGACGTAAC 

15 AATGGTTATG CCTGGCGATA 

CAATCGCTAT TGAAGAC 



CGTRACGTTG GTGYTCCAGC 

GGTTGACGAY GAAGAATTAT 

TATTAAGCGA ATATGACTTC 

TCTGCATTAA AAGCTTTAGA 

AGACTTAATG CAAGCTGTTG 

CTGACAAACC ATTCATGATG 

CGTGGTACTG TTGCTACAGG 

TGAAGAAATC GARATCATCG 

TTACTGGTGT AGAAATGTTC 

GACAACATTG.. GTGCATTATT 

TGGTCAAGTT TTAGCTGCTC 

AAGCGGATGT TTACGTTTTA 

TTCTTCACTA ACTACCGCCC 

TGGTGTTGTT AACTTACCAG 

ACGTTGAAAT GGATGTTGAA 



ATTAGTTGTA 


100 


TAGAATTRGT 


150 


CCAGGTGACG 


200 


AGGCGACGCT 


250 


ATGACTYCAT 


300 


CCAGTTGAGG 


350 


CCGTGTTGAA 


400 


GTATGCAAGA 


450 


CGTAAATTAT 


500 


ACGTGGTGTT . 


550. 


CTGGTACTAT 


600 


TCTAAAGATG 


650 


ACAATTCTAT 


700 


AAGGTACTGA 


750 


TTAATTTCTC 


800 




817 



20 2) INFORMATION FOR SEQ ID NO: 1730 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1730 



35 



40 



45 



50 



CGGCGGTATC TTAGTAGTAT 

GTGAACACAT CTTATTATCA 

TTCTTAAACA AAGCTGACAT 

TGAAATGGAA GTTCGTGACT 

ATGTACCAGT TATCGTTGGT 

GAATACGAAC AAAAAATCTT 

CCCAACTCCA GAACGTGACT 

ACGTATTCTC AATCACTGGT 

CGTGGTCAAA TCAAAGTCGG 

AGAAAGCAAG AAAACAACAG 

TAGACTACGC TGAAGCTGGT 

GCACGTGAAG ACGTACAACG 

TACTCCACAC ACAAAATTCA 

AAGGTGGACG TCATACTCCA 

TTCCGTACTA CTGACGTAAC 

AATGGTTATG CCTGGCGATA 

CAATCGCGAT TGAAGAC 



CTGCTGCAGA 
CGTAACGTTG 
GGTTGACGAC 
TATTATCTGA 
TCTGCATTAA 
AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGATGT 
TTCTTCACTA 
TGGCGTTGTT 
ACGTAGAAAT 



TGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCTGTAG 
ATTCATGATG 
TAGCAACAGG 
GAAATCATCG 
AGAAATGTTC 
GTGCTTTATT 
TTAGCAGCTC 
TTACGTTTTA 
ACTACCGCCC 
CACTTACCAG 
GACTGTTGAA 



CCACAAACTC 50 

TTTAGTTGTA 100 

TAGAATTAGT 150 

CCTGGTGACG 200 

AGGCGACCCA 250 

ATGACTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 400 

GTATCACTGA 450 

CGTAAATTAT 500 

ACGTGGTGTT 550 

CTGGCTCTAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 750 

TTGATCGCTC 800 
817 



900 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1731 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 
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{ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1731 
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CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ACCTTCCAGT 
AAATACGAAG 
TCCAGAACCA 
ATGTATTCTC 
CGTGGTACTG 
AGATATCCAA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTTG 
CCTTCTTTCA 
AAGTTGACCT 
ATTCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGATA 
AATCACTGGA 
TTCGTGTCAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTTA 
TCATACTCCA 
CTGACGTAAC 
CCTGGTGATA 
AGAACAA 



CTTCAACTGA 
CGTCAAGTTG 
TGTTGATGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAGTT 
TTACTGGTGT 
GACAACGTTG 
TGGTCAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTTACTAT 



TGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCACTTGA 
AGCACTGTTG 
TTTACTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
CTTGCTAAAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGAAGTTGAA 



CCACAAACTC 50 

CCTTATCGTA 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGCGACGAA 250 

ATGAGTACAT 300 

CCAGTTGAAG 350 

ACGTATCGAC 400 

GTATTAAAGA 450 

CGTAAACAAC 500 

TCGTGGTGTT 550 

CAGGTTCAAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

CAGGAACAGA 750 

TTGATTCACC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1732 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 27336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1732 
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CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
5 ATCTTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGTGGTATCG 

10' AGAAACTCRA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 

15 TTCCGTACTA 
AATGGTAATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCGTTAT 
GAACGTGACA 
AATCACTGGA 
TTAAAGTCAA 
AAAGCAGTTG 
TCTTGCTGGA 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTTAC 
CCTGGTGATA 
AGAACAA 



CTTCAACTGA 
CGTCAGGTTG 
GGTTGACGAC 
TATTGTCAGA 
TCAGCACTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAATC 
TTACTGGTGT 
GATAACGTAG 
TGGACAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACAAT 



CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCTCTTGA 
AACACAGTTG 
ATTGCTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTCCTTCT 
ATCGCTAAAC 
CTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGACGTTGAG 



CCACAAACTC 50 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGACG 200 

AGGTGACTCT 250 

ATGAGTATAT 300 

CCAGTCGAGG 350 

ACGTATCGAC 400 

GTATCAAAGA 450 

CGTAAACAAC 500 

TCGTGGTGTT 550 

CAGGTTCAAT 600 

ACTAAAGAAG 650 

ACAATTCTAC 700 

CAGGTACTGA 750 

TTGATTCACC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1733 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1733 



CGGTGCGATC 
GTGAGCACAT 
TTCATGAACA 

40 TGAAATGGAA 
ACATTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 

45 CGTGGTGTTG 
AGACATCCAA 
TTGACGRAGG 
CAACGTGATG 
CAACCCACAC 

50 AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGACA 
AATCACTGGT 
TTCGTGTCAA 
AAAGCAGTTG 
TATTGCCGGA 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
TGAACAA 



CATCTACTGA 
CGTCAGGTTG 
GGTTGACGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGACTTGATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACTGGTGT 
GATAACGTCG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACTAT 



CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGATTTC 
AAGCTCTTGA 
AACACTGTTG 
ATTGTTGCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
TTGGCTGCAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCTG 
CGACGTTGAG 



CCACAAACTC 50 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGTGATTCT 250 

ACGAATACAT 300 

CCAGTCGAAG 350 

ACGTATCGAC 400 

GTCTTAAAGA 450 

CGTAAACAAC 500 

TCGTGGTATC 550 

CTGGTTCAAT 600 

TCTAAAGAAG 650 

ACAGTTCTAC 700 

CAGGTACTGA 750 

TTGATCCACC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1734 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Agrojba.cterium tumefaciens 
(C) ACCESSION NUMBER: x99673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1734 
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AACATGATCA 
GGCTGCCGAC 
GTCAGGTGGG 
GTTGACGACG 
TCTGTCGTCC 
CGGCACTTGC 
ATCCGCGAGC 
TCCGATCGAC 
CGGGTCGTGG 
GTTGGTGAAG 
TGTTACCGGC 
GCGACAACAT 
CGTGGTCAGA 
CATGGCAGAA 
CGTTCTTCAC 
ACCGGTATCG 
CAACGTCACT 
AGCTGCGCTT 



CCGGTGCTGC 


CGAGATGGAC 


GGCGCGATCC 


TGGTTTGCTC 


50 


GGCCCGATGC 


CACAGACCCG 


CGAGCACATC 


CTGCTTGCCC 


100 


CGTTCCGGCC 


ATCGTCGTGT 


TCCTCAACAA 


GGTCGACCAG 


150 


CCGAGCTTCT 


CGAGCTCGTC 


GAGCTTGAAG 


TTCGCGAACT 


200 


TACGACTTCC 


CGGGCGACGA 


TATCCCGATC 


ATCAAGGGTT 


250 


TGCTCTTGAA 


GATTCTGACA 


AGAAGATCGG 


TGAAGACGCG 


300 


TGATGGCTGC 


TGTCGACGCC 


TACATCCCGA 


CGCCTGAGCG 


350 


CAGCCGTTCC 


TGATGCCGAT 


CGAAGACGTG 


TTCTCGATCT 


400 


TACGGTTGTG 


ACGGGTCGCG 


TTGAGCGCGG 


TATCGTCAAG 


450 


AAGTCGAAAT 


CGTCGGCATC 


CGTCCGACCT 


CGAAGACGAC 


500 


GTTGAAATGT 


TCCGCAAGCT 


GCTCGACCAG 


GGCCAGGCCG 


550 


CGGTGCACTC 


GTTCGCGGCG 


TTACCCGTGA 


CGGCGTCGAG 


600 


TCCTGTGCAA 


GCCGGGTTCG 


GTCAAGCCGC 


ACAAGAAGTT 


650 


GCCTACATCC 


TGACGAAGGA 


AGAAGGCGGC 


CGTCATACGC 


700 


GAACTACCGT 


CCGCAGTTCT 


ACTTCCGTAC 


GACTGACGTT 


750 


TTTCGCTTCC 


TGAAGGCACG 


GAAATGGTTA 


TGCCTGGCGA 


800 


GTTGAAGTCG 


AGCTGATCGT 


TCCGATCGCG 


ATGGAAGAAA 


850 


CGCTATCCGC 


GAAGGCGGCC 


GTACCGTCGG 


CGCCGGC 


897 
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2) INFORMATION FOR SEQ ID NO: 1735 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus subtilis 
50 (B) STRAIN: 168 

(C) ACCESSION NUMBER: Z99104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1735 

903 
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ATGATCACTG 


GTGCTGCGCA 


AATGGACGGA 


GCTATCCTTG 


TAGTATCTGC 


50 




TGCTGATGGC 


CCAATGCCAC 


AAACTCGTGA 


GCACATCCTT 


CTTTCTAAAA 


100 




ACGTTGGTGT 


ACCATACATC 


GTTGTATTCT 


TAAACAAATG 


CGACATGGTA 


150 


5 


GACGACGAAG 


AGCTTCTTGA 


ACTAGTTGAA 


ATGGAAGTTC 


GCGATCTTCT 


200 




TAGCGAATAC 


GACTTCCCTG 


GTGATGATGT 


ACCAGTTGTT 


AAAGGTTCTG 


250 




CTCTTAAAGC 


TCTTGAAGGA 


GACGCTGAGT 


GGGAAGCTAA 


AATCTTCGAA 


300 




CTTATGGATG 


CGGTTGATGA 


GTACATCCCA 


ACTCCAGAAC 


GCGACACTGA 


350 


- - 


- AAAACCATTC 


ATGATGCCAG 


TTGAGGACGT 


ATTCTCAATC 


ACTGGTCGTG 


400 


10 


GTACAGTTGC 


TACTGGCCGT 


" GTAGAACGCG 


GACAAGTTAA" 


-AGTCGGTGAC 


450 




GAAGTTGAAA 


TCATCGGTCT 


TCAAGAAGAG 


AACAAGAAAA 


CAACTGTTAC 


500 




AGGTGTTGAA 


ATGTTCCGTA 


AGCTTCTTGA 


TTACGCTGAA 


GCTGGTGACA 


550 




ACATTGGTGC 


CCTTCTTCGC 


GGTGTATCTC 


GTGAAGAAAT 


CCAACGTGGT 


600 




CAAGTACTTG 


CTAAACCAGG 


TACAATCACT 


CCACACAGCA 


AATTCAAAGC 


650 


15 


TGAAGTTTAC 


GTTCTTTCTA 


AAGAAGAGGG 


TGGACGTCAT 


ACTCCATTCT 


700 




TCTCTAACTA 


CCGTCCTCAG 


TTCTACTTCC 


GTACAACTGA 


CGTAACTGGT 


750 




ATCATCCATC 


TTCCAGAAGG 


CGTAGAAATG 


GTTATGCCTG 


GAGATAACAC 


800 




TGAAATGAAC 


GTTGAACTTA 


TTTCTACAAT 


CGCTATCGAA 


GAAGGAACTC 


850 




GTTTCTCTAT 


TCGTGAAGGC 


GGACGTACTG 


TTGGT 




885 
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2) INFORMATION FOR SEQ ID NO: 1736 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 
35 (B) STRAIN: DSM 2151 

(C) ACCESSION NUMBER: P33165 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1736 



40 ATGGTTACTG 
TACTGATGGT 
AGGTAAACGT 
GAAGATGCTG 
TTCATTCTAT 

45 CTCTTGGTGC 
CTGATGGAAG 
TAAACCTTTC 
GTACTGTAGC 
GAAATCGAAA 

50 TGTTGAAATG 
TAGGTCTGTT 
GTTCTTTGTA 
GGTTTATATC 



GTGCTGCTCA 
CCGATGCCTC 
TCCGAAGCTG 
AGATGTTGGA 
GATTTCGACG 
ATTGAACGGC 
CTGTTGATAC 
TTGATGCCGG 
TACAGGTCGT 
TCCTCGGTTT 
TTCCGCAAAC 
GCTTCGTGGT 
AACCGGGTCA 
CTGAAGAAAG 



GATGGACGGT 
AGACTCGTGA 
GTTGTATTCA 
ACTTGTTGAA 
GTGACAATAC 
GTAGAAAAAT 
TTGGATTCCA 
TAGAAGACGT 
ATCGAAACTG 
GGGTGAAGAT 
TTCTGGATCA 
GTTGACAAGA 
GATTAAACCT 
AAGAAGGTGG 



GCTATCATTG 
GCACATCCTT 
TGAACAAGTG 
ATGGAAATGA 
TCCGATCATT 
GGGAAGACAA 
CTGCCTCCGC 
GTTCTCTATC 
GTGTTATCCA 
AAGAAATCAG 
GGGTGAAGCT 
ACGAAATCAA 
CACTCTAAAT 
TCGTCACACT 



TAGTTGCTGC 50 

TTGGCTCGTC 100 

CGATATGGTT 150 

GAGAATTGCT 200 

CAGGGTTCTG 250 

AGTAATGGAA 300 

GCGATGTTGA 350 

ACAGGTCGTG 400 

TGTAGGTGAT 450 

TTGTAACAGG 500 

GGTGACAACG 550 

ACGTGGTATG 600 

TCAAAGCAGA 650 

CCATTCCATA 700 
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ACAAATATCG TCCTCAGTTC TACCTGCGTA CTATGGACTG TACAGGTGAA 750 

ATCACTCTTC CGGAAGGAAC TGAAATGGTA ATGCCGGGTG ATAACGTAAC 800 

TATCACTGTA GAGTTGATCT ATCCGGTTGC ACTGAACATC GGTCTTCGTT 850 

TCGCTATCCG CGAAGGTGGA CGTACAGTAG GT 882 



2) INFORMATION FOR SEQ ID NO: 1737 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE:' Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 
20 (B) STRAIN: U78183 

(C) ACCESSION NUMBER: U78183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1737 



25 AATATGATTA 
TGCTGATAGT 
AAAGAATGGG 
GCAGATCCTG 
TGAAAAATAT 

30 TTGGGGCTAT 
GAACTTCTTG 
TGACAAGCCA 
GAGGCACTGT 
CAAGAAGTTG 

35 TGGTGTTGAA 
ATGTTGGTCT 
CAAGTTTTGT 
TTCAATTTAT 
TCCCAGGGTA 

40 GTTGTTGCTT 
TATTATTGTT 
TTGCTGTTCG 



CAGGAGCAGC 
GGTGCTGAGC 
AATAAAGAAA 
AACTTGTTGA 
GGCTTTTCAG 
GTCAAATCCA 
AATCTATGGA 
TTTTTGCTTG 
TGCTACTGGG 
AAATAGTTGG 
ATGTTCCAGA 
TCTTTTGAGA 
CAGCTCCAGG 
TGTTTGACTA 
TAGACCACAG 
TAGAGGGCAA 
GAGCTGATCT 
AGAAGGTGGA 



TCAAATGGAT 
CTCAAACAAA 
ATAATAGTTT 
GCTTGTTGAA 
CTGATACTCC 
GAAGATCCTG 
TAATTATTTT 
CTGTTGAAGA 
CGTATTGAAA 
AATTAAAGAA 
AAATTCTTGA 
GGCGTTGATA 
TACAATTACT 
AAGAAGAAGG 
TTCTTTTTTA 
AGAAATGGTT 
CTTCAATAGC 
AGAACCGTTG 



GCAGCGATAC 
AGAGCATTTG 
TTTTAAATAA 
GTTGAAGTTT 
AATAATCAAA 
AATCTACAAA 
GATCTTCCAG 
TGTATTTTCT 
GAGGTATTAT 
ACCAGAAAAA 
GCAAGGTCAA 
AAAAAGACAT 
CCACACAAGA 
CGGTAGGCAC 
GAACAACCGA 
ATGCCTGGTG 
TATGGATAAG 
CTTCAGGA 



TTTTAGTTGC 50 

CTTCTTGCTC 100 

ATTGGACTTA 150 

TAGAACTTGT 200 

GGTTCAGCTT 250 

ATGCGTTAAA 300 

AAAGAGATAT 350 

ATTTCAGGAA 400 

TAAAGTTGGT 450 

CTACTGTTAC 500 

GCAGGGGATA 550 

TGAGAGGGGG 600 

AATTTAAAGC 650 

AAGCCATTTT 700 

TGTTACTGGA 750 

ATAATGTTGA .800 

AATGTAGAAT 850 
888 



45 

2) INFORMATION FOR SEQ ID NO: 1738 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevibacterium linens 
5 (B) STRAIN: DSM 20425 

(C) ACCESSION NUMBER: X768 63 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1738 



1 fi 


ft ftPftTPftTPft" 
>VHbH 1 bri i b.H 


bbbbl bbbbb 


TP ft P ft TPP ft P 
1 b Ab A 1 bbAb- 


PPTPPPftTPP 
bb 1 bbbrllbb 


TPpTr*PTPPP 

Ibb Ibb Ibbb 


c n 
jU 




bbb 1 HLLoHL 


bbAbbbA 1 bb 


bbbAbAbbbb 


1 bAbbAbb 1 b 


blbblbbbbb 


1 on 




b 1 LHbb 1 bbb 


PPTTPPPTft P 

bb 1 Ibbb 1 Ab 


7\TPPTPPTPP 
Ai bb 1 bb 1 bb 


PTPTPft ftPft ft 
b I b 1 b/i/ib/lft 


rfprrs r*ft tp 
b i bbbAbA J. b 


lOU 




bl bbAl bAbb 


ft PP ftPPTPPT 

AbbAbb 1 bb 1 


r'P ft PPTPPTP 
bbAbb Ibb lb 


pft ft TTprarr 
bAAl IbbAbb 


i bbbbbAbbl 


zUU 




bbibi bbAbb 


bAbbAbi Ibb 


Ab bb AbAbAA 


bbb Ibbbblb 


Ai lbbbbrGT 


OCA 


lb 


bbbbTbTbAA 


GGbbLTGGAA 


GGCGACGAGA 


AbTbbbTbAA 


GAGCGTTCAG 


O A A 

300 




GATCTCATGG 


CTGCCGTCGA 


TGACAACGTT 


CCGGAGCCGG 


AGCGCGATGT 


350 




CGACAAGCCG 


TTCCTCATGC 


CCGTCGAGGA 


CGTCTTCACG 


ATCACCGGTC 


400 




GTGGAACCGT 


CGTCACCGGT 


CGTGTCGAGC 


GCGGCGTGCT 


CCTGCCTAAC 


450 




GACGAAATCG 


AAATCGTCGG 


CATCAAGGAG 


AAGTCGTCCA 


AGACGACTGT 


500 


20 


CACCGCTATC 


GAGATGTTCC 


GCAAGACCCT 


GCCGGATGCC 


CGTGCAGGTG 


550 




AGAACGTCGG 


TCTGCTCCTC 


CGCGGCACCA 


AGCGCGAGGA 


TGTTGAGCGC 


600 




GGTCAGGTCA 


TCGTGAAGCC 


GGGTTCGATC 


ACCCCGCACA 


CCAAGTTCGA 


650 




GGCTCAGGTC 


TACATCCTGA 


GCAAGGACGA 


GGGCGGACGT 


CACAACCCGT 


700 




TCTACTCGAA 


CTACCGTCCG 


CAGTTCTACT 


TCCGGACCAC 


GGACGTCACC 


750 


25 


GGTGTCATCA 


CGCTGCCCGA 


GGGCACCGAG 


ATGGTCATGC 


CCGGCGACAA 


800 




CACCGATATG 


TCGGTCGAGC 


TCATCCAGCC 


GATCGCTATG 


GAGGACCGCC 


850 




TCCGCTTCGC 


AATCCGCGAA 


GGTGGCCGCA 


CCGTCGGCGC 


CGGT 


894 



30 



2) INFORMATION FOR SEQ ID NO: 1739 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 888 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN: F/IC-Cal-13 

(C) ACCESSION NUMBER: L22216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1739 



ATGATCACGG GCGCAGCGCA GATGGACGGC GCGATCCTGG 

AGCAGACGGC CCGATGCCGC AAACGCGTGA GCACATCCTG 

50 AGGTTGGTGT TCCGTACATC ATCGTGTTCC TGAACAAGTG 

GACGACGCTG AACTGCTCGA GCTGGTCGAG ATGGAAGTTC 

GTCGAAGTAC GACTTCCCGG GCGACGACAC GCCGATCGTG 

CCAAGCTGGC GCTGGAAGGC GACACGGGCG AGCTGGGCGA 



TTTGCTCGGC 50 

CTGGCGCGTC 100 

CGACAGTGTG 150 

GCGAACTCCT 200 

AAGGGTTCGG 250 

AGTGGCGATC 300 



906 



SUBSTITUTE SHEET (RULE 26) 





WO 01/23604 








r CI / \* AUU/U1 15U 






ATGAGCCTGG 


CAGACGCGCT 


GGACACGTAC 


ATCCCGACGC 


CGGAGCGTGC 


350 




AGTTGACGGC 


GCGTTCCTGA 


TGCCGGTGGA 


AGACGTGTTC 


TCGATCTCGG 


400 




GCCGTGGTAC 


GGTGGTGACG 


GGTCGTGTCG 


AGCGCGGCAT 


CGTGAAGGTC 


450 




GGCGAAGAAA 


TCGAAATCGT 


CGGTATCAAG 


CCGACGGTGA 


AGACGACCTG 


500 


5 


CACGGGCGTT 


GAAATGTTCC 


GCAAGCTGCT 


GGACCAAGGT 


CAGGCAGGCG 


550 




ACAACGTCGG 


TATCCTGCTG 


CGCGGCACGA 


AGCGTGAAGA 


CGTGGAGCGT 


600 




GGCCAGGTTC 


TGGCGAAGCC 


GGGTTCGATC 


ACGCCGCACA 


CGCACTTCAC 


650 




GGCTGAAGTG 


TACGTGCTGA 


GCAAGGACGA 


AGGCGGCCGT 


CACACGCCGT 


700 


_ _ 


TCTTCAACAA 


CTACCGTCCG 


CAGTTCTACT 


TCCGTACGAC 


GGACGTGACG 


750 


10 


GGCTCGATCG 


AGCTGCCGAA 


GGACAAGGAA 


ATGGTGATGC - 


GGGGCGACAA 


800 




CGTGTCGATC 


ACGGTGAAGC 


TGATTGCTCC 


GATCGCGATG 


GAAGAAGGTC 


850 




TGCGCTTCGC 


AATCCGTGAA 


GGCGGCCGTA 


CGGTCGGC 




888 



15 



2) INFORMATION FOR SEQ ID NO: 1740 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 891 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fibrobacter succinogenes 

(B) STRAIN: S85 

(C) ACCESSION NUMBER: X76866 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1740 



AACATGGTGA 
CGCTACTGAC 

35 ACCAGGTTGG 
GTTGACGATG 
CCTCTCCAAG 
CCGCTCTCAA 
GAACTCATGA 

40 CGACAAGCCG 
GCGGCACTGT 
GACAAGGTTG 
CGGTGTTGAA 
ACGTTGGTCT 

45 ATGGTTCTCG 
TGAAATCTAC 
TGAATGGCTA 
ACGATCCAGC 
CACGATCCAC 

50 GCTTCGCTAT 



CTGGTGCTGC 
GGTCCGATGC 
CGTGCCGAAG 
CTGAAATTCT 
TATGACTTCG 
GGCCCTCGAA 
ACGCTTGCGA 
TTCCTCATGC 
CGCTACTGGC 
AACGTATCGG 
ATGTTCCGTA 
CCTCCTCCGT 
CAGCTCCGAA 
GTTCTCACGA 
CCGTCCGCAG 
TCCCGGAAGG 
GTGAACCTCA 
CCGTGAAGGT 



TCAGATGGAC 
CGCAGACTCG 
ATCGTCGTGT 
CGACCTCGTC 
ACGGTGACAA 
GGCGATCCGG 
CGAATACATC 
CGATCGAAGA 
CGTATCGAAC 
TCTCGGTGAA 
AGCTCCTCGA 
GGTGCTGAAA 
GTCTGTCACT 
AGGACGAAGG 
TTCTACTTCC 
TGTCGAAATG 
TCGCTCCGAT 
GGACGTACTG 



GGCGCTATCC 
CGAACACATC 
TCATGAACAA 
GAAATGGAAG 
CACCCCGATC 
AATACCAGGA 
CCGCTCCCGC 
CGTGTTCACG 
GCGGTGTCGT 
ACCACCGAAT 
CGACGCTCAG 
AGAAGGACAT 
CCGCACACCG 
TGGCCGTCAC 
GCACCACCGA 
GTTACTCCGG 
CGCTATGGAA 
TTGGTGCTGG 



TCGTTGTTGC 50 

CTTCTCGCTC 100 

GTGCGACATG 150 

TTCGCGAACT 200 

ATCCGTGGTT 250 

CAAGGTCATG 300 

AGCGCGATAC 350 

ATTACTGGCC 400 

TCGCTTGAAC 4 50 

ACGTCATCAC 500 

GCAGGTGACA 550 

CGTCCGTGGC 600 

AATTTAAGGC 650 

ACGCCGTTCA 700 

CGTTACTGGT 750 

GTGACACGGT 800 

AAGCAGCTCC 850 

C 891 



907 



SUBSTITUTE SHEET (RULE 26) 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 1741 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

- (ii) .MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Flavobacterium ferrugineum 

(B) STRAIN: DSM 13524 

(C) ACCESSION NUMBER: X76867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1741 





AACATGATCA 


CCGGTGCTGC 


CCAGATGGAC 


GGTGCTATCT 


TAGTTGTGGC 


50 




TGCATCAGAC 


GGTCCTATGC 


CTCAAACAAA 


AGAACACATC 


CTGCTTGCTG 


100 


20 


CCCAGGTAGG 


TGTACCTAAA 


ATGGTTGTGT 


TTCTGAATAA 


AGTTGACCTC 


150 




GTTGACGACG 


AAGAGCTCCT 


GGAGCTGGTT 


GAGATCGAGG 


TTCGCGAAGA 


200 




ACTGACTAAA 


CGCGGTTTCG 


ACGGCGACAA 


CACTCCAATC 


ATCAAAGGTT 


250 




CCGCTACAGG 


CGCCCTCGCT 


GGTGAAGAAA 


AGTGGGTTAA 


AGAAATTGAA 


300 




AACCTGATGG 


ACGCTGTTGA 


CAGCTACATC 


CCACTGCCTC 


CTCGTCCGGT 


350 


25 


TGATCTGCCG 


TTCCTGATGA 


GCGTAGAGGA 


CGTATTCTCT 


ATCACTGGTC 


400 




GTGGTACTGT 


TGCTACCGGT 


CGTATCGAGC 


GTGGCCGTAT 


CAAAGTTGGT 


450 




GAGCCTGTTG 


AGATCGTAGG 


TCTGCAGGAG 


TCTCCCCTGA 


ACTCTACCGT 


500 




TACAGGTGTT 


GAGATGTTCC 


GCAAACTCCT 


CGACGAAGGT 


GAAGCTGGTG 


550 




ATAACGCCGG 


TCTCCTCCTC 


CGTGGTGTTG 


AAAAAACACA 


GATCCGTCGC 


600 


30 


GGTATGGTAA 


TCGTTAAACC 


CGGTTCCATC 


ACTCCGCACA 


CGGACTTCAA 


650 




AGGCGAAGTT 


TACGTACTGA 


GCAAAGACGA 


AGGTGGCCGT 


CACACTCCAT 


700 




TCTTCAACAA 


ATACCGTCCT 


CAATTCTACT 


TCCGTACAAC 


TGACGTTACA 


750 




GGTGAAGTAG 


AACTGAACGC 


AGGAACAGAA 


ATGGTTATGC 


CTGGTGATAA 


800 




CACCAACCTG 


ACCGTTAAAC 


TGATCCAACC 


GATCGCTATG 


GAAAAAGGTC 


850 


35 


TGAAATTCGC 


GATCCGCGAA 


GGTGGCCGTA 


CCGTAGGTGC 


AGGA 


894 



2) INFORMATION FOR SEQ ID NO: 1742 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Helicobacter pylori 

(B) STRAIN: 26695 

(C) ACCESSION NUMBER: AE000626 



908 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1742 



AACATGATCA 
TGCAGCTGAT 
5 GTCAAGTAGG 
GTAGATGACC 
GTTGAGCGCG 
CAGCTTTAAG 
GGTGAAAAAG 

10 TCCAGAAAGA 
TCTCTATTGC 
GTGGTGAAAG 
AAAAACGACT 
GTGAAGCCGG 

15 GAAGTGGAAC 
CAAGAAATTT 
GACACACTCC 
ACTGATGTGA 
GCCTGGCGAT 

20 TAGAGTTGGG 
GCTGGT 



CCGGTGCGGC 
GGCCCTATGC 
CGTGCCTCAC 
AAGAATTGTT 
TATGAATTTC 
AGCTTTAGAA 
TGCTTAAACT 
GACACTGAAA 
GGGTAGAGGG 
TAGGCGATGA 
GTAACCGGTG 
CGATAATGTG 
GCGGTATGGT 
GAGGGAGAAA 
ATTCTTCACC 
CTGGCTCTAT 
AATGTGAAAA 
AACTAAATTT 



GCAAATGGAC 
CTCAAACTAG 
ATCGTTGTTT 
AGAACTTGTA 
CTGGCGATGA 
GAAGCAAAGG 
TATGGCTGAA 
AAACTTTCTT 
ACTGTGGTTA 
AGTGGAAATC 
TAGAAATGTT 
GGCGTGCTTT 
TCTATGCAAA 
TTTATGTCCT 
AATTACCGCC 
CACCCTTCCT 
TCACTGTAGA 
GCGATTCGTG 



GGAGCGATTT 
GGAGCATATC 
TCTTAAACAA 
GAAATGGAAG 
CACTCCTATC 
CTGGTAATGT 
GTGGATGCCT 
GATGCCGGTT 
CAGGTAGGAT 
GTTGGTATCA 
TAGGAAAGAG 
TGAGAGGAAC 
CCAGGTTCTA 
TTCTAAAGAA 
CGCAATTCTA 
GAAGGCGTAG 
GTTGATTAGC 
AAGGCGGTAG 



PCT/CA00/01150 



TGGTTGTTTC 50 

TTATTGTCTC 100 

ACAAGACATG 150 

TGCGCGAATT 200 

GTAGCGGGTT 250 

GGGTGAATGG 300 

ATATCCCTAC 350 

GAAGATGTGT 400 

TGAAAGAGGC 450 

GACCTACACA 500 

TTGGAAAAAG 550 

TAAAAAAGAA 600 

TCACTCCGCA 650 

GAAGGCGGGA 700 

TGTGCGCACA 750 

AAATGGTTAT 800 

CCTGTTGCGT 850 

GACCGTTGGT 900 
906 



25 2) INFORMATION FOR SEQ ID NO: 1743 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus luteus 

(B) STRAIN: IFO 3333 

(C) ACCESSION NUMBER: M17788 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1743 



AACATGATCA 
CGCTACCGAC 
GCCAGGTCGG 

45 GTGGAGGACG 
GCTGTCCTCC 
CCGCTCTGAA 
GACCTCATGG 
GGACAAGCCG 

50 GTGGCACCGT 
TCCGAGGTCG 
CGGCATCGAG 
ACTGCGGTCT 



CCGGCGCCGC 
GGCCCGATGG 
CGTGCCGGCC 
AGGAGCTCCT 
AGGAGCTTCG 
GGCCCTCGAG 
ATGCCGTGGA 
TTCCTGATGC 
GGTGACCGGT 
AGATCGTCGG 
ATGTTCCACA 
GCTCGTGCGC 



TCAGATGGAC 
CCCAGACCCG 
CTGCTCGTGG 
CGAGCGTGTC 
ACGTCGACGA 
GGCGACCCCC 
CGAGTACATC 
CGATCGAGGA 
CGCGCCGAGC 
CATCCGCGAC 
AGCAGCTCGA 
GGTCTGAAGC 



GGCGCGATCC 
TGAGCACGTG 
CCCTGAACAA 
GAGATGGAGG 
GGCCCCGGTC 
AGTGGGTCAA 
CCGGACCCGG 
CGTCTTCACG 
GCGGCACCCT 
GTGCAGAAGA 
CGAGGCCTGG 
GCGACGACGT 



TCGTGGTCGC 50 

CTCCTGGCCC 100 

GTCGGACATG 150 

TCCGGCAGCT 200 

ATCCGCACCT 250 

GTCCGTCGAG 300 

TGCGCGACAA 350 

ATCACCGGCC 400 

GAAGATCAAC 450 

CCACTGTCAC 500 

GCCGGCGAGA 550 

CGAGCGCGGC 600 



909 
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CAGGTGCTGG TGGAGCCGGG CTCCATCACC 

GAACGTCTAC ATCCTGTCCA AGGACGAGGG 

ACTCGAACTA CCGCGCGCAG TTCTACTTCC 

GTCATCACGC TGCCCGAGGG CACCGAGATG 

5 CGAGATGTCG GTCGAGCTCA TCCAGCCGAT 

GCTTCGCCAT CCGCGAGGGT GGCCGCACCG 



CCGCACACCA 


ACTTCGAGGC 


650 


TGGGCGTCAC 


ACCCCGTTCT 


700 


GCACCACCGA 


CGTCACCGGC 


750 


GTCATGCCCG 


GCGACACCAC 


800 


CGCCATGGAG 


GAGGGCCTCG 


850 


TGGGCTCCGG 


C 


891 



10' 2) INFORMATION FOR "SEQ ID NO: -17-44 



15 



[i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: ■ Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE : Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Erdmann 

(C) ACCESSION NUMBER: X63539 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1744 



AACATGATCA 
CGCCACCGAC 
GTCAAGTGGG 

30 GTGGACGACG 
GCTGGCTGCC 
CGCTCAAGGC 
CTGATGAACG 
CAAGCCGTTC 

35 GAACCGTGGT 
GAAGTTGAGA 
CGGTGTGGAG 
ACGTTGGTTT 
CAGGTTGTCA 

40 CCAGGTCTAC 
TCAACAACTA 
GTGGTGACAC 
CAACATCTCG 
GTTTCGCGAT 

45 



CCGGCGCCGC 
GGCCCGATGC 
TGTGCCCTAC 
AGGAGCTGCT 
CAGGAATTCG 
GCTCGAGGGT 
CGGTCGACGA 
CTGATGCCGG 
CACCGGACGT 
TCGTCGGCAT 
ATGTTCCGCA 
GCTGCTGCGG 
CCAAGCCCGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAGGG 
GTGAAGTTGA 
CCGCGAGGGT 



GCAGATGGAC 
CCCAGACCCG 
ATCCTGGTAG 
CGAACTCGTC 
ACGAGGACGC 
GACGCGAAGT 
GTCGATTCCG 
TCGAGGACGT 
GTGGAGCGCG 
TCGCCCATCG 
AGCTGCTCGA 
GGCGTCAAGC 
CACCACCACG 
AGGACGAGGG 
TTCTACTTCC 
CACCGAGATG 
TCCAGCCCGT 
GGCCGCACCG 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 
GCGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 
TGGGCGCCGG 



TGGTGGTCGC 50 

CTGCTGGCGC 100 

GGCCGACGCA 150 

TCCGCGAGCT 200 

CGGGTCTCGG 250 

TGTCGAGGAA 300 

GCGAGACCGA 350 

ACCGGCCGCG 400 

CGTGAACGAG 450 

CCACCGTCAC 500 

GCGGGCGACA 550 

CGAGCGTGGC 600 

AGTTCGAAGG 650 

ACGCCGTTCT 700 

CGTGACCGGT 750 

GTGACAACAC 800 

GAAGGTCTGC 850 

C 891 



2) INFORMATION FOR SEQ ID NO: 1745 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

910 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
(D) 



TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma genitalium 

(B) STRAIN: G37 

(C) ACCESSION NUMBER: U397 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 5 



AATATGATCA 
AGCAACTGAT 
GCCAAGTAGG 

15 GCTAGTGATG 
GTTAACTTCC 
CAGCTTTAAA 
GATTTGATTA 
AGATAAACCT 

20 GAGGTACAGT 
CAAGAAGTTG 
TGGAATTGAA 
ATGCTGGGGT 
CAAGTTTTAG 

25 TGAGATCTAT 
TAAACGGTTA 
TCTATTGCTT 
TTCTATTACT 
AGTTCTCAAT 

30 



CAGGTGCTGC 
AGTGTGATGC 
GGTTCCTAAA 
AAGAGGTACA 
TATGGTTTTG 
AGCATTGGAA 
AAGCAGTTGA 
TTCTTATTAG 
TGTTACAGGA 
AAATTGTTGG 
ATGTTCAAAA 
ATTATTACGT 
CAAAACCAGG 
GCTTTAAAGA 
CCGTCCTCAA 
TAGCTGAAAA 
GTTGAGTTAA 
TCGTGAAGGT 



ACAAATGGAT 
CCCAAACCCG 
ATGGTAGTTT 
AGAACTTGTT 
ATGGTAAGAA 
GGTGATCCAA 
TGAATGGATT 
CAATTGAAGA 
AGAGTTGAAA 
TTTAAAACCA 
AGGAACTTGA 
GGTGTTGAAC 
CTCTATTAAA 
AAGAAGAAGG 
TTCTATTTCC 
TACTGAAATG 
TTGCTCCTAT 
GGTAGAACTG 



GGAGCTATTC 
CGAGCACATC 
TTCTAAACAA 
GCTGAAGAAG 
CACTCCTATT 
AGTGGGAGGC 
CCAACTCCTA 
TACGATGACC 
GAGGTGAACT 
ATTAGAAAAG 
TTCAGCAATG 
GTAAAGAAGT 
CCGCACAAGA 
TGGTAGACAC 
GTACCACTGA 
GTTCTACCTG 
CGCTTGTGAA 
TAGGGGCAGG 



TAGTTGTTTC 50 

TTACTTGCCC 100 

GTGTGATATT 150 

TACGTGATCT 200 

ATTTATGGCT 250 

TAAGATCCAT 300 

CACGTGAAGT 350 

ATTACTGGTA 400 

CAAAGTAGGT 4 50 

CAGTTGTTAC 500 

GCTGGTGACA 550 

TGAAAGAGGT 600 

AATTTAAAGC 650 

ACTGGTTTTT 700 

TGTAACTGGT 750 

GTGATAATGC 800 

AAAGGTAGTA 850 

C 891 



2) INFORMATION FOR SEQ ID NO: 1746 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 
45 (B) STRAIN: MS11 

(C) ACCESSION NUMBER: L36380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1746 

50 AACATGATTA CCGGCGCCGC ACAAATGGAC GGTGCAATCC TGGTATGTTC 50 

TGCTGCCGAC GGCCCTATGC CGCAAACCCG CGAACACATC CTGCTGGCCC 100 

GTCAAGTAGG CGTACCTTAC ATCATCGTGT TCATGAACAA ATGCGACATG 150 

GTCGACGATG CCGAGCTGTT CCAACTGGTT GAAATGGAAA TCCGCGACCT 200 

911 
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GCTGTCCAGC 
CCGCACTGAA 
GAACTGGCTA 
GGACAAACCA 
5 GCGGTACCGT 
GACGAGATTG 
CGGCGTTGAA 
ACGTAGGCGT 
- CAGGTATTGG- 
10 AGAAGTGTAC 
TCGCCAACTA 
AC GAT TACT T 
AACCATTACT 
GCTTTGCGAT 

15 



TACGACTTCC 
AGCCTTGGAA 
CCGCATTGGA 
TTCCTGCTGC 
AGTCACCGGC 
AAATCGTCGG 
ATGTTCCGCA 
ATTGCTGCGC 
CCAAACGGGG 
GTATTGAGCA 
CCGTCCCCAA 
TGGAAAAAGG 
GTAGAACTGA 
TCGCGAAGGC 



CCGGCGACGA 
GGCGATGCCG 
CAGATACATC 
CTATCGAAGA 
CGTGTAGAGC 
TCTGAAAGAA 
AACTGCTGGA 
GGTACCAAAC 
TACTATCACT 
AAGAAGAGGG 
TTCTACTTCC 
TGTGGAAATG 
TTGCGCCTAT 
GGCCGTACCG 



CTGCCCGATC 
CTTACGAAGA 
CCGACTCCCG 
CGTGTTCTCC 
GAGGTATCAT 
ACCCAAAAAA 
CGAAGGTCAG 
GTGAAGACGT 
CCTCACACCA 
CGGCCCCCAT 
GTACCACTGA 
GTAATGCCGG 
CGCTATGGAA 
TGGGTGCCGG 



GTACAAGGTT 250 

AAAAATCTTC 300 

AGCGTGCCGT 350 

ATTTCCGGCC 400 

CCACGTTGGT 450 

CCACCTGTAC 500 

GCGGGCGACA 550 

AGAACGCGGT 600 

AGTTCAAAGC 650 

ACCCCGTTTT 700 

CGTAACCGGC 750 

GTGAGAACGT 800 

GAAGGTCTGC 850 

C 891 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1747 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rickettsia prowazekii 

(B) STRAIN: Madrid E 

(C) ACCESSION NUMBER: Z54170 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1747 



35 AATATGATAA 
TGCTGCTGAT 
AACAGGTAGG 
GTAGATGATC 
ATTATCAAAA 

40 CTGCACTTCA 
GAGTTAATGA 
AGATAAACCT 
GAGGTACCGT 
GAAGAAATTG 

45 AGGTGTAGAA 
ATGTCGGTAT 
CAAGTACTTG 
TGAAGTGTAT 
CTAATGATTA 

50 ACAATAAAAT 
TACTTTTTCA 
AATTCTCTAT 



CTGGTGCCGC 
GGTCCTATGC 
TGTACCTGCT 
CTGACCTATT 
TATGGTTTCC 
AGCTTTAGAA 
ATGCAGTAGA 
TTTTTAATGC 
TGTAACTGGT 
AAATAGTAGG 
ATGTTCAGAA 
ATTACTACGT 
CAAAACCTGG 
GTGCTTAGTA 
TCGCCCACAG 
TGCCTTCTGA 
GTAGAATTAA 
ACGTGAAGGT 



TCAGATGGAT 
CTCAAACTAG 
ATGGTAGTAT 
AGAATTAGTT 
CTGGTAATGA 
GGAAAACCTG 
TACGTATATA 
CAATAGAGGA 
AGAGTGGAGT 
TCTAAAAAAT 
AATTACTTGA 
GGTACAAAAA 
GAGCATAAAA 
AAGAGGAAGG 
TTCTATTTTA 
TAAGCAGATG 
TTAAGCCGAT 
GGTAGAACAG 



GGTGCTATAT 
AGAACATATA 
TTTTGAATAA 
GAGATGGAAG 
AATACCTATT 
AAGGTGAAAA 
CCTCAGCCTA 
TGTATTTTCT 
CAGGCATAAT 
ACGCAAAAAA 
TGAAGGACAA 
GAGAAGAAGT 
CCGCATGATA 
TGGACGTCAT 
GAACAACAGA 
GTTATGCCTG 
TGCTATGCAA 
TAGGAGCCGG 



TAGTAGTTTC 50 

TTACTGGCAA 100 

AGTAGATATG 150 

TAAGAGAATT 200 

ATTAAAGGTT 250 

AGCTATTAAT 300 

TAGAGCTACA 350 

ATTTCAGGCA 400 

TAAGGTGGGT 450 

CGACTTGTAC 500 

TCTGGAGATA 550 

AGAAAGAGGA 600 

AATTTGAAGC 650 

ACCCCATTTA 700 

TGTTACCGGC 750 

GAGATAATGC 800 

GAAGGGTTAA 850 

T 891 



912 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 1748 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: LT2 trpE91 

(C) ACCESSION NUMBER: X55116 

t 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1748 





AACATGATCA 


CCGGTGCTGC 


TCAGATGGAC 


GGCGCGATCC 


TGGTTGTTGC 


50 


20 


TGCGACTGAC 


GGCCCGATGC 


CGCAGACCCG 


TGAGCACATC 


CTGCTGGGTC 


100 




GTCAGGTAGG 


CGTTCCGTAC 


ATCATCGTGT 


TCCTGAACAA 


ATGCGACATG 


150 




GTTGATGACG 


AAGAGCTGCT 


GGAACTGGTT 


GAGATGGAAG 


TTCGCGAACT 


200 




GCTGTCTCAG 


TACGACTTCC 


CGGGCGACGA 


CACTCCGATC 


GTTCGTGGTT 


250 




CTGCTCTGAA 


AGCGCTGGAA 


GGCGACGCAG 


AGTGGGAAGC 


GAAAATCATC 


300 


25 


GAACTGGCTG 


GCTTCCTGGA 


TTCTTATATT 


CCGGAACCAG 


AGCGTGCGAT 


350 




TGACAAGCCG 


TTCCTGCTGC 


CGATCGAAGA 


CGTATTCTCC 


ATCTCCGGTC 


400 




GTGGTACCGT 


TGTTACCGGT 


CGTGTAGAGC 


GCGGTATCAT 


CAAAGTGGGC 


450 




GAAGAAGTTG 


AAATCGTTGG 


TATCAAAGAG 


ACTCAGAAGT 


CTACCTGTAC 


500 




TGGCGTTGAA 


ATGTTCCGCA 


AACTGCTGGA 


CGAAGGCCGT 


GCCGGTGAGA 


550 


30 


ACGTAGGTGT 


TCTGCTGCGT 


GGTATCAAAC 


GTGAAGAAAT 


CGAACGTGGT 


600 




CAGGTACTGG 


CTAAGCCGGG 


CACCATCAAG 


CCGCACACCA 


AGTTCGAATC 


650 




TGAAGTGTAC 


ATTCTGTCCA 


AAGATGAAGG 


CGGCCGTCAT 


ACTCCGTTCT 


700 




TCAAAGGCTA 


CCGTCCGCAG 


TTCTACTTCC 


GTACTACTGA 


CGTGACTGGT 


750 




ACCATCGAAC 


TGCCGGAAGG 


CGTAGAGATG 


GTAATGCCGG 


GCGACAACAT 


800 


35 


CAAAATGGTT 


GTTACCCTGA 


TCCACCCGAT 


CGCGATGGAC 


GACGGTCTGC 


850 




GTTTCGCAAT 


CCGTGAAGGC 


GGCCGTACCG 


TTGGCGCGGG 


C 


891 



40 2) INFORMATION FOR SEQ ID NO: 1749 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: DSM 50426 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1749 





ATGATCACTG 


GTGCTGCACA 


GATGGACGGC 


GCGATTCTGG 


TAGTCGCTTC 


50 




AACAGACGGT 


CCAATGCCAC 


AGACTCGTGA 


GCACATCCTG 


CTTTCTCGTC 


100 


5 


AGGTTGGCGT 


ACCATTCATC 


ATCGTATTCA 


TGAACAAATG 


TGACATGGTA 


150 




GATGACGAAG 


AGCTGTTAGA 


GCTAGTTGAG 


ATGGAAGTGC 


GTGAACTGTT 


200 




ATCAGAATAC 


GATTTCCCAG 


GTGATGACTT 


ACCGGTAATC 


CAAGGTTCAG 


250 




CTCTGAAAGC 


GCTAGAAGGC 


GAGCCAGAGT 


GGGAAGCAAA 


AATCCTTGAA 


300 




TTAGCAGCGG 


CGCTGGATTC 


TTACATTCCA 


GAACCACAAC 


GTGACATCGA 


350 


10 


" TAAGCCGTTC 


CTACTGCCAA" 


TCGAAGACGT 


-ATTGTCAATT 


TCAGGCCGTG 


400 




GTACAGTAGT 


AACAGGTCGT 


GTTGAGCGTG 


GTATTGTACG 


CGTAGGCGAC 


450 




GAAGTTGAAA 


TCGTTGGTGT 


ACGTGCGACA 


ACTAAGACAA 


CGTGTACTGG 


500 




TGTAGAAATG 


TTCCGTAAAC 


TGCTTGACGA 


AGGTCGTGCA 


GGTGAGAACT 


550 




GTGGTATTTT 


GTTACGTGGT 


ACTAAGCGTG 


ATGACGTAGA 


ACGTGGTCAA 


600 


15 


GTATTAGCGA 


AGCCAGGTTC 


AATCAACCCA 


CACACTACTT 


TTGAATCAGA 


650 




AGTTTACGTA 


CTGTCAAAAG 


AAGAAGGTGG 


TCGTCACACG 


CCATTCTTCA 


700 




AAGGCTACCG 


TCCACAGTTC 


TACTTCCGTA 


CAACTGACGT 


AACCGGTACT 


750 




ATCGAACTGC 


CAGAAGGCGT 


AGAGATGGTA 


ATGCCAGGCG 


ATAACATCAA 


800 




GATGGTAGTG 


ACACTGATTT 


GCCCAATCGC 


GATGGACGAA 


GGTTTACGCT 


850 


20 


TCGCAATCCG 


TGAAGGCGGT 


CGTACAGTGG 


T 




881 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1750 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Stigmatella aurantiaca 

(B) STRAIN: DW4 

(C) ACCESSION NUMBER: X82820 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1750 



40 



45 



50 



AACATGATCA CGGGCGCGGC GCAGATGGAC 

CGCGGCCGAC GGCCCGATGC CCCAGACGCG 

GGCAGGTGGG CGTGCCCTAC ATCGTCGTCT 

CTGGACGATC CGGAGCTGCG CGAGCTGGTG 

GCTCAAGAAG TACGAGTTCC CGGGCGACAG 

GCGCGCTCAA GGCGCTGGAG GGAGACACCA 

ATCCTGAAGC TGATGGCGGC GGTGGACGAG 

TGCGACGGAC AAGCCGTTCC TGATGCCGGT 

CAGGCCGAGG AACGGTGGCG ACGGGCCGAG 

GTGGGCGAGG AAGTGGAGAT CGTGGGGATC 

CATCACGGGG GTGGAGATGT TCCGCAAGCT 

GAGACAACAT CGGAGCGCTG CTGCGAGGCC 

CGTGGGCAGG TGCTGGCGAA CTGGGGGAGC 



GGAGCGATTC 
TGAGCACATC 
TCCTGAACAA 
GAGATGGAGG 
CATCCCCATC 
GCGACATCGG 
TACATCCCGA 
GGAAGACGTG 
TGGAGCGCGG 
CGTCCGACGC 
GCTGGACGAG 
TGAAGCGCGA 
ATCAACCCGC 



TGGTGGTGTC 50 

CTGCTGGCCA 100 

GGTGGACATG 150 

TGCGCGACCT 200 

ATCCCTGGCA 250 

CGAGGGAGCG 300 

CGCCGCAGCG 350 

TTCTCCATCG 400 

CAAGATCAAG 4 50 

AGAAGACGGT 500 

GGCATGGCGG 550 

GGACCTGGAG 600 

ACACGAAGTT 650 
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CAAGGCGCAG GTGTACGTGC TGTCGAAGGA 

CGTTCTTCAA GGGATACCGG CCGCAGTTCT 

ACCGGAACGG TGAAGCTGCC GGACAACGTG 

CAACATCGCC ATCGAGGTGG AGCTCATTAC 

5 AGCTGCCGTT CGCCATCCGT GAGGGTGGCC 
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AGAGGGAGGG 


CGGCACACGC 


700 


ACTTCCGGAC 


GACGGACGTG 


750 


GAGATGGTGA 


TGCCGGGAGA 


800 


TCCGGTCGCC 


ATGGAGAAGG 


850 


GCACGGTGGG 


CGCCGGC 


897 



10 



15 



20 



25 



30 



35 



iO 



2) INFORMATION FOR SEQ ID NO: 1751 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thiomonas cuprina 

(B) STRAIN: Hoe5 

(C) ACCESSION NUMBER: x76871 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1751 



AACATGATCA 
CGCCGCCGAC 
GTCAGGTGGG 
GTCGACGACG 
GCTGTCCAAG 
CGGCCAAGCT 
ATTCTCAAGC 
GGCCGTCGAC 
CCGGGCGCGG 
GTCGGCGAGG 
CTGCACCGGC 
GCGACAACGT 
CGCGGCCAGG 
CACCGCCGAG 
CCTTCTTCAA 
ACCGGCGCCA 
TAATGTGAGC 
GCCTGCGCTT 



CCGGTGCGGC 
GGCCCCATGC 
CGTGCCCTAC 
CCGAGCTGCT 
TACGACTTCC 
GGCCCTCGAA 
TGGCCGAGGC 
GGCGCGTTCC 
CACGGTGGTC 
AAATCGAGAT 
GTGGAAATGT 
CGGCATCTTG 
TGCTGTGCAA 
GTGTACGTGC 
CAACTACCGC 
TCGAACTGCC 
ATCACCGTCA 
CGCCATCCGC 



CCAGATGGAC 
CCCAAACCCG 
ATCATCGTGT 
CGAACTCGTC 
CCGGTGACGA 
GGCGACAAGG 
CCTGGACACC 
TCATGCCCGT 
ACCGGGCGTG 
TGTCGGCCTC 
TCAGGAAGCT 
CTGCGCGGCA 
ACCCGGCTCG 
TGAGCAAGGA 
CCGCAGTTCT 
CAAGGACAAG 
AGCTCATCGC 
GAAGGCGGCC 



GGCGCCATCC 
CGAGCACATC 
TCCTCAACAA 
GAGATGGAAG 
CACCCCCATC 
GCGAACTGGG 
TACATCCCCA 
GGAAGACGTG 
TGGAGCGCGG 
AAGCCCACCC 
GCTCGACCAG 
CCAAGCGCGA 
ATCAAGCCCC 
CGAGGGCGGC 
ACTTCCGCAC 
GAAATGGTCA 
CCCCATCGCC 
GCACCGTCGG 



TGGTCGTGTC 50 

CTGCTGGCGC 100 

GTGCGACATG 150 

TGCGCGAGCT 200 

ATCAAGGGCT 250 

CGAAGGCGCC 300 

CGCCCGAGCG 350 

TTCTCCATCT 400 

CATCATCAAG 450 

TCAAGACCAC 500 

GGCCAGGCCG 550 

GGAAGTCGAG 600 

ACACCCACTT 650 

CGCCACACCC 700 

CACCGACGTC 750 

TGCCCGGCGA 800 

ATGGAAGAAG 850 

CGCCGGC 897 



55 



2) INFORMATION FOR SEQ ID NO: 1752 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: AE001202 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1752 



10 



15 



20 



25 



AATATGATCA CGGGTGCTGC GCAGATGGAC 

TGCGCCTGAC GGCGTTATGC CACAGACGAA 

GTCAGGTTGG TGTTCCCTCC ATCATTGTTT 

GTTGATGATC CTGAGTTGCT AGAGCTGGTG 

GCTTGCTGGA TATGGGTTTT CGCGTGAGAC 

CGTTTAAAGC TCTGCAGGAT GGCGCTTCCC 

GAGGAACTGC TTGCGGCCAT GGATTCCTAC 

CGACGCAAGA CCTTTCTTGC TCTCTATCGA 

GGCGTGGTAC CGTTGTCACG GGGCGCATCG 

AATGAAGAGG TCGAGATCGT CGGGATTAAG 

TACTGGCATT GAGATGTTTA ATAAGTTGCT 

ATAACGTGGG GCTGCTTTTG CGCGGGGTGG 

GGTCAGGTGC TTTCTAAGCC CGGTTCTATT 

GGCGCAGATC TACGTGCTCT CTAAGGAAGA 

TTTTTCAAGG TTATCGTCCG CAGTTTTATT 

GGTACGATTT CTCTTCCTGA AGGGGTAGAC 

CACCAAGATT ATAGGTGAGC TCATCCACCC 

TGAAGCTTGC GATTCGTGAA GGGGGGCGCA 



GGTGGTATTC 
GGAGCATCTT 
TTTTGAACAA 
GAAGAAGAGG 
GCCTATCGTC 
CGGAGGATGC 
TTTGAAGACC 
GGATGTGTAC 
AATGTGGGGT 
CCCACTAAGA 
TGATCAGGGA 
ATAAAAAAGA 
AAGCCACACA 
GGGTGGCCGT 
TTAGAACTAC 
ATGGTGAAGC 
GATAGCTATG 
CTATTGCTTC 



TCGTCGTGTC 50 

CTGCTCGCCC 100 

GGTTGATTTG 150 

TGCGTGATGC 200 

AAGGGGTCTG 250 

AGCTTGTATT 300 

CAGTGCGTGA 350 

ACTATTTCTG 400 

AATTAGTCTG 450 

AAACAGTGGT 500 

ATTGCAGGTG 550 

GGTTGAGCGC 600 

CCAAGTTTGA 650 

CACAGTCCTT 700 

TGACATTACC 750 

CGGGGGATAA 800 

GACAAGGGTC 850 

TGGT 894 



30 



2) INFORMATION FOR SEQ ID NO: 1753 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ureaplasma urealyticum 

(B) STRAIN: ATCC 33697 

45 (C) ACCESSION NUMBER: Z34275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1753 

AATATGATTA CAGGGGCAGC ACAAATGGAT GGAGCAATTT TAGTTATTGC 

50 TGCATCTGAT GGGGTTATGG CTCAAACTAA AGAACATATT TTATTAGCAC 

GTCAAGTTGG TGTTCCAAAA ATCGTTGTTT TCTTAAACAA ATGTGATTTC 

ATGACAGATC CAGATATGCA AGATCTTGTT GAAATGGAAG TTCGTGAATT 

ATTATCTAAA TATGGATTTG ATGGCGATAA CACACCAGTT ATTCGTGGTT 



50 
100 
150 
200 
250 
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CAGGTCTTAA 
GAATTAATGG 
. TGACAAACCA 
GTGGTACAGT 
5 GATGAGGTTG 
AGGAATTGAA 
ATGCTGGTAT 
CAAGTACTTG 
- - - -TAAAGTT.TAT 
10 TTTCAGGATA 
GCTATTTCAT 
TGAAATGACT 
AATTCTCAAT 



GGCTTTAGAA 
ACGCAGTTGA 
TTCTTATTAG 
AGTAACTGGA 
AAATTGTTGG 
ATGTTTAGAA 
TTTATTACGT 
TAAAACCAGG 
ATTCTTAAAA 
CCGTCCACAA 
TACCTGCTGG 
GTAGAATTAA 
CCGTGAAGGT 



GGAGATCCAG 
TTCATGAATT 
CAATTGAAGA 
CGTGTTGAAC 
TCTAAAAGAC 
AATCATTAGA 
GGTATTAAAA 
ATCAATTAAA 
AAGAAGAAGG 
TTCTATTTTA 
TGTTGATTTG 
TTGCTCCAGT 
GGTAAAACTG 



TTTGAGAAGC 

CCATTACCAG 

TGTATTCACA 

GTGGTGTATT 

ACTCAAAAAA 

TCAAGCTGAA 

AAGAAGATGT 

CCTCACCGTA 

TGGACGTCAT 

GAACAACAGA- 

GTTATGCCAG 

TGCGATTGAA 

TAGGTCATGG 



AAAAATTGAT 300 

AACGTAGTAC 350 

ATTTCAGGAC 400 

AAAAGTTAAT 450 

CTGTTGTTAC 500 

GCTGGTGATA 550 

TGAACGTGGT 600 

CTTTTACTGC 650 

ACACCTATTG 700 

TGTAACAGGT- 750 

GTGATGACGT 800 

GATGGATCTA 850 

T 891 



15 



2) INFORMATION FOR SEQ ID NO: 1754 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 909 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wolinella succinogenes 

(B) STRAIN: DSM 1740 

30 (C) ACCESSION NUMBER: X76862 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1754 





AACATGATTA 


CAGGTGCTGC 


TCAAATGGAT 


GGCGCGATTC 


TTGTTGTTTC 


50 


35 


TGCGGCGGAT 


GGCCCCATGC 


CCCAAACTAG 


GGAGCACATT 


CTTCTTTCTC 


100 




GACAAGTAGG 


CGTTCCTTAC 


ATCGTGGTTT 


TCTTGAACAA 


AGAAGATATG 


150 




GTTGATGACG 


CTGAGCTTCT 


TGAGCTTGTT 


GAAATGGAAG 


TTAGAGAACT 


200 




TCTTAGCAAC 


TACGACTTCC 


CTGGAGATGA 


CACTCCTATC 


GTTGCAGGTT 


250 




CCGCTCTTAA 


AGCTCTTGAA 


GAGGCTAACG 


ACCAGGAAAA 


TGTTGGCGAG 


300 


40 


TGGGGCGAGA 


AAGTATTGAA 


GCTTATGGCT 


GAGGTTGACC 


GATATATTCC 


350 




TACGCCTGAG 


CGAGATGTGG 


ATAAGCCTTT 


CCTTATGCCT 


GTTGAAGACG 


400 




TATTCTCCAT 


CGCGGGTCGT 


GGAACCGTTG 


TGACAGGAAG 


AATTGAAAGA 


450 




GGCGTGGTTA 


AAGTCGGTGA 


CGAAGTAGAA 


ATCGTTGGTA 


TCCGAAACAC 


500 




ACAAAAAACA 


ACCGTAACTG 


GCGTTGAGAT 


GTTCCGAAAA 


GAGCTCGACA 


550 


45 


AGGGTGAGGC 


GGGTGACAAC 


GTTGGTGTTC 


TTTTGAGAGG 


CACCAAGAAA 


600 




GAAGATGTTG 


AGAGAGGTAT 


GGTTCTTTGT 


AAAATAGGTT 


CTATCACTCC 


650 




TCACACTAAC 


TTTGAAGGTG 


AAGTTTACGT 


TCTTTCCAAA 


GAGGAAGGCG 


700 




GACGACACAC 


TCCATTCTTC 


AATGGATACC 


GACCTCAGTT 


CTATGTTAGA 


750 




ACTACAGACG 


TTACCGGTTC 


TATCTCTCTT 


CCTGAGGGCG 


TAGAGATGGT 


800 


50 


TATGCCTGGT 


GACAACGTTA 


AGATCAATGT 


TGAGCTTATC . 


GCTCCTGTAG 


850 




CCCTCGAAGA 


GGGAACACGA 


TTCGCGATCC 


GTGAAGGTGG 


TCGAACCGTT 


900 




GGTGCGGGT 










909 



917 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



20 



25 



30 



WO 01/2J604 

2) INFORMATION FOR SEQ ID NO: 1755 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia cepacia 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1755 



PCT/CA00/01150 



ATGATCACGG 
AGCAGACGGC 
AGGTTGGTGT 
GACGACGCTG 
GTCGAAGTAC 
CCAAGCTGGC 
ATGAGCCTGG 
AGTTGACGGC 
GCCGTGGTAC 
GGCGAAGAAA 
CACGGGCGTT 
ACAACGTCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCAACAA 
GGCTCGATCG 
CGTGTCGATC 
TGCGCTTCGC 



GCGCAGCGCA 
CCGATGCCGC 
TCCGTACATC 
AACTGCTCGA 
GACTTCCCGG 
GCTGGAAGGC 
CAGACGCGCT 
GCGTTCCTGA 
GGTGGTGACG 
TCGAAATCGT 
GAAATGTTCC 
TATCCTGCTG 
TGGCGAAGCC 
TACGTGCTGA 
CTACCGTCCG 
AGCTGCCGAA 
ACGGTGAAGC 
AATCCGTGAA 



GATGGACGGC 
AAACGCGTGA 
ATCGTGTTCC 
GCTGGTCGAG 
GCGACGACAC 
GACACGGGCG 
GGACACGTAC 
TGCCGGTGGA 
GGTCGTGTCG 
CGGTATCAAG 
GCAAGCTGCT 
CGCGGCACGA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGACAAGGAA 
TGATTGCTCC 
GGCGGCCGTA 



GCGATCCTGG 
GCACATCCTG 
TGAACAAGTG 
ATGGAAGTTC 
GCCGATCGTG 
AGCTGGGCGA 
ATCCCGACGC 
AGACGTGTTC 
AGCGCGGCAT 
CCGACGGTGA 
GGACCAAGGT 
AGCGTGAAGA 
ACGCCGCACA 
AGGCGGCCGT 
TCCGTACGAC 
ATGGTGATGC 
GATCGCGATG 
CGGTCGGC 



TTTGCTCGGC 50 

CTGGCGCGTC 100 

CGACAGTGTG 150 

GCGAACTCCT 200 

AAGGGTTCGG 250 

AGTGGCGATC 300 

CGGAGCGTGC 350 

TCGATCTCGG 4 00 

CGTGAAGGTC 450 

AGACGACCTG 500 

CAGGCAGGCG 550 

CGTGGAGGGT 600 

CGCACTTCAC 650 

CACACGCCGT 700 

GGACGTGACG 750 

CGGGCGACAA 800 

GAAGAAGGTC 850 
888 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1756 

TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 

918 
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AGCGTCAAGG TGGACAAGCA GCGTTCATTG ATGCTGAGCA TGCAATGGAT 100 

CCTGTATATG CACAAAAACT AGGTGTTAAC ATCGATGAAT TACTATTATC 150 

ACAACCTGAT ACAGGGGAGC AAGGTTTAGA AATCGCAGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGATATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCTG AAATTGAAGG AGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCTCGTCTA ATGTCTCAAG CACTTCGTAA ACTTTCAGGT GCAATCAATA 350 

AATCAAAAAC AATCGCAATC TTTATTAACC AAATTCGT 388 



10 



20 



45 



2) INFORMATION FOR SEQ ID NO: 1757 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1757 

TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 50 

AGCGTCAAGG TGGACAAGCA GCGTTCATTG ATGCTGAGCA TGCAATGGAT 100 

CCTGTATATG CACAAAAACT AGGTGTTAAC ATCGATGAAT TACTATTATC 150 

30 ACAACCTGAT ACAGGGGAGC AAGGTTTAGA AATCGCAGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGATATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCTG AAATTGAAGG AGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCTCGTCTA ATGTCTCAAG CACTTCGTAA ACTTTCAGGT GCAATCAATA 350 

AATCAAAAAC AATCGCAATC TTTATTAACC AAATTCGT 388 

35 



2) INFORMATION FOR SEQ ID NO: 1758 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
50 (B) STRAIN: ATCC 7064 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1758 
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TGAAAGTTCA GGTAAAACAA 
AGCGTCAAGG TGGACAAGCA 
CCTGTATATG CACAAAAACT 
ACAACCTGAT ACAGGGGAGC 
GAAGTGGTGC GGTTGATATT 
CCGAAAGCTG AAATTGAAGG 
AGCTCGTCTA ATGTCTCAAG 
AATCAAAAAC AATCGCAATC 



CAGTTTCATT ACACGCAATT GCAGAAGTAC 
GCGTTCATTG ATGCTGAGCA TGCAATGGAT 
AGGTGTTAAC ATCGATGAAT TACTATTATC 
AAGGTTTAGA AATCGCAGAA GCACTTGTAC 
ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 
AGACATGGGT GACTCACACG TAGGTTTACA 
CACTTCGTAA ACTTTCAGGT GCAATCAATA 
TTTATTAACC AAATTCGT 



50 
100 
150 
200 
250 
300 
350 
388 



10 
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2} INFORMATION FOR SEQ ID NO: 175? 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1759 

TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATCG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTAGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCAG AGATTGAAGG TGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1760 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 
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(xi) SEQUENCE DESCRIPTION: SEQ 

AAACGACAGT TTCATTACAT GCAATTGCAG 

5 CAAGCAGCAT TCATCGATGC GGAGCACGCA 

AAAATTAGGC GTTAACATAG ATGAATTACT 

GGGAGCAAGG ATTAGAAATC GCAGAAGCAC 

GACATTATCG TAATTGACTC TGTAGCAGCT 

• - TGAAGGAGAC -ATGGGTGACT CACACGTAGG 

10 CACAAGCACT TCGTAAGCTT TCAGGAGCAA" 

GCAATCTTTA TTAACCAAAT TCGT 



ID NO: 1760 

AAGTACAACG TCAAGGTGGA 50 

ATGGATCCTG TATATGCACA 100 

ATTATCACAG CCTGATACAG 150 

TTGTACGAAG TGGTGCGGTT 200 

CTTGTACCGA AAGCAGAGAT 250 

TTTACAAGCA CGTTTAATGT 300 

TCAACAAATC- AAAAACAATT . 35.0 

374 



15 2) INFORMATION FOR SEQ ID NO: 1761 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) x TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 61 



GGAAAGTTCA GGTAAAACAA CGGTTTCCTT ACATGCGATT GCAGAAGTGC 50 

AACGTCAAGG TGGACAAGCG GCATTTATTG ATGCGGAGCA TGCGATGGAT 100 

CCTGTATATG CACAAAAGTT AGGTGTTAAT ATTGATGAGT TACTATTATC 150 

GCAGCCTGAT ACAGGAGAAC AAGGTTTAGA AATCGCAGAA GCATTAGTAC 200 

35 GAAGCGGTGC GATTGATATC ATTGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCAAAAGCAG AAATCGAAGG GGAAATGGGT GACTCCCACG TTGGTTTACA 300 

AGCGCGTTTA ATGTCACAAG CACTTCGTAA GCTTTCTGGT GCGATTAACA 350 

AATCAAAAAC AATTGCAATC TTCATTAACC A 381 



2) INFORMATION FOR SEQ ID NO: 1762 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1762 

5 TGAMGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATCG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTAGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC. GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

10 CCGAAAGCAG AGATTGAAGG CGACATGGGT GACTCACACG TAGGTTT-ACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



15 



25' 



2) INFORMATION FOR SEQ ID NO: 1763 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1763 

TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATTG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

35 ACAGCCTGAT ACAGGGGAGC AAGGATTGGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCAG AGATTGAAGG CGATATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 

40 



2) INFORMATION FOR SEQ ID NO: 1764 

45' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN : ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1764 

5 

CTCCTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGT GCGCTGCCGG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT ATACGCCATG 100 

AACGTATTGG GCAATGACTG GAACAAAGCC TATAAAAAAT CTGCCCGTGT 150 

CGTGGGTGAC GTCATCGGTA AATACCACCC TCATGGTGAT ACTGCCGTAT 200 

-10- ACGACACCAT TGTACGTATG GCGCAGCCCT TCTCCCTGCG TTACATGCTG - 250 

GTAGATGGCC AGGGTAACTT TGGTTCGGTC GACGGCGACT CCGCCGCAGC 300 

GATGCGTTAT ACGGAAATCC GTATGTCGAA GATCGCCCAT GAACTGATGG 350 

CCGACCTC , 358 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1765 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1765 



TTAAGAACTC TTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG 50 

CTGCCGGATG TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTTTA 100 

35 CGCCATGAAC GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCAG 150 

CCCGTGTCGT TGGTGACGTA ATCGGTAAAT ACCACCCGCA CGGCGACTCC 200 

GCGGTATACG ACACCATCGT GCGTATGGCG CAGCCGTTCT CGCTGCGTTA 250 

CATGCTGGTG GACGGCCAGG GTAACTTTGG TTCCATCGAC GGCGACTCCG 300 

CCGCGGCGAT GCGTTATACC GAAATTCGTC TGGCGAAAAT CGCTCATGAG 350 

40 CTGATGGCCG ATCTT 365 



45 



2) INFORMATION FOR SEQ ID NO: 1766 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1766 

AAGAGCTCGT ATCTGGATTA TGCGATGTCG GTCATTGTTG GCCGTGCGCT 50 

GCCGGATGTC CGAGATGGCC TGAAACCGGT ACACCGTCGC GTACTTTACG 100 

- - CCATGAACGT. _ATTGGGCAAT_ GACTGGAACA AAGCCTATAA AAAATCCGCC 150 

10 CGTGTCGTTG GTGACGTAAT CGGTAAATAC CACCCTCATG GTGATACCGC 200 

CGTTTATGAC ACCATTGTAC GTATGGCACA GCCATTCTCC TTGCGTTATA 250 

TGCTGGTCGA TGGCCAGGGT AACTTCGGTT CTGTCGATGG CGACTCCGCC 300 

GCAGCGATGC GTTATACGGA AATCCGTATG TCGAAAATCG CCCA 344 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1767 



{i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 345 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1767 



AACTCTTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

GGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

35 TGAACGTATT GGGCAATGAC TGGAACAAAG CCTATAAAAA ATCAGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAC CCGCACGGCG ACTCCGCGGT 200 

AT AC G AC AC C ATCGTGCGTA TGGCGCAGCC GTTCTCGCTG CGTTACATGC 250 

TGGTGGACGG CCAGGGTAAC TTTGGTTCCA TCGACGGCGA CTCCGCCGCG 300 

GCGATGCGTT ATACCGAAAT TCGTCTGGCG AAAATCGCTC ATGAG 345 

40 



2) INFORMATION FOR SEQ ID NO: 1768 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1768 

5 

CTTATCTGGA TTATGCGATG TCGGTCATTG TTGGCCGTGC GCTGCCGGAT 50 

GTCCGAGATG GCCTGAAGCC GGTACACCGT CGCGTACTTT ACGCCATGAA 100 

CGTATTGGGC AATGACTGGA ACAAAGCCTA TAAAAAATCA GCCCGTGTCG 150 

TTGGTGACGT AATCGGTAAA TACCACCCGC ACGGCGACTC CGCGGTATAC 200 

10 "GACACCATCG'T'GCGTATGGC GCAGCCGTTC TCGCTGCGTT ACATGCTGGT- 250 

GGACGGCCAG GGTAACTTTG GTTCCATCGA CGGCGACTCC GCCGCGGCGA 300 

TGCGTTATAC CGAAATTCGT CTGGCGAAAA TCGCTCATGA GCTGATGGCC 350 

GATCTT - 356 

15 



2) INFORMATION FOR SEQ ID NO: 1769 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 361 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1769 



TAAGAACTCT TATCTGGATT 
TGCCGGATGT CCGAGATGGC 

35 GCCATGAACG TATTGGGCAA 
CCGTGTCGTT GGTGACGTAA 
CGGTATACGA CACCATCGTG 
ATGCTGGTGG ACGGCCAGGG 
CGCGGCGATG CGTTATACCG 

40 TGATGGCCGA T 



ATGCGATGTC GGTCATTGTT 
CTGAAGCCGG TACACCGTCG 
TGACTGGAAC AAAGCCTATA 
TCGGTAAATA CCACCCGCAC 
CGTATGGCGC AGCCGTTCTC 
TAACTTTGGT TCCATCGACG 
AAATTCGTCT GGCGAAAATC 



GGCCGTGCGC 


50 


CGTACTTTAC 


100 


AAAAATCAGC 


150 


GGCGACTCCG 


200 


GCTGCGTTAC 


250 


GCGACTCCGC 


300 


GCTCATGAGC 


350 




361 



2) INFORMATION FOR SEQ ID NO: 1770 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13824 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1770 





TAAGAACTCT 


TATCTGGATT 


ATGCGATGTC 


GGTCATTGTT 


GGCCGTGCGC 


50 




TGCCGGATGT CCGAGATGGC 


CTGAAGCCGG 


TACACCGTCG 


CGTACTTTAC 


100 


10 


GCCATGAACG 


TATTGGGCAA" 


TGACTGGAAC 


AAAGCCTATA 


AAAAATCAGC 


.150 




CCGTGTCGTT 


GGTGACGTAA 


TCGGTAAATA 


CCACCCGCAC 


GGCGACTCCG 


200 




CGGTATACGA 


CACCATCGTG 


CGTATGGCGC 


AGCCGTTCTC 


GCTGCGTTAC 


250 




ATGCTGGTGG 


ACGGCCAGGG 


TAACTTTGGT 


TCCATCGACG 


GCGATTCCGC 


300 




CGCGGCGATG 


CGTTATACCG 


AAATTCGTCT 


GGCGAAAATC 


GCTCATGAGC 


350 


15 


TGATGGCCGA 


TCTTG 








365 



2) INFORMATION FOR SEQ ID NO: 1771 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Klebsiella terrigena 

(B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1771 

35 TCGTATCTGG AT TAT GC GAT GTCGGTCATT GTTGGCCGTG CGCTGCCGGA 50 

TGTCCGAGAT GGGTTGAAAC CGGTACACCG TCGCGTACTT TACGCCATGA 100 

ACGTATTGGG CAATGACTGG AACAAAGCCT ATAAAAAATC CGCCCGTGTC 150 

GTTGGTGACG TAATCGGTAA ATATCACCCT CACGGTGATA CCGCCGTTTA 200 

TGACACCATT GTACGTATGG CGCAGCCATT CTCCTTGCGT TATATGCTGG 250 

40 TCGATGGCCA GGGTAACTTC GGTTCTGTCG ATGGCGACTC CGCCGCAGCG 300 

ATGCGTTATA CGGAAATCCG TATGTCGAAA ATCGCCCACG AGCTGATGGC 350 

CGACCTC 357 



45 

2) INFORMATION FOR SEQ ID NO: 1772 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 968 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 
5 (B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1772 

CAGTACACAG GCGAGTTCTT TTTGCGATGA GTGAGTTAAG TAATGATTGG 50 

-10 AATAAGCCGT ATAAAAAATC TGCTCGTGTA GTAGGGGATG TCATTGGTAA 100 

ATATCATCCT CATGGGGATA CAGCTGTTTA TGACACTATT GTTCGTATGG 150 

CTCAGCCCTT TTCCATGCGT TATATGCTGA TTGATGGGCA GGGTAATTTT 200 

GGCTCTGTAG ATGGAGATGC TCCAGCTGCC ; ATGCGTTACA CTGAAGTAAG 250 

AATGTCCAAA GTGGCGCATG CTTTACTGGC TGATTTGGAT AAGGAAACCG 300 

15 TTGATTTTAG TCCTAACTAT GATGAAACAG AATTTGCTCC AGTGGTATTG 350 

CCATCGAGAA TTCCCAATTT ACTAGTTAAT GGCTCTTCCG GTATTGCGGT 4 00 

AGGGATGGCT ACTAATATTC CACCACATAA TCTTACCGAA GTAATCAATG 4 50 

CATGTATTGC TTTAGTGGAT GAACCTGACA CGAGTCTTGA AGATTTAATG 500 

GAAATTATTC CTGGCCCTGA TTTTCCTACA GCCGCAATTA TTAATGGTCG 550 

20 TGCTGGAATT ATTGAAGGTT ATCGTACTGG AAAAGGGCGG GTTGTTATCA 600 

GGGCACGCAC AGAAATTGAA ACGGATGAAA GTTCAGGCCG TCAGTCAATT 650 

ATTATTCAGG AATTACCCTA TCAGGTGAAT AAAGCGCGTT TGATCGAGCG 700 

TATTGCTGAA TTGGTAAGGG ACAAGAAAAT CGAAGGAATT TCCGGCTTGA 750 

GAGATGAGTC AGACAAGCAA GGAATGAGAG TAGTCATTGA ATTAAAACGC 800 

25 AATGAAGTAG CAGATGTGGT ATTGAATAAC CTGTTCGCTC ATACTCAAAT 850 

GCAAAATGTA TTCGGAATTA ATATGGTTGC TCTGGTGGAT GGCCAACCGC 900 

GTACTTTGAA TTTGAAGCAA ATACTGGAAT ATTTTATAAA ACATCGAAGA 950 

GAGGTTGTTA CCAGACGC 968 



30 



45 



2) INFORMATION FOR SEQ ID NO: 1773 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1773 



CACCGCCGAG TGCTATTTGC GATGAATGTA CTGGGAAACG ATTGGAATAA " 50 

ACCTTATAAA AAATCAGCCC GTGTTGTTGG GGATGTAATC GGTAAATATC 100 

50 ACCCGCACGG TGACAGTGCT GTCTATGAAA CGATTGTTCG TTTAGCACAG 150 

CCTTTTTCTA TGCGCTACAT GTTGGTTGAC GGTCAGGGTA ACTTCGGGTC 200 

AGTTGATGGT GACTCCGCGG CGGCTATGCG TTATACCGAA GTTCGTATGG 250 

CGAAGATCGC CCATGAACTG CTGGCGGATT TGGAAAAAGA GACGGTCGAC 300 
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TTTGTTCCTA ACTATGATGG AACAGAAAAT 
CCGTATTCCA AACTTGTTAG TTAATGGCTC 
TGGCAACGAA TATCCCTCCG CATAACCTCG 
CTTGCCTATG TTGATAATGA AGACATCACC 
5 TATTACCGGG CCTGATTTTC CGACTGCTGC 
GAATATTAGA TGCTTATCGT ACAGGGCGTG 
CAGGCTGATA TTGAAACTGA TGAGAAAACA 
GACAGAAATT CCTTATCAGG TGAATAAAGC 
CGGAGCTTGT AAAAGATAAA CGTATTGAAG 
10 GAGTCTGATA AAGACGGTAT GCGTATTGTT 
AGTCGGTGAA GTAGTATTAA ATCACCTCTT 
TCTCTTTTGG TATTAATATG GTTGCGCTTC 
TTGAACCTAA AAGAAATTAT CGCAGCCTTT 
GGTGACTCGC CGTACCA 

15 



ATACCGGCTG TTATGCCAAC 350 

TTCAGGTATT GCCGTTGGGA 400 

GTGAAGTTAT CGACGGTTGT 450 

ATAGAAGAAT TAATGGAATA 500 

GATTATTAAT GGTCGCAGAG 550 

GAAAGATTTA TATCCGTGCT 600 

GGTCGCGAAA CCATTATCGT 650 

CCGTTTAATT GAAAAAATTG 700 

GTATCAGCGG ATTACGTGAC 750 

GTTGAGATCA- AACGTGATGC 800 

TTCACAAACT CAAATGCAAG 850 

ATCAAGGCCA ACCAAAATTA 900 

ATTCGCCATC GTCGTGAAGT 950 

967 



2) INFORMATION FOR SEQ ID NO: 1774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 
30 (B) STRAIN: ATCC 9250 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1774 

CCAGTACACC GCAGAGTATT GTATGCGATG AATGTATTGG GAAATGATTG 50 

35 GAATAAACCC TATAAAAAAT CTGCCCGTAT TGTTGGGGAC GTCATCGGTA 100 

AATACCATCC ACATGGTGAT AGCGCTGTTT ACGAGACAAT CGTTCGTCTT 150 

GCTCAGCCTT TCTCAATGCG TTACATGCTG GTTGATGGTC AGGGAAACTT 200 

CGGTTCTGTT GACGGAGACT CCGCAGCGGC AATGCGTTAT ACGGAAATCC 250 

GTATGGCGAA AATTGCCCAT GAACTACTTG CTGACCTTGA AAAAGAAACC 300 

40 GTTGATTTCG TTCCTAACTA TGACGGAACA GAGCACATTC CTGAAGTTAT 350 

GCCAACGAAA ATCCCAAACC TTTTGGTTAA TGGGTCGTCA GGTATTGCTG 400 

TTGGGATGGC AACCAATATC CCACCTCACA ATTTAGGGGA GGTGATTAAT 450 

GGTTGTCTTG CCTATATAGA AGACGAAGAC ATCAGCATTG ATGGTTTAAT 500 

GGAACACATT CCAGGGCCTG ATTTCCCAAC CGCAGCTATT ATTAATGGCC 550 

45 GTCGTGGGAT TATTGATGCG TATCGCACAG GGCGTGGCAA GGTCTATATC 600 

CGTGCAAGCG CTGAAGTGGA AGTCGATGAG AAAAATGGTC GCGAAACCAT 650 

TATTGTCAGC GAAATTCCTT ATCAAGTGAA TAAAGCTCGC TTGATTGAAA 700 

AAATTGCTGA GTTAGTTAAA GACAAGCGTG TTGAAGGTAT CAGTGCACTG 750 

CGTGACGAGT CTGATAAAGA CGGTATGCGT ATTGTTATTG AAATCAAACG 800 

50 CGATGCGGTG GGTGAAGTTG TACTGAACAA CTTATATTCC CTGACCCAAT 850 

TGCAAGTTTC TTTTGGTATC AATATGGTAG CTCTACACCA AGGGCAGCCG 900 

AAAATACTGA ATTTAAAAGA TATCATTGCT GCTTTTGTGC GTCACCGCCG 950 

TGAAGTCGTC ACTCGTCGTA CGATTTTC 978 
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2) INFORMATION FOR SEQ ID NO: 1775 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(-ii) MOLECULE TYPE: Genomic DNA 



10 



IS 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1775 



CCAGTACACC 
GAATAAACCT 
AATATCACCC 

20 GCACAGCCTT 
CGGGTCAGTT 
GTATGGCGAA 
GTTGATTTTG 
GCCAACCCGT 

25 TCGGGATGGC 
GGCTGTCTTG 
GGAGCATATC 
GCAGAGGAAT 
CGTGCACAAG 

30 TATCGTGACG 
AAATTGCAGA 
CGTGATGAGT 
TGATGCTGTT 
TGCAGGTTTC 

35 AAAGTGTTAA 
TGAAGTGGTG 



GTCGCGTTCT 
TATAAAAAAT 
GCACGGTGAC 
TTTCTATGCG 
GATGGTGACT 
AATCGCCCAT 
TTCCTAACTA 
ATTCCAAACT 
AACAAACATT 
CTTATGTTGA 
ACGGGGCCTG 
TTTAGATGCT 
CTGATGTAGA 
GAAATTCCTT 
GCTTGTTAAA 
CAGATAAAGA 
GGTGAAGTTG 
TTTTGGTATT 
ACCTGAAAGA 
ACTCGTCGTA 



TTTCGCGATG 
CAGCCCGTGT 
AGTGCTGTTT 
TTACATGTTG 
CGGCGGCTGC 
GAACTGCTGG 
TGATGGAACA 
TATTAGTCAA 
CCTCCGCATA 
TAACGAAGAC 
ATTTCCCAAC 
TACCGTACTG 
AACCGATGAG 
ATCAGGTGAA 
GATAAACGTA 
TGGTATGCGC 
TACTAAATCA 
AACATGGTTG 
AATTATTTCA 
CTATTTTT 



AATGTATTAG 
TGTTGGGGAT 
ATGAAACGAT 
GTTGACGGGC 
AATGCGTTAT 
CGGATTTGGA 
GAGCATATCC 
TGGTTCTTCA 
ACCTAGGTGA 
ATCACTATTG 
TGCCGCTATT 
GGCGCGGAAA 
AAAACTGGTC 
CAAAGCTCGC 
TTGAAGGCAT 
ATTGTTGTTG 
CTTATTTTCT 
CACTGCATCA 
GCCTTTATTC 



GAAACGATTG 50 

GTTATCGGTA 100 

AGTTCGTTTA 150 

AGGGTAACTT 200 

ACCGAAGTTC 250 

AAAAGAAACG 300 

CGGCAGTCAT 350 

GGTATCGCAG 400 

AGTTATCGAC 450 

AAGAGTTGAT 500 

ATTAATGGCC 550 

AATTTATATT 600 

GCGAAACAAT 650 

TTAATTGAAA 700 

TAGCGGATTA 750 

AAATTAAGCG 800 

CAGACTCAGA 850 

AGGTCAACCG 900 

GTCACCGTCG 950 
978 



40 2) INFORMATION FOR SEQ ID NO: 1776 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1776 

TGGGTAATGA CTGGAATAAG CCATACAAAA AATCGGCCCG TGTAGTCGGG 50 

GACGTTATCG GTAAATATCA CCCGCATGGT GACAGCGCGG TCTACGACAC 100 

5 AATTGTGCGT ATGGCCCAGC CGTTCTCACT GCGCTATATG CTGGTGGATG 150 

GGCAGGGCAA CTTCGGTTCC GTTGATGGCG ACTCCGCCGC AGCGATGCGT 200 

TATACCGAAA TCCGTATGTC TAAAATTGCT CACGAATTGT TGGCGGACTT 250 

AGAAAAAGAT ACCGTCGACT TCGTGCCGAA CTATGACGGT ACGGAGCAAA 300 

TTCCTGCCGT AATGCCAACC CGAATCCCTA ACTTGCTGGT TAACGGCTCG 350 

10 " TCAGGTATTG- CTGTCGGTAT GGCAACCAAT ATTCCGCCGC_ ATAACCTTTC 4 00 

TGAGGTTATT GATGGCTGTC TGGCCTATAT CGAAGATGAA AACATCACCA 4 50 

TTGAAGGGTT GATGGAGTAC ATCCCGGGGC CAGATTTCCC AACTGCTGCG 500 

ATTATCAATG GTCGCCGTGG TATTGAAGAA GCTTATCGTA CTGGCCGTGG 550 

CAAGGTGTAT ATCCGTGCCC GTGCTGAAGT TGAGGCTGAC GCTTAAACCG 600 

15 GTCGCGAAAC CATTATTGTT CACGAGATCC CGTATCAGGT GAACAAGGCG 650 

CGGTTGATTG AAAAAATCGC CGAGCTGGTT AAAGAAAAAC GCGTAGAAGG 700 

CATCAGTGCG TTGCGTGATG AGTCTGATAA AGACGGCATG CGTATCGTGA 750 

TTGAAATCAA ACGTGATGCT GTCGGGGAAG TGGTTCTGAA CAACCTCTAT 800 

TCTCTGACGC AATTGCAGGT GACTTTCGGT ATCAATATGG TGGCTCTGTC 850 

20 TCAAGGGCAG CCTAAGTTGC TTAACCTGAA AGACATTTTG GTTGCTTTCG 900 

TGCGCCACCG CCGTGAAGTG GTGACTCGCC GTACCATTTT 940 



25 2) INFORMATION FOR SEQ ID NO: 1777 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



4 0. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1777 



CCGTTTATTG GCGATGGCCT GAAGCCGGTC CAGCGTCGCA TCGTCTATGC 50 

GATGTCTGAA CTGGGTCTGA ACGCCAGCGC GAAGTTCAAA AAGTCCGCCC 100 

GCACCGTCGG TGACGTGCTG GGTAAATACC ATCCCCACGG CGACAGCGCG 150 

TGCTATGAAG CCATGGTGCT GATGGCTCAG CCCTTCTCCT ACCGCTATCC 200 

45 GCTGGTTGAC GGTCAGGGAA ACTGGGGGGC GCCGGACGAT CCTAAATCCT 250 

TCGCCGCAAT GCGTTATACC GAATCCCGTT TGTCGAAGTA TGCTGAACTG 300 

CTGCTGAGCG AACTGGGGCA AGGCACCGTT GACTGGGTAC CAAACTTCGA 350 

CGGCACTTTG CAGGAGCCGA AGATGCTGCC TGCGCGCCTG CCCAATATTC 400 

TGCTAAACGG TACTACCGGC ATTGCCGTTG GGATGGCGAC GGACATTCCG 450 

50 CCGCACAACC TGCGTGAAGT GGCCCGGGCG GCCATTACCC TGATTGAAAA 500 

GCCGCAAACT TCGCTGGATG ACCTGCTGGA TATCGTGCAG GGGCCGGATT 550 

ATCCTACCGA AGCGGAAATC ATTACCCCCC GTGCCGAAAT CCGCAAAATC 600 

TACCAGAATG GCCGCGGTTC GGTGCGGATG CGCGCGGTAT GGGCCAAAGA 650 

930 
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AGACGGCGCG GTGGTGAT 



PCT/CA00/01150 
668 



5 2) INFORMATION FOR SEQ ID NO: 1778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: "Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1778 



GTTTATTGGC GATGGCCTGA AGCCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCTGAACT GGGTCTGAAC GCCAGCGCGA AGTTCAAAAA GTCCGCCCGC 100 

ACCGTCGGTG ACGTGCTGGG TAAATACCAT CCCCACGGCG ACAGCGCGTG 150 

CTATGAAGCC ATGGTGCTGA TGGCTCAGCC CTTCTCCTAC CGCTATCCGC 200 

25 TGGTTGACGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC TAAATCCTTC 250 

GCCGCAATGC GTTATACCGA ATCCCGTTTG TCGAAGTATG CTGAACTGCT 300 

GCTGAGCGAA CTGGGGCAAG GCACCGTTGA CTGGGTACCA AACTTCGACG 350 

GCACTTTGCA GGAGCCGAAG ATGCTGCCTG CGCGCCTGCC CAATATTCTG 400 

CTAAACGGTA CTACCGGCAT TGCCGTTGGG ATGGCGACGG ACATTCCGCC 450 

30 GCACAACCTG CGTGAAGTGG CCCGGGCGGC CATTACCCTG ATTGAAAAGC 500 

CGCAAACTTC GCTGGATGAC CTGCTGGATA TCGTGCAGGG GCCGGATTAT 550 

CCTACCGAAG CGGAAATCAT TACCCCCCGT GCCGAAATCC GCAAAATCTA 600 

CCAGAATGGC CGCGGTTCGG TGCGGATGCG CGCGGTATGG GCCAAAGAAG 650 

ACGGCGCGGT GGTGATTAGC GCGCTGCCGC ATCAGGTTTC CGGCGCCCGC 700 

35 GTGCTTGAGC AGAT 714 



2) INFORMATION FOR SEQ ID NO: 1779 



40 



(i) SEQUENCE. CHARACTERISTICS: 

(A) LENGTH: 722 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1779 

931 
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I i bbbbAl bb 


f mm 7\ tv 7i T\ r^rr^ 

LI i AAAALCG 


GTLLAGLbTL 


bbAlLGILl A 


IbLbATGILL 


DU 




bAbbl bbbbb 


IbAAbbbLAG 


LbLGAAAT rL 


AAAAAb 1 LLb 


LLLGCACCbl 


1 uu 




bbbbbALblb 


1 I bbbi AAA1 


AILALLLbLA 


LbbbbAbAbb 


bLLl bLTAIb 


1 CLf\ 

xov 


5 


AAGCGAibGT 


GCTGArGGCG 


LAbLLbT TLI 


LI 1 ALLbbl A 


IbbbbTGGTb 


zuu 




GATGGTCAGG 


GAAALIbGGG 


GGLGLLGGAb 


bAIbbLAAAl 


bl i ILGCCGL 


^OU 




CATGCGTTAC 


ACCGAArCCC 


GCCTG1CGAA 


bl AlbbbbAb 


blbblbblLA 


JUU 




GCGAGCTGGG 


GCAGGGGACG 


GTCGACTGGG 


TGCCAAACTT 


rp/^'Tv ^ m tv 

TGACGGTACG 




- 


_ CTGCAGGAbC 


ripTi tv tv Tvrn/^/^«rn 

CGAAAATGCT 


GCCAGCGCGT 


mm r> tv tv r* t\ 

Tl bLLbAALA 


lbblbblbAA 


y» n n 
4 UU 


10 


CGGCACCACC 


GGCATCGCGG 


rp TV _ y-^/^m TV rp /"" , r*"» 

TAGGTATGGC 


bAbbbAl A I 1 


Lb 1 LLbLALA . 


yi c: n 
H 0 U 




ACCTGCGGGA 


tv m (~* ~s\ t\ tv 

AGTGbbLAAA 


GLbbLbAI I A 


bbbibAl IbA 


bbAbbbbAAA 


OUU 




ACCACCCTCG 


ACGAACTGCT 


/"^ /"» tv m Tk m y*^ m Tt 

GGATATCGTA 


CAGGGGCCGG 


tv mm mr'/>or , 7\ 

ATTTCCCGAC 


rr/v 

ooU 




CGAGGCGGAG 


ATCATCACCT 


CGCGGGCGGA, 


AATTCGAAAA 


ATCTACCAGA 


600 




ACGGGCGCGG 


CTCAGTGCGC 


ATGCGCGCGG 


TGTGGAGTAA 


AGAGGACGGC 


650 


15 


GCGGTGGTGA 


TCAGCGCGCT 


GCCGCATCAG 


GTCTCCGGCG 


CCAAAGTGCT 


700 




GGAGCAGATT 


GCGGCGCAGA 


TG 






722 



20 2) INFORMATION FOR SEQ ID NO: 1780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1780 



GTCCAGCGTC GCATCGTTTA TGCGATGTCT GAGCTGGGGC TGAACGCCAG 50 

CGCGAAGTTC AAAAAGTCCG CCCGCACCGT GGGTGATGTG CTGGGTAAAT 100 

ATCACCCGCA CGGCGATAGC GCATGCTATG AAGCTATGGT GCTGATGGCG 150 

CAGCCATTCT CCTACCGCTA CCCGCTGGTG GATGGGCAGG GGAACTGGGG 200 

40 GGCACCGGAC GATCCTAAAT CCTTCGCCGC GATGCGTTAT ACCGAATCCC 250 

GTTTGTCGAA GTATGCGGAA CTGCTGCTGG GCGAACTGGG GCAGGGAACC 300 

GTCGACTGGG TGCCGAACTT CGACGGGACG ATGCAGGAGC CGAAAATGCT 350 

GCCTGCGCGT CTGCCGAATA TTCTGCTGAA CGGCACTACC GGCATCGCCG 400 

TCGGTATGGC AACCGATATT CCTCCGCACA ACCTGCGTGA AGTGGCGCAG 450 

45 GCGGCGATTA CCCTGATCGA AAAACCGCAG ACCTCGCTCG ACGAACTGCT 500 

GGATATCGTT CACGGACCCG ACTACCCGAC CGAAGCCGAA ATCATTACTC 550 

CACGCGCGGA GATCCGCAAA ATCTACCAGA ACGGCCGCGG TTCGGTGCGG 600 

ATGCGCGCGG TATGGAAAAA AGAGGACGGC GCGGTCGTGA TTACGGCGTT 650 

GCCGCATCAG GTTTCCGGCG CCCGCGTGCT GGAGCAAATT GC 692 

50 



2) INFORMATION FOR SEQ 



ID NO: 1781 
932 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1781 



GTTTATTGGC GATGGCTTAA AACCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCCGAGCT GGGGCTGAAC GCCAGCGCGA AATTCAAAAA GTCCGCCCGC 100 

ACCGTCGGCG ACGTGTTGGG TAAATATCAC CCGCACGGCG ACAGCGCCTG 150 

CTATGAAGCG ATGGTGCTGA TGGCGCAGCC GTTCTCTTAC CGCTATCCGC 200 

20 TGGTGGATGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC CAAATCTTTC 250 

GCCGCCATGC GTTACACCGA ATCCCGCCTG TCGAAGTATG CCGAGCTGCT 300 

GCTCAGCGAG CTGGGGCAGG GGACGGTCGA CTGGGTGCCA AACTTTGACG 350 

GCACGCTGCA GGAGCCGAAA ATGCTGCCAG CGCGTTTGCC GAACATCCTG 400 

CTGAACGGCA CCACCGGCAT CGCGGTAGGC ATGGCGACCG ATATTCCTCC 450 

25 GCACAACCTG CGGGAAGTGG CCAAAGCGGC GATTACGCTG ATTGAGCAGC 500 

CGAAAACCAC CCTCGACGAA CTGCTGGATA TCGTACAGGG GCCGGATTTC 550 

CCGACCGAGG CGGAGATCAT CACCTCGCGG GCGGAAATTC GCAAAATCTA 600 

CCAAAACGGG CGCGGCTCAG TGCGCATGCG CGCGGTGTGG AGTAAAGAGG 650 

ACGGCGCGGT GGTGATCAGC GCGCTGCCGC ATCAGGTCTC CGGCGCCAAA 700 

30 



2) INFORMATION FOR SEQ ID NO: 1782 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 
45 (B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1782 

ATTGGCGATG GCTTAAAACC GGTCCAGCGT CGCATCGTCT ATGCGATGTC 50 

50 CGAGCTGGGG CTGAACGCCA GCGCGAAATT CAAAAAGTCC GCCCGCACCG 100 

TCGGCGACGT GTTGGGTAAA TATCACCCGC ACGGCGACAG CGCCTGCTAT 150 

GAAGCGATGG TGCTGATGGC GCAGCCGTTC TCTTACCGCT ATCCGCTGGT 200 

GGATGGTCAG GGAAACTGGG GGGCGCCGGA CGATCCCAAA TCTTTCGCCG 250 

933 
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CCATGCGTTA CACCGAATCC 
AGCGAGCTGG GGCAGGGGAC 
GCTGCAGGAG CCGAAAATGC 
ACGGCACCAC CGGCATCGCG 
5 AACCTGCGGG AAGTGGCCAA 
AACCACCCTC GACGAACTGC 
CCGAGGCGGA GATCATCACC 
AACGGGCGCG GCTCAGTGCG 
CGCGGTGGTG ATCAGTGCGC 
- - -10 TGGAGCAGAT TGCGGCGCAG 



CGCCTGTCGA AGTATGCCGA 
GGTCGACTGG GTGCCAAACT 
TGCCAGCGCG TCTGCCGAAC 
GTAGGCATGG CGACCGATAT 
AGCGGCGATT ACGCTGATTG 
TGGATATCGT ACAGGGGCCG 
TCGCGGGCGG AAATTCGCAA 
CATGCGCGCG GTGTGGAGTA 
TGCCGCATCA GGTCTCTGGC 
ATGCGC- - 



GCTGCTGCTC 300 

TTGACGGCAC 350 

ATCCTGCTGA 400 

TCCTCCGCAC 4 50 

AGCAGCCGAA 500 

GATTTCCCGA 550 

AATCTACCAG 600 

AAGAGGACGG 650 

GCCAAAGTGC 700 

. 726 



2) INFORMATION FOR SEQ ID NO: 1783 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1783 

30 GGCTTAAAAC CGGTCCAGCG TCGCATCGTC 

GCTGAACGCC AGCGCGAAAT TCAAAAAGTC 

TGTTGGGTAA ATATCACCCG CACGGCGACA 

GTGCTGATGG CGCAGCCGTT CTCTTACCGC 

GGGAAACTGG GGGGCGCCGG ACGATCCCAA 

35 ACACCGAATC CCGCCTGTCG AAGTATGCCG 

GGGCAGGGGA CGGTCGACTG GGTGCCAAAC 

GCCGAAAATG CTGCCAGCGC GTCTGCCGAA 

CCGGCATCGC GGTAGGCATG GCGACCGATA 

GAAGTGGCCA AAGCGGCGAT TACGCTGATT 

4 0 CGACGAACTG CTGGATATCG TACAGGGGCC 

AGATCATCAC CTCGCGGGCG GAAATTCGCA 

GGCTCAGTGC GCATGCGCGC GGTGTGGAGT 



TTGCGG 



TATGCGATGT 


CCGAGCTGGG 


50 


CGCCCGCACC 


GTCGGCGACG 


100 


GCGCCTGCTA 


TGAAGCGATG 


150 


TATCCGCTGG 


TGGATGGTCA 


200 


ATCTTTTGCC 


GCCATGCGTT 


250 


AGCTGCTGCT 


CAGCGAGCTG 


300 


TTTGACGGCA 


CGCTGCAGGA 


350 


CATCCTGCTG 


AACGGCACCA 


400 


TTCCTCCGCA 


CAACCTGCGG 


450 


GAGCAGCCGA 


AAACCACCCT 


500 


GGATTTCCCG 


ACCGAGGCGG 


550 


AAATCTACCA 


GAACGGGCGC 


600 


AAAGAGGACG 


GCGCGGTGGT 


650 


CGCCAAAGTG 


CTGGAGCAGA 


700 






706 



45 



2) INFORMATION FOR SEQ ID NO: 1784 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

934 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN : ATCC 13884 

10 - (xi) SEQUENCE- DESCRIPTION:. SEQ ID NO: 1784 

TTGGCGATGG CTTAAAACCG GTCCAGCGTC GCATCGTCTA TGCGATGTCC 50 

GAGCTGGGGC TGAACGCCAG CGCGAAATTC AAAAAGTCCG CCCGCACCGT 100 

CGGCGACGTG TTGGGTAAAT ATCACCCGCA CGGCGACAGC GCCTGCTATG 150 

15 AAGCGATGGT GCTGATGGCG CAGCCGTTCT CTTACCGCTA TCCGCTGGTG 200 

GATGGTCAGG GAAACTGGGG GGCGCCGGAC GATCCCAAAT CTTTCGCCGC 250 

CATGCGTTAC ACCGAATCCC GCCTGTCGAA GTATGCCGAG CTGCTGCTCA 300 

GCGAGCTGGG GCAGGGGACG GTCGACTGGG TGCCAAACTT TGACGGCACG 350 

CTGCAGGAGC CGAAAATGCT GCCAGCGCGT CTGCCGAACA TCCTGCTGAA 4 00 

20 CGGCACCACC GGCATCGCGG TAGGCATGGC GACCGATATT CCTCCGCACA 450 

ACCTGCGGGA AGTGGCCAAA GCGGCGATTA CGCTGATTGA GCAGCCGAAA 500 

ACCACCCTCG ACGAACTGCT GGATATCGTA CAGGGGCCGG ATTTCCCGAC 550 

CGAGGCGGAG ATCATCACCT CGCGGGCGGA AATTCGCAAA ATCTACCAGA 600 

ACGGGCGCGG CTCA 614 

25 



2) INFORMATION FOR SEQ ID NO: 1785 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella terrigena 
40 (B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1785 

GCCTGAAACC GGTCCAGCGG CGCATCGTTT ATGCGATGTC CGAACTGGGT 50 

45 CTGAACGCCA CCGCTAAATT CAAAAAATCC GCGCGCACCG TCGGCGACGT 100 

GCTGGGTAAA TATCACCCGC ACGGCGATAG CGCCTGCTAT GAGGCGATGG 150 

TGCTGATGGC GCAGCCGTTC TCTTACCGCT ACCCGCTGGT GGACGGTCAG 200 

GGCAACTGGG GCGCCCCGGA CGATCCCAAA TCCTTCGCCG CGATGCGTTA 250 

TACCGAATCC CGCCTGTCAA AGTATGCGGA GCTGCTGCTG GGCGAGCTGG 300 

50 GTCAGGGAAC CGTTGACTGG GTACCTAACT TTGACGGTAC GATGCAGGAG 350 

CCGAAAATGC TGCCTGCGCG TTTGCCGAAT ATTCTGCTCA ACGGCACCAC 400 

CGGTATCGCC GTGGGGATGG CCACCGATAT TCCGCCGCAC AACCTGCGCG 450 

AAGTGGCCAA AGCGGCCATC ACCCTGATTG AAAAGCCGCA GACCTCGCTC 500 

935 
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GACGAACTGC TGGATATCGT TCACGGGCCG GACTACCCCA CCGAAGCTGA 
AATCATCACC CCGCGCGCCG AGATCCGCAA AATCTATCAG AACGGTCGCG 
GCTCGGTTCG CATGCGTGCG GTGTGGAAAA AAGAGGACGG CGCGGTGGTG 
ATTAGCGCCC TGCCGCAT 



550 
600 
650 
668 



2) INFORMATION FOR SEQ ID NO: 1786 



(i) SEQUENCE CHARACTERISTICS : " - - 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1786 

CATTACGTTC TAACACTCAA GGACGCGGAA CATTCTCTAT GGTGTTTGAC 50 
CACTATGAAG AAGTACCAAA GTCTGTTTCT GAAGAAATTA TCAAAAAAAA 100 
TAAAGGTGAA TAA 113 



2) INFORMATION FOR SEQ ID NO: 1787 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1787 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 
TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 1788 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bacillus' anthracis - - 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1788 

15 GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG' TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1789 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 118 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1789 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 

TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 

AAAAATAAAG GTGAATAA 118 

40 



2) INFORMATION FOR SEQ ID NO: 1790 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

937 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1790 



GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 
AAAAAAATAA AGGTGAATAA 



50 
100 
120 



2) INFORMATION FOR SEQ ID NO: 1791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1791 

GCAACGTCAT TACGTTCTAA CACGCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1792 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1792 

CGTCATTACG TTCTAACACT CAAGGACGCG GAACATTCTC TATGGTGTTT 50 
GACCACTATG AAGAAGTACC AAAGTCTGTT TCTGAAGAAA TTATCAAAAA 100 
AAATAAAGGT GAATAA 116 



938 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



2) INFORMATION FOR SEQ ID NO: 1793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(.ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1793 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 

GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 

AAAAAAATAA AGGTGAATAA 120 

20 



2) INFORMATION FOR SEQ ID NO: 1794 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
35 (B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1794 

CACTCAAGGA CGCGGAACAT TCTCTATGGT GTTTGATCAC TATGAAGAAG 50 
40 TACCAAAGTC TGTTTCTGAA GAAATTATCA AAAAAAATAA AGGTGAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1795 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1795 



GCAACGGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCTGAA GAAATTATTA 
AAAAAAATAA AGGTGAATAA 



50 
100 
120 



2) INFORMATION FOR SEQ ID NO: 1796 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 64 62 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1796 

GCGACAGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATTA 100 
AAAAAAATAA AGGCGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1797 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 
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2) INFORMATION FOR SEQ ID NO: 1798 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1798 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGAGACGCAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCCATATAAA 100 

CTAAGGAGGA ATTTAGA 117 

20 



2) INFORMATION FOR SEQ ID NO: 1799 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
35 (B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1799 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
40 TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



45 2) INFORMATION FOR SEQ ID NO: 1800 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) . TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1800 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
- TGGGACGCAA GT.TTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
10 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1801 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1801 

30 TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACTTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1802 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 117 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1802 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
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5 2) INFORMATION FOR SEQ ID NO: 1803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

lb" (CY STRANDEDNESS": "Double" " 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: • Bacillus cereus 

(B) STRAIN: ATCC 7064 



20 



25 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1803 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGAGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1804 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM Bacillus cereus 

(B) STRAIN: ATCC 49064 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1804 



TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCACTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
45 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1805 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

943 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



25 



(C > STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 64 62 



10 (xi) SEQUENCE DESCRIPTION: "SEQ ID NO: 1805 - - 

TTGATTTTTA TCAATTGTTC GAGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGTAA GTTTCGCTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAGA 117 

15 



2) INFORMATION FOR SEQ ID NO: 1806 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
30 (B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1806 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
35 TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



40 2) INFORMATION FOR SEQ ID NO: 1807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1807 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

5 CTAAGGAGGA ATTTAGA 117 



.2) INFORMATION FOR SEQ ID NO: 1808 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1808 

25 TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1809 



(i) SEQUENCE CHARACTERISTICS: 
(A) 1 LENGTH: 117 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1809 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAG 117 

50 



2) INFORMATION FOR SEQ ID NO: 1810 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 1810 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG 
GACGCTGCTC CAGAAGAAAG AGAGCGCGGA 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA 
GTCACGCTGA CTATGTTAAA AACATGAT 



CCCCATGTTA ACATCGGTAC 50 
ATTAACTGCT GCGATCACTA 100 
CACGCGGATA CGATCAAATC 150 
ATCACAATCT CAACTGCACA 200 
TGCACACGTT GACTGCCCAG 250 

278 



2) INFORMATION FOR SEQ ID NO: 1811 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1811 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1812 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 270 bases 
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WO 01/23604 PCT/CA00/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
_ .(B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1812 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

15 CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA 270 



2) INFORMATION FOR SEQ ID NO: 1813 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1813 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

40 CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GACGCTGCTC CAGAAGAAAG AGAGCGCGGA ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCATGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1814 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 266 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1814 

10" ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT , ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

15 GTCACGCTGA CTATGT 266 



20 



2) INFORMATION FOR SEQ ID NO: 1815 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 



(xi) SEQUENCE DESCRIPTION: SEQ 

35 ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTACTTGC AAAAGCTGGT GGTGCTGAAG 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA 

40 GTCACGCTGA CTATGTTAA 



ID NO: 1815 

CCCCATGTTA ACATCGGTAC 50 

ATTAACTGCT GCGATCACTA 100 

CACGCGGATA CGATCAAATC 150 

ATCACAATCT CAACTGCACA 200 

TGCACACGTT GACTGCCCAG 250 

269 



45 



2) INFORMATION FOR SEQ ID NO: 1816 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1816 



ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 


AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 


. . CAGTTCTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 


10 GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT 


ATCACAATCT" 


CAACTGCACA 


- 200 


CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


GTCACGCTGA 


CTATGTTA 








268 



15 



25 



2) INFORMATION FOR SEQ ID NO: 1817 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 278 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1817 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

35 GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1818 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 268 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 . 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1818 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 
AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 
GTCACGCTGA CTATGTTA 



50 
100 
150 
200 
250 
268 



2) INFORMATION FOR SEQ ID NO: 1819 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) "TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1819 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1820 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1820 
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ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT 
5 CGTTGAGTAC GAAACTGAAA CTCGTCACTA 
GTCACGCTGA CTATGTTAAA AACATGAT 



PCT/CA00/01150 

CCCCATGTTA ACATCGGTAC 50 
ATTAACTGCT GCGATCACTA 100 
CACGCGGATA CGACCAAATC 150 
ATCACAATCT CAACTGCACA .200 
TGCACACGTT GACTGCCCAG 250 

278 



10 2) INFORMATION FOR SEQ ID NO: "1821" 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 bases 

(B) TYPE: • Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 4 9064 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1821 



ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 


AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 


CAGTACTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 


GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT 


ATCACAATCT 


CAACTGCACA 


200 


30 CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


GTCACGCTGA 


CTA 








263 



35 2) INFORMATION FOR SEQ ID NO: 1822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1822 

CAATCGAAGT ACAACGTTCT CTTCGWGTAT TGGACGGTGC GGTTACTGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TACGGAGTTC CACGTATCGT ATTTGCTAAC AAAATGGACA 150 
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AAATCGGTGC 
CAAGCAAACG 
CCGTGGTATC 
ACCTTGGTAC 
5 CAAGCTCAAG 
TGAAGACTTG 
AATTGAAAGC 
GTATTGTGTG 
TGCGGTTATC. 

10 GTATCAACCC 
GAGCCATTCG 
TCGTTTGACA 
ACGTATTGAA 
CAAATGCACG 

15 TATCGCTGCT 
CAGATGAAAA 
GTTATCCAAT 
GGGTATCGCC 
AAMCAAACGT 

20 CACTTGGACG 
GAACGTAGGT 
CTCAAGCACG 
GGTGATGTAT 
ATTCGAAAAC 

25 CGGTTGAAAA 
TACCCAATGG 
TGTCGACTCA 
AAGAAGCTGC 
GTAACCATCA 

30 AACTGCTCGT 
AAATCGTTCG 



TGACTTCCTT 
CACACCCAAT 
ATCGACTTGA 
AGATATCCTT 
AATACCGTGA 
ATGATGAAAT 
TGCTATCCGT 
GTTCTGCCTT 
GACTACCTTC 
AGATWCAGAT 
CAGCTCTTGC 
TTCTTCCGTG 
CACATCTAAA 
CTAACAGCCG 
GCCGTTGGTT 
AGCTAAAATC 
TGATGGTTGA 
CTTCAAAAAT 
TGAAACTGGT 
TCCTTGTTGA 
GCTCCTCAAG 
TGGATTCTTC 
GGATTGAATT 
GCAATCGTCG 
AGGTTTGGTA 
TTGACGTTAA 
TCTGAAACTG 
TAAGTCAGCA 
CTGTTCCAGA 
CGTGGACGTG 
TGCTTACG 



TACTCAGTAA 
CCAATTGCCA 
TCAAGATGAA 
GAAGAAGATA 
AAAATTGGTT 
ACCTTGAAGG 
AAAGCAACTA 
CAAGAACAAG 
CAAGCCCACT 
GAAGAAGAAA 
CTTCAAGATC 
TATACTCARG 
GGTAAACGTG 
TCAAGAAATT 
TGAAAGATAC 
ATCCTTGAGT 
GCCAAAATCT 
TGGCTGAAGA 
GAAMCAGTTA 
CCGTATGCGT 
TATCTTACCG 
AAACGTCAGT 
TACTCCAAAC 
GTGGTGTGGT 
GAATCTATGG 
AGCTAAGCTT 
CCTTCAAGAT 
CAACCAGCTA 
AGAAAACCTT 
TAGATGGTAT 



GCACACTTCA 
ATCGGTGCTG 
AGCTGAAATC 
TTCCAGCTGA 
GAAGCAGTCG 
TGAAGAAATC 
TCAACGTTGA 
GGTGTTCAAT 
TGATATCCCA 
CTCGTCCAGC 
ATGACGGACC 
TGTTCTCCAA 
AACGTATCGG 
GACACTGTTT 
TYCAACTGGT 
CAATCAACGT 
AAAGCTGACC 
AGATCCAACA 
TCTCAGGTAT 
CGTGAGTTCA 
TGAAACATTC 
CTGGTGGTAA 
GAAGAAGGTA 
TCCTCGTGAA 
CTAACGGTGT 
TACGATGGTT 
CGCGGCTTCA 
TCCTTGAGCC 
GGTGATGTTA 
GGAAGCACWC 



CGACCGTCTT 
AAGATGACTT 
TATACTAACG 
ATACCTTGAC 
CTGAAACTGA 
ACTAACGAAG 
ATTCTTCCCA 
TGATGCTTGA 
GCGATCAAAG 
ATCTGACGAA 
CATTTGTAGG 
TCARGKTCTT 
ACGTATCCTT 
ACTCAGGTGA 
GACTCATTGM 
TCCAGAMCCA 
AAGATAAGAT 
TTCCGCGTTG 
GGGTGAGCTT 
AAGTGGAAGC 
CGCGCTTCTA 
AGGTCAATTC 
AAGGATTCGA 
TTTATCCCAG 
TCTTGCAGGT 
CATACCACGA 
CTTGCCCTTA 
AATGATGCTT 
TGGGTCACGT 
GGTAACAGCC 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1668 



35 2) INFORMATION FOR SEQ ID NO: 1823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1823 

AGACCTGCGT TCACAAACAC AGGGTCGTGC TTCTTACTCT ATGGAGTTCT 50 
TGAAGTACAA CGAAGCGCCA AACAACGTTG CTACAGCAAT CATTGAAGCT 100 
CGTAAGGCTA GATAA 115 
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2) INFORMATION FOR SEQ ID NO: 1824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

_(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 , 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1824 

CTGCGTTCAC TGACCAAGGT CGTGCATCTT ACTCCATGGA ATTCCTGAAG 50 

TATGATGACG CGCCAAACAA CGTAGCTCAG GCCGTAATCG AAGCTCGCGG 100 

TAAATAA 107 

20 



10 



2) INFORMATION FOR SEQ ID NO: 1825 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
35 (B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1825 

TTACTCCATG GAGTTCCTGA AGTATGATGA TGCGCCGAAC AACGTTGCTC 50 
40 AGGCCGTAAT CGAAGCCCGT GGTAAATAA 7 9 



2) INFORMATION FOR SEQ ID NO: 1826 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1826 



CAGCTGCGTT CTCTGACCAA AGGTCGTGCA TCATACACTA TGGAATTCCT 
GAAGTATGAT GATGCGCCAA ACAACGTTGC TCAGGCCGTT ATTGAAGCCC 
GTGGTAAGTA A 



50 
100 
111 



2) INFORMATION FOR SEQ ID NO: 1827 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1827 

GATTTGCGCT CTGCAACTCA TGGGCGTGCT TCGTACTCCA TGGAGTTCTT 50 
GAAGTACTCT GATGCACCGC AAAACATTGC GAAAGCGATT ATTGAATCTC 100 
GTAGCTAA 108 



2) INFORMATION FOR SEQ ID NO: 1828 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1828 

CTCAGCTGCG TTCTCTGACC AAAGGTCGTG CATCTTACTC CATGGAATTC 50 
CTGAAGTATG ATGATGCGCC TAACAACGTT GCTCAGGCCG TTATTGAAGC 100 
TCGTGGCAAA TAA 113 
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2) INFORMATION FOR SEQ ID NO: 1829 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1829 

GCCGCAGGGT TAAAACCAAA GTCCCGTGCT CTCTCCTGAA GGGGAGAGCA 50 
CTATAGTAAG GAATATAGCC . 70 



2) INFORMATION FOR SEQ ID NO: 1830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1830 

GCCTCGGGTA AAACTTATAT CCCAGTCCCC CTCGTATAGA GGGGGATAGA 50 
GTAAAGGAAG ATAATC 66 



2) INFORMATION FOR SEQ ID NO: 1831 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1831 

5 

TCCACAGGAT TAAAACCCAG GTTTAAACCT AAGTCCCGTG CTCTCTCCTC 50 
AGGGGAGAGC ACAATAGTAA GGAATATAGC C 81 



10 



20 



30 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1832 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 70 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
'(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1832 

GCTACTAGTT TAAAACATTG ATCCCGTGCT CTCTCTATGA AGGGAGAGCA 50 
CAAGAGTAAG GAATAAAGCC 70 



2) INFORMATION FOR SEQ ID NO: 1833 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 72 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1833 



TTTCCAGTTA CGACATAAAT GTTATTATGG TCCAGCTTTG ACTGGACTAT 50 
TCTGAAAAGA AAGGAATATA TC 72 



2) INFORMATION FOR SEQ ID NO: 1834 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 (vi) ORIGINAL SOURCE: - - • - ...... 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1834 

GCCCCGGGTT TTAAAAAACA TTGATCCCGT GCTCTCTCCA GAAGGGGAGA 50 
GCGCAACAGT AAGGAATATA GCC 73 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1835 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 795 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 

35 (xi) SEQUENCE DESCRIPTION: SEQ IDvNO: 1835 

CTGCAGCTGA TGGTCCTATG CCACAAACTA GAGAGCACAT CCTTCTATCA 50 

CGCCAAGTAG GTGTTCCATA TATCGTTGTA TTTATGAATA AAGCAGATAT 100 

GGTTGATGAT GCTGAACTTT TAGAATTGGT TGAAATGGAA ATTAGAGAAT 150 

40 TATTAAGCTC TTATGATTTC CCAGGTGATG ACACACCTAT TATTTCAGGT 200 

TCTGCTTTAA AAGCTCTTGA AGAAGCAAAA GCTGGACAAG ATGGCGAATG 250 

GTCAGCGAAA ATTATGGATC TTATGGCTGC TGTTGATAGC TATATTCCAA 300 
CTCCAACTCG TGACACTGAA AAAGATTTCT TAATGCCAAT TGAAGATGTT • 350 

TTCTCAATTT CAGGTCGTGG TACTGTTGTT ACAGGTAGAA TTGAAAAAGG 400 

45 TATTGTAAAA GTTGGTGATA CTATAGAAAT CGTTGGTATT AAAGATACTC 450 

AAACAACAAC TGTAACTGGC GTTGAAATGT TTAGAAAAGA AATGGACCAA 500 

GGTGAAGCAG GGGATAATGT TGGTGTTCTT CTTCGTGGTA CAAAAAAAGA 550 

AGAAGTTATC CGCGGTATGG TTCTTGCTAA ACCAAAATCA ATTACTCCAC 600 

ATACTGATTT CGAAGCTGAA GTTTATATCC TAAATAAAGA TGAGGGTGGT 650 

50 AGACATACTC CATTCTTTAA TAACTATAGA CCGCAATTCT ATGTAAGAAC 700 

AACAGATGTA ACAGGTTCTA TTAAATTAGC TGATGGCGTT GAAATGGTTA 750 

TGCCTGGTGA AAATGTAAGA ATTACTGTAA GCTTGATTGC ACCAG 7 95 
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2) INFORMATION FOR SEQ ID NO: 1836 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1836 

GCCATACTAG TTGTTTCTGC AGCTGATGGC CCAATGCCAC AAACTAGAGA 50 

GCACATTTTG CTATCTCGTC AAGTTGGTGT TCCATATATA GTTGTTTTTA 100 

TGAACAAAGC TGATATGGTA GATGACGCAG AGTTGCTAGA ATTAGTTGAA 150 

ATGGAGATCA GAGAGTTATT AAGCGAATAT GACTTCCCTG GTGATGATAC 200 

TCCTATTATA AGCGGATCAG CACTTCAAGC TCTTGAAGAA GCTAAAGCTG 250 

GTAATGATGG CGAATGGTCA GCTAAGATTA TGGATCTTAT GGCTGCTGTT 300 

GATAGCTACA TACCAACTCC AGTTCGTGCT ACTGATAAAG ATTTCTTAAT 350 

GCCGATTGAA GACGTATTCT CAATTTCTGG CCGTGGTACT GTTGTTACTG 400 

GTAGAATTGA AAAAGGTATA GTTAAAGTTG GTGATACTAT CGAAATCGTA 450 

GGTATTAGAG ATACACAAAC TACAACAGTT ACCGGCGTTG AAATGTTTAG 500 

AAAAGAAATG GATCAAGGCG AGGCTGGTGA TAACGTTGGT GTTCTTTTAC 550 

GCGGTACAAA GAAAGAAGAC GTTGAAAGAG GTATGGTTCT TTGTAAGCCA 600 

AAATCAATTA CTCCTCATAC TAAATTTGAG GGAGAAGTTT ATATCTTGAC 650 

TAAGGAAGAG GGCGGTAGAC ATACTCCATT CTTCAACAAC TATAGACCAC 700 

AATTTTATGT AAGAACAACA GATGTTACTG GATCAATCAC TCTTCCAGAG 750 

GGTACTGAGA TGGTTATGCC TGGTGATAAC TTAAAAATCA CTGTTGAGTT 800 

AATCAACCCA GTTGCTC 817 



2) INFORMATION FOR SEQ ID NO: 1837 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerea Jis 

(B) STRAIN: ATCC 33561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1837 

CCATACTAGT TGTTTCTGCA GCTGATGGCC CAATGCCACA AACTAGAGAG 50 
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CACATTTTGC TATCTCGTCA AGTTGGTGTT CCATATATAG TTGTTTTTAT 100 

GAACAAAGCT GATATGGTAG ATGACGCAGA GTTGCTAGAA TTAGTTGAAA 150 

TGGAGATCAG AGAGTTATTA AGCGAATATG ACTTCCCTGG TGATGATACT 200 

CCTATTATAA GCGGATCAGC ACTTCAAGCT CTTGAAGAAG CTAAAGCTGG 250 

5 TAATGATGGC GAATGGTCAG CTAAGATTAT GGATCTTATG GCTGCTGTTG 300 

ATAGCTACAT ACCAACTCCA GTTCGTGCTA CTGATAAAGA TTTCTTAATG 350 

CCGATTGAAG ACGTATTCTC GATTTCTGGC CGTGGTACTG TTGTTACTGG 400 

TAGAATTGAA AAAGGTATAG TTAAAGTTGG TGATACTATC GAAATCGTAG 4 50 

GTATTAGAGA TACACAAACT ACAACAGTTA CCGGCGTTGA AATGTTTAGA 500 

10 " AAAGAAATGG ATCAAGGCGA GGCTGGTGAT AACGTTGGTG- TTCTTTTACG - - 550. 

CGGTACAAAG AAAGAAGACG TTGAAAGAGG TATGGTTCTT TGTAAGCCAA 600 

AATCAATTAC TCCTCATACT AAATTTGAGG GAGAAGTTTA TATCTTGACT 650 

AAGGAAGAGG GCGGTAGACA TACTCCATTC ( TTCAACAACT ATAGACCACA 700 

ATTTTATGTA AGAACAACAG ATGTTACTGG ' ATCAATCACT CTTCCAGAGG 750 

15 GTACTGAGAT GGTTATGCCT GGTGATAACT TAAAAATCAC TGTTAGTT 7 98 



2} INFORMATION FOR SEQ ID NO: 1838 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1838 

35 GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACTGCA GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGCGGTTCTG CACGCGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

40 GGCACGCCGA CTACGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCGATGC CACAGACTCG 350 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATTC ATGATCGTGT 400 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 450 

GAAATGGAAG TTCGTGAACT TCTGTCTGCT TATGATTTCC CGGGCGACGA 500 

45 CATCCCAGTG GTTCGTGGTT CAGCTCTGAA AGCGCTGGAA GGCGAAGCAG 550 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG GTCACCTGGA TAACTACATC 600 

CCAGAACCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCC ATCTCCGGCC GTGGTACTGT TGTTACTGGT CGTGTAGAGC 700 

GTGGTATCAT TAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAT 750 

50 ACCGTGAAAT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATTAAAC 850 

GTGAAGATAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCTATCAAG 900 

CCGCACACTC AGTTCGAATC AGAAGTTTAT ATCCTGTCCA AAGATGAAGG 950 
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CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA 
GTACAACTGA CGTGACTGGC ACCATCGAAC 
GTAATGCCGG GCGACAACAT TCAAATGGTT 
CGCAATGGAC GACGGT 



CCGTCCACAG TTCTACTTCC 1000 
TGCCAGAAGG CGTTGAGATG 1050 
GTTACCCTGA TCCACCCAAT 1100 

1116 



2) INFORMATION FOR SEQ ID NO: 1839 

10 (i) SEQUENCE- CHARACTERISTICS : - 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE:. Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
20 (B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1839 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

25 TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACCGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTGCTG CTCGCGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

30 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 450 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 500 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCTCTGGAA GGCGACGCTG 550 

35 AGTGGGAATC TAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 750 

ACTGCTAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

40 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 900 

CCGCACACCA AGTTCGAATC TGAAGTTTAT ATCCTGTCCA AAGACGAAGG 950 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

45 GTTATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGA 1109 



50 2) INFORMATION FOR SEQ ID NO: 1840 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1108 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1840 



GTGTCTAAAG AAAAATTTGA ACGTACTAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCA GCTATCACTA 100 

15 CCGTACTGTC TAAAGTATAC GGTGGTCAGG CTCGTGCATT CGATCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTAGAGTAC GACACCCCAA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

GTCACGCCGA CTACGTGAAG AACATGATCA CTGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTAGTAGC TGCGACTGAC GGCCCAATGC CTCAGACTCG 350 

20 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 400 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 4 50 

GAAATGGAAG TACGTGAGCT GCTGTCTCAG TACGACTTCC CAGGCGACGA 500 

TACTCCAGTT GTTCGCGGTT CTGCACTGAA AGCGCTGGAA GGCGATGCTC 550 

AGTGGGAAGA GAAGATTGTT GAACTGGCAG GCTACCTGGA CAGCTACATC 600 

25 CCTGAGCCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATCTCCGGCC GTGGTACTGT AGTAACTGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 750 

ACTACCAAGA CTACTTGTAC TGGCGTTGAA ATGTTCCGTA AGCTGCTGGA 800 

CGAAGGTCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 850 

30 GTGATGACGT AGAGCGTGGT CAGGTTCTGG CTAAGCCAGG CTCAATCAAC 900 

CCACACACCA ACTTTGTAGC AGAAGTTTAT ATTCTGTCCA AAGATGAAGG 950 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

GTAATGCCAG GTGACAACAT TCAAATGGTT GTTACCCTGA TTGCACCAAT 1100 

35 CGCGATGG 1108 



2) INFORMATION FOR SEQ ID NO: 1841 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1841 

961 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TGGCAAAAGC TAAATTTGAA CGTATTAAGC CTCACGTAAA CGTGGGCACC 50 

ATTGGTCACG TTGACCATGG TAAAACCACT CTGACTGCAG CTATCTCTCA 100 

CGTACTGGCT AAGACCTACG GTGGCGAAGC TAAAGACTTC TCTCAAATCG 150 

5 ATAACGCTCC AGAAGAGCGT GAGCGCGGTA TTACCATCAA TACCTCTCAC 200 

ATCGAATATG ACACGCCATC ACGCCACTAC GCCCACGTAG ACTGCCCAGG 250 

CCACGCTGAC TATGTTAAAA ACATGATCAC TGGTGCTGCA CAGATGGACG 300 

GCGCGATTCT GGTAGTCGCT TCAACAGACG GTCCAATGCC ACAGACTCGT 350 

GAGCACATCC TGCTTTCTCG TCAGGTTGGC GTACCATTCA TCATCGTATT 400 

10 CATGAACAAA TGTGACATGG~TAGATGACGA~AGAGCTGTTA GAGCTAGTTG - 450 

AGATGGAAGT GCGTGAACTG TTATCAGAAT ACGATTTCCC AGGTGATGAC 500 

TTACCGGTAA TCCAAGGTTC AGCTCTGAAA GCGCTAGAAG GCGAGCCAGA 550 

GTGGGAAGCA AAAATCCTTG AATTAGCAGC GGCGCTGGAT TCTTACATTC 600 

CAGAACCACA ACGTGACATC GATAAGCCGT TCCTACTGCC AATCGAAGAC 650 

15 GTATTCTCAA TTTCAGGCCG TGGTACAGTA GTAACAGGTC GTGTTGAGCG 700 

TGGTATTGTA CGCGTAGGCG ACGAAGTTGA AATCGTTGGT GTACGTGCGA 750 

CAACTAAGAC AACGTGTACT GGTGTAGAAA TGTTCCGTAA ACTGCTTGAC 800 

GAAGGTCGTG CAGGTGAGAA CTGTGGTATT TTGTTACGTG GTACTAAGCG 850 

TGATGACGTA GAACGTGGTC AAGTATTAGC GAAGCCAGGT TCAATCAACC 900 

20 CACACACTAC TTTTGAATCA GAAGTTTACG TACTGTCAAA AGAAGAAGGT 950 

GGTCGTCACA CGCCATTCTT CAAAGGCTAC CGTCCACAGT TCTACTTCCG 1000 

TACAACTGAC GTAACCGGTA CTATCGAACT GCCAGAAGGC GTAGAGATGG 1050 

TAATGCCAGG CGATAACATC AAGATGGTAG TGACACTGAT TTGCCCAATC. 1100 

GCGATGG 1107 

25 



2) INFORMATION FOR SEQ ID NO: 1842 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 
40 (B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1842 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

45 AATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CACGTGCATT CGACCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

50 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTAT 400 

TCCTGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAGCTGGTA 450 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CAGGCAATGA 500 

962 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TACTCCAATC ATCCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCGTA GAACTGGCTG AAACTCTGGA TTCTTACATC 600 

CCAGAACCAG AACGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTTACCGGT CGTGTAGAGC 700 

5 GCGGTATCGT TAAAGTTGGT GAAGAAGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTTAAAT CAACTTGTAC CGGCGTTGAA ATGTTCCGTA AACTGCTGGA 800 

CGAAGGTCGT GCAGGCGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 850 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCTATCAAA 900 

CCACACACCA AGTTCGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

10 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAAATGATC GTTACCCTGA TCCACCCAAT 1100 

CGCAATGGAC GATGGT , 1116 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1843 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1129 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1843 



GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACAAC TCTGACTGCT GCTATCACCA 100 

35 GCGTTTTAGC TAAAACTTAT GGCGGTAACG CTCGTGCATT CGATCAAATC 150 

GATAATGCAC CAGAAGAAAA AGCACGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAGTAT GATACTCCTG CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GACACGCCGA CTATGTGAAA AACATGATCA CCGGTGCTGC TCAAATGGAC 300 

GGCGCGATCT TAGTTGTTGC GGCAACTGAT GGTCCTATGC CACAGACTCG 350 

4 0 TGAGCACATC CTGTTAGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTGT 4 00 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAACTGTT AGAATTAGTT 4 50 

GAAATGGAAG TTCGTGAGCT TCTTTCTGCT TATGATTTCC CTGGTGACGA 500 

TACTCCAGTT GTTCGTGGTT CTGCGCTGAA AGCGTTAGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCATT GAATTAGCCG GATATCTGGA TAGCTACATC 600 

45 CCAGAGCCAG AGCGTGCGAT TGACCGTCCG TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATTTCAGGCC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 700 

GCGGAATCGT TAAAGTCGGT GAAGCCGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTACAAA CTACCTGTAC TGGCGTTGAA ATGTTCCGTA AGTTACTTGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 850 

50 GTGAAGAAAT CGAACGCGGT CAAGTACTGG CTAAGCCAGG TTCAATCAAC 900 

CCGCACACCA ACTTTGTATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

963 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

GTGATGCCTG GTGACAACAT TCAGATGACT GTAACTCTGA TTGCACCAAT J 100 
CGCGATGGAC GAAGGTTTAC GCTTCGCTA 1129 



2) INFORMATION FOR SEQ ID NO: 1844 

(i) SEQUENCE CHARACTERISTICS: 
... . (A) LENGTH: 810 bases 
10 (B) TYPE: "Nucleic "acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear. 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1844 

ATATTCATAA TGCATTACAA GTTGTGAAAA CAACAAGTGA CGGAAGTGAA 50 

AAGACTGTTA CATTGGAAAC TGCTGTAGAA TTAGGGGATG GTGCAGTTCG 100 

TACGATTGCC ATGGAATCTA CAGATGGTTT GCAACGTGGC ATGAAAGTAG 150 

25 TGGACTTAGG ACGCACAATT AGCGTTCCTG TGGGACCTGA AACATTAGGT 200 

CGTGTATTCA ACGTTTTAGG AGATACAATC GACTTGAAAG AACCATTCCC 250 

AGAAGACTTT ACAAGACATG AAATCCATAA ACCAGCACCA AAATTTGAAG 300 

AATTAAACAG TCAATATGAA ATTCTACAAA CAGGGATTAA AGTTATTGAC 350 

CTTTTAGCAC CTTATCTTAA AGGTGGTAAA ATCGGTTTAT TCGGTGGTGC 400 

30 CGGTGTAGGG AAAACCGTAT TAATTCAAGA ATTAATTCAT AATATCGCTG 450 

AAGAACTTGG TGGTATTTCA GTATTTACAG GGGTAGGGGA ACGTACTCGT 500 

GAAGGGAATG ACCTTTACCA TGAAATGCAA GAATCAGGCG TATCTGCTAA 550 

AACAGCGATG GTGTTTGGGC AAATGAACGA ACCACCAGGA GCTCGTATGC 600 

GTGTAGCACT AACAGGGTTA ACTATTGCGG AATACTTCCG TGATATGGAA 650 

35 AAACAAGACG TGCTTTTATT CATCGATAAC ATTTATCGTT TCACGCAAGC 700 

AGGTTCAGAA GTGTCAGCGT TACTTGGTCG TATGCCTTCT GCCGTAGGGT 750 

ATCAACCAAC ATTAGCGACA GAAATGGGTC AATTACAAGA ACGTATCAGT 800 

TCAACTAAAG 810 



40 



2) INFORMATION FOR SEQ ID NO: 1845 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 

964 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(B) STRAIN: ATCC 9345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1845 

5 TGCTCTGCCA GAAATTAACA ACGCCCTCCT CACCGAAGTA GACCTCTCCG 50 

GCCAAGGCGA AGGCGAAAGC GTTCTCAAGA TGACTCTTGA GGTTGCTCAG 100 

CACCTCGGCG ATAACATCGT CCGTACCATC GCCATGAAGC CAACCGACGG 150 

TCTGGTTCGC GGCGCCACCG TTATCGATAC CGGCGCCCCA ATCACCGTGC 200 
CAGTTGGCGA CGCAACTAAA GGTCATGTTT TCAACGTGAC CGGTGATGTC . 250 

10 " CTAAACTTGG GCGAAGGCGA AACCCTTGAC GTCAAGGAAC GGTGGCCAAT .300 

CCACCGCAAG GCTCCACAGT TCGACGAACT CGAACCGGAA ACCAAGATGT 350 

TCGAAACAGG CATCAAGGTG ATCGATCTCC TCACCCCATA CGTACAGGGC 400 

GGCAAGATCG GTCTGTTTGG CGGTGCTGGT GTTGGTAAGA CCGTTCTTAT 4 50 

CCAGGAAATG ATCCAGCGTG TTGCACAGGA TCATGGCGGT GTGTCCGTGT 500 

15 TCGCGGGTGT GGGTGAACGT ACCCGTGAAG GTAACGATCT TATCCACGAA 550 

ATGGAAGATG CGGGCGTTCT TGATAAGACC GCGCTTGTGT TCGGCCAGAT 600 

GGATGAACCG CCAGGGGTTC GTTTGCGTAT TGCACTTTCC GGCCTGACCA 650 

TGGCGGAATA CTTCCGTGAC GTGCAAAACC AGGACGTGCT TTTGTTCATC 700 

GATAACATCT TCCGCTTCAC CCAGGCAGGT TCGGAAGTGT CCACGTTGCT 750 

20 TGGCCGTATG CCATCAGCAG TGGGCTACCA GCCGACCTTG GCAGATKAAA 800 

TGGGCGCATT GCAGG 815 



25 2) INFORMATION FOR SEQ ID NO: 1846 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24670 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1846 



ACCTTCCTCC TATCTTGAAH GCCCTGGAGG 

TTAGTTTTAG AGGTGTCCCA GCATTTGGGT 

TGCTATGGAC GGTACTGAAG GATTGGTTCG 

CCGGATATCC TATTAGAGTT CCTGTCGGTC 

45 ATGAACGTTA TTGGCGAGCC TGTTGATGAG 

GAAGCTTGCA CCCATCCACG CTTCTCCCCC 



TAAGACTGTG TTTATCCAGG AGTTGATYAA 

50 GTGGTTACTC CGTGTTCGCT GGTGTTGGTG 

GATTTGTACC ACGAGATGAT TCAAACTGGT 

ATCCAAGGCT GCCCTTGTCT ACGGMCAAAT 

GTGCCCGMGT CGCTTTGACC GGTCTTACCG 



TCCAAAACCA 


CAGCTCTCGY 


50 


GAAAACACCG 


TTCGTACTAT 


100 


TGGTCAAAAT 


GTCGTAGATA 


150 


CTGAATGTTT 


GGGTCGTATC 


200 


CGCGGYCCTA 


TCAAGACCAA 


250 


CGAGTTCGTY 


GACCAATCCA 


300 


AGGTTGTCGA 


TTTGTTGGCC 


350 


TTCGGTGGTG 


CCGGTGTCGG 


400 


CAACGTTGCC 


AAGGCCCACG 


450 


AGCGTACTCG 


TGAGGGTAAC 


500 


GTCATCAAGC 


TTGATGGCCA 


550 


GAACGAGCCC 


CCAGGTGCTC 


600 


TTGCTGAATA 


CTTCCGTGAT 


650 



965 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GAGGAAGGHC AAGATGTGTT GCTCTTCATT 
CCAAGCTGGT TCTGAAGTGT CYGCCTTGTT 
TCGGTTACCA ACCCACCTTG GCCACCGATA 
ATTACCACCA CCAAGAAGGG TTCCATTACC 
5 CCCTGCTGAT GATTTGACCG ATCCCGCTCC 
TTGACGCCAC CACCGTGTTG TCTCGTTCCA 
CCCGCTGTCG ATCCCCTCGA CTCCAAGTCT 
TGTCGGTGAA GAGCACTACG ACATCGCCAC 
ARTCTTACAA GTCTCTCCAG GAT 



PCT/CAOO/01150 

GACAACATTT TCCGTTTCAC 700 

GGGTCGTATC CCCTCCGCTG 750 

TGGGTGTCAT GCAAGAGCGT 800 

TCTGTCCAGG CCATTTACGT 850 

TGCCACTACT TTTGCCCATC 900 

TCTCTGAGTT GGGTATTTAC 950 

CGTATGTTGG ATCCYCGTAT 1000 

TGGTGTTCAG AAGATTCTCC 1050 

1073 



2) INFORMATION FOR SEQ ID NO: 1847 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 
25 (B) STRAIN: ATCC 56220 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1847 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TGCCTCACCC TTTGGGTATT 50 

30 TTGCGAATAC TAATTATAGT AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAACGTA CTCGTGAGGG TAACGATTTG 150 

TACCACGAAA TGCAGGAAAC TGGTGTCATT CAGCTCGAGG GTGAATCCAA 200 

GGTCGCCCTC GTGTTCGGTC AGATGAACGA GCCCCCTGGT GCCCGTGCCC 250 

GTGTCGCTCT TACTGGTTTG ACCATTGCCG AGTACTTCCG TGACGAGGAG 300 

35 GGTCAAGATG TGCTTCTCTT CATTGACAAC ATTTTCCGTT TCACTCAGGC 350 

CGGTTCTGAG GTGTCTGCCC TTTTGGGTCG TATCCCCTCT GCCGTCGGTT 4 00 

ACCAGCCCAC TCTCGCCGTC GACATGGGTG TCATGCAGGA GCGTATTACC 4 50 

ACCACCACCA AGGGTTCCAT CACCTCCGTC 4 80 



2) INFORMATION FOR SEQ ID NO: 1848 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 566 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

966 
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WO 01/23604 



PCT/CAOO/01150 



(B) STRAIN: ATCC 14112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1848 

5 TGTCTTCATT CAGGAGTTGA TTGTACGTCC CTTCCTCTCT ACAAATGACG 50 

GGCGAGGAAA ATTTTTGGCT TTTTCTAATA GCTCGTTATA GAACAACATT 100 

GCCAAAGCCC ACGGTGGTTA CTCCGTTTTC ACTGGTGTCG GCGAGCGGAC 150 

CCGTGAAGGA AACGATTTGT ACCACGAGAT GCAGGAAACC CGTGTTATCC 200 

AGCTCGATGG CGAGTCTAAG GTCGCACTCG TCTTCGGTCA GATGAACGAG 250 

10 CCCCCCGGAG CCCGTGCCCG TGTTGCCCTC" ACTGGCCTGA GCATTGCTGA - 300 

ATATTTCCGT GACGAGGAAG GTCAAGACGG TATGTATTCA TATAAATTAC 350 

TCCGGGCAAA TTGACTCAGA ACCGCACTCA CTCACACATA TATTAGTGCT 400 

TCTCTTTATC GACAACATTT TCCGCTTCAC, CCAGGCCGGT TCCGAAGTGT 450 

CCGCCCTGCT TGGTCGTATT CCCTCCGCCG' TCGGTTACCA ACCCACTCTC 500 

15 GCCGTCGACA TGGGTGGTAT GCAGGAACGT ATCACAACCA CCACCAAGGG 550 

CTCCATTACC TYCGTG 566 



20 2) INFORMATION FOR SEQ ID NO: 1849 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 817 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1849 



AATGAAGCCA TTGTTGTAAA TTTTGAAAGT GAAGGCAAAA AACAAAAACT 50 

TGTTTTAGAA GTAGCAGCAC ACTTGGGCGA TAATAGAGTT AGAACTATTG 100 

CTATGGATAT GACAGATGGC TTGGTAAGAG GACTTAAAGC AGAAGCTTTG 150 

GGTGCTCCTA TTAGCGTTCC TGTGGGTGAA AAAGTTTTAG GAAGAATTTT 200 

40 TAATGTTACG GGAGATTTGA TCGATGAAGG TGAAGAAATT TCTTTTGATA 250 

AAAAATGGGC AATTCATAGA GATCCACCAG CTTTTGAAGA TCAAAGCACA 300 

AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTGGATT TACTTGCTCC 350 

TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA GGTGTTGGTA 400 

AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT TAAACATAGC 450 

45 GGCTATTCTG TATTTGCAGG TGTAGGTGAG AGAACTCGTG AAGGAAATGA 500 

CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA GTTGCTCTAT 550 

GTTATGGACA AATGAATGAA CCACCAGGGG CAAGAAATCG TATTGCTTTA 600 

ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG GTCTTGATGT 650 

GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA GGTTCTGAAA 700 

50 TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA TCAACCAACC 750 

CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT CAACTAAAAA 800 

AGGATCAATT ACTTCAG 817 



967 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1850 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1850 

AAGGCAACAC GCATAAACTT ATTTTAGAGA CTGCTGCACA CCTTGGAGAT 50 

AATCGTGTAA GAACTATCGC TATGGATATG AGCGAAGGAC TTACAAGAGG 100 

GTTAGATGCT ATAGCGCTTG GGTCGCCTAT CAGTGTTCCT GTTGGAGAAA 150 

20 AAGTTTTAGG AAGAATATTC AACGTAATTG GTGATCTTAT AGACGAAGGC 200 

GAAGAAGAAA AATTTGATAA AAAATGGTCG ATTCATAGAG ATCCGCCGGC 250 

ATTTGAAGAT CAAAGCACAA AAAGTGAAAT TTTTGAAACA GGTATAAAAG 300 

TCGTAGATCT TTTGGCTCCT TATGCAAAAG GCGGTAAAGT TGGACTATTT 350 

GGCGGTGCCG GCGTTGGTAA AACAGTTATC ATTATGGAAC TTATCCACAA 400 

25 CGTTGCATTC AAACACAGCG GCTATTCGGT ATTTGCCGGT GTCGGTGAAA 4 50 

GAACAAGAGA GGGTAACGAT CTTTATAATG AAATGAAAGA ATCCGGCGTT 500 

TTGGATAAAG TTGCCTTATG TTATGGACAA ATGAATGAAC CGCCGGGTGC 550 

AAGAAACCGT ATAGCGCTTA CTGGTCTTAC AATGGCTGAG TATTTTCGTG 600 

ACGAGATGGG ACTAGATGTT CTTATGTTTA TCGATAACAT CTTCCGTTTC 650 

30 TCACAATCAG GCTCAGAGAT GTCGGCTCTT CTTGGACGTA TCCCAAGTGC 700 

GGTTGGTTAT CAACCAACGT TAGCTAGCGA AATGGGAAGA CTTCAAGAAA 750 

GAATCACATC AACTAAAAAA GGTTC 775 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1851 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 793 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1851 

CGAAGCTATT GAAGTAAATT TTACAGTAGA AGGCAACACG CATAAACTTA 50 
TTTTAGAGAC TGCTGCACAC CTTGGAGATA ATCGTGTAAG AACTATCGCT 100 

968 
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WO 01/23604 



PCT/CA00/01150 



ATGGATATGA GCGAAGGACT TACAAGAGGG TTAGATGCTA TAGCGCTTGG 150 

GTCGCCTATC AGTGTTCCTG TTGGAGAAAA AGTTTTAGGA AGAATATTCA 200 

ACGTAATTGG TGATCTTATA GACGAAGGCG AAGAAGAAAA ATTTGATAAA 250 

AAATGGTCGA TTCATAGAGA TCCGCCGGCA TTTGAAGATC AAAGCACAAA 300 

5 AAGTGAAATT TTTGAAACAG GTATAAAAGT CGTAGATCTT TTGGCTCCTT 350 

ATGCAAAAGG CGGTAAAGTT GGACTATTTG GCGGTGCCGG CGTTGGTAAA 400 

ACAGTTATCA TTATGGAACT TATCCACAAC GTTGCATTCA AACACAGCGG 450 

CTATTCGGTA TTTGCCGGTG TCGGTGAAAG AACAAGAGAG GGTAACGATC 500 

. TTTATAATGA AATGAAAGAA TCCGGCGTTT TGGATAAAGT TGCCTTATGT 550 

10 TATGGACAAA TGAATGAACC GCCGGGTGCA AGAAACCGTA" TAGCGCTTAC 600 

TGGTCTTACA ATGGCTGAGT ATTTTCGTGA CGAGATGGGA CTAGATGTTC 650 

TTATGTTTAT CGATAACATC TTCCGTTTCT CACAATCAGG CTCAGAGATG 700 

TCGGCTCTTC TTGGACGTAT CCCAAGTGCG , GTTGGTTATC AACCAACGTT 750 

AGCTAGCGAA ATGGGAAGAC TTCAAGAAAG AATCACATCA ACT 793 

15 



2) INFORMATION FOR SEQ ID NO: 1852 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 
30 (B) STRAIN: ATCC 33236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1852 

GGACTATTTA CCGAAGATTA ACGAAGCTAT CGAGGTTAAA TTTGACGTCG 50 

35 AGGGCGCTCA TCGCAGGCTG ATCCTAGAGG TAGCCGCGCA CCTTGGAGAC 100 

AATCGCGTCC GCACGATCGC TATGGATATG AGCGATGGAC TTAGGCGAGG 150 

GCTTGAGGCC GTCGCTTTGG GCGCGCCTAT TACGGTGCCT GTGGGCGAGA 200 

AAGTTTTGGG TAGAATTTTT AATGTTACGG GCGATCTGAT CGACGAAGGC 250 

GAGGATGAAA AATTTGAAAC CCGCTGGTCG ATCCACAGAG ATCCGCCTAG 300 

40 CTTTGAAAAT CAAAGCACGA AGAGTGAAAT TTTTGAAACC GGCATTAAGG 350 

TAGTCGATCT GCTCGCCCCT TATGCAAAGG GCGGTAAGGT AGGACTATTC 400 

GGCGGTGCTG GCGTCGGTAA GACCGTCATC ATCATGGAAC TGATTCACAA 450 

CGTCGCTTTC AAACACAGCG GCTACTCCGT ATTTGCGGGT GTCGGCGAGC 500 

GAACGAGAGA GGGAAACGAC CTTTATAACG AGATGAAAGA ATCGGGCGTT 550 

45 TTGGATAAAG TCGCCTTGAC CTATGGTCAG ATGAACGAAC CGCCGGGAGC 600 

GAGAAACCGT ATCGCGCTAA CCGGTCTTAC GATGGCCGAG TATTTCCGCG 650 

ACGAGCTAGG GCTTGACGTT TTGATGTTTA TTGATAATAT CTTCCGCTTC 700 

TCGCAGTCGG GTTCGGAGAT GTCCGCGCTT TTAGGACGAA TTCCGTCCGC 750 

GGTCGGTTAT CAGCCTACGC TTGCCAGCGA AATGGGTAAA TTACAGGAGC 800 

50 GCATTACTTC TACTAAGAAG GGCTC 825 
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2) INFORMATION FOR SEQ ID NO: 1853 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1853 

TTTACCTCAA ATTAATGAAG CAATTGTTGT AAATTTTGAA AGCGAAGGAA 50 

AAAAACATAA ACTTGTTTTA GAAGTAGCAG CTCATTTAGG AGATAATAGA 100 

GTTAGAACTA TTGCTATGGA TATGACAGAT GGTTTGGTAA GGGGCTTAAA 150 

20 AGCTGAGGCT TTAGGTGCTC CTATTAGTGT TCCTGTTGGT GAGAAAGTTT 200 

TAGGAAGAAT TTTCAATGTT ACTGGAGATT TGATCGATGA AGGTGAAGAA 250 

ATTTCTTTTG ATAAAAAATG GGCAATTCAT AGAGATCCGC CAGCTTTTGA 300 

AGATCAAAGC ACAAAAAGTG AGATTTTTGA AACAGGGATT AAAGTTGTAG 350 

ATTTGCTTGC TCCTTATGCA AAAGGTGGTA AAGTAGGTCT TTTTGGTGGT 400 

25 GCAGGTGTTG GTAAAACTGT TATTATTATG GAGCTTATTC ACAATGTTGC 450 

ATTTAAGCAT AGCGGCTATT CTGTATTTGC AGGTGTGGGT GAGAGAACTC 500 

GTGAAGGAAA TGACCTTTAT AATGAAATGA AAGAAAGTAA TGTTTTAGAC 550 

AAAGTTGCTC TATGTTATGG ACAAATGAAT GAACCACCAG GAGCAAGAAA 600 

TCGTATTGCT TTAACAGGTT TAACAATGGC TGAGTATTTT AGAGATGAAA 650 

30 TGGGTCTTGA TGTGCTTATG TTTATTGATA ATATCTTTAG ATTTTCACAA 700 

TCAGGTTCTG AAATGTCAGC ACTTTTAGGA AGAATTCCAT CAGCTGTGGG 750 

TTATCAACCA ACCCTAGCAA GTGAAATGGG TAAATTCCAA GAAAGAATTA 800 

CTTCAACTAA AAAAGGCT 818 
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2) INFORMATION FOR SEQ ID NO: 1854 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1854 

ATTACCTGAT ATCAACAACG CCTTATTGGT CTATAAAAAT GATGAACAAA 50 
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AAAGTAAAAT TGTGCTAGAA GCTGCCTTAG 
CGTACAATTG CCATGGAATC AACGGATGGT 
TGTCGATACT GGTAAACCAA TTTCAGTTCC 
GACGTGTCTT TAACGTTTTA GGGGATACGA 
5 GCACAAGATG CAGATCGTTC TGCAATTCAT 
AGACTTAAGT ACAAGTACTG AAATTTTAGA 
ACTTATTAGC ACCATATTTA AAAGGTGGTA 
GCCGGAGTAG GTAAAACCGT TTTAATCCAA 
ACAAGAACAT GGTGGGATTT CTGTATTTAC 

10 GTGAAGGAAA TGACTTGTAT CATGAAATGC 
AAAACTGCCA TGGTGTTTGG TCAAATGAAC 
GCGTGTTGCT TTAACTGGGT TAACGATTGC 
AAGGACAAGA TGTGTTGCTA TTTATTGATA 
GCGGGTTCTG AAGTATCAGC CTTGCTTGGT 

15 TTATCAACCT ACATTGGCTA CAGAAATGGG 
CTTCAACTAA GAAGGGCTCT ATCACTTCTA 



AATTAGGTGA TGGCATCATT 100 

TTACAACGTG GGATGGAAGT 150 

AGTTGGTAAA GAAACGCTAG 200 

TTGATATGCA AGAACCATTT 250 

AAAGCTGCAC CAAAATTTGA 300 

AACAGGGATT AAAGTTATCG 350 

AAGTCGGTCT ATTCGGGGGT 400 

GAATTAATCC ATAATATTGC 450 

CGGTGTTGGT GAACGTACAC 500 

GTGATTCAGG AGTTA-TTGAA 550 

GAACCACCTG GAGCTCGTAT 600 

TGAATATTTC CGTGATGTAG 650 

ACATCTTCCG TTTCACTCAA 700 

CGTATGCCAT CTGCCGTGGG 750 

TCAATTACAA GAACGTATCA 800 

830 



20 2) INFORMATION FOR SEQ ID NO: 1855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1855 



TCTTTACCAG ATATCAATAA TGCGCTTATT GTCTATAAAA ATGATGAACA 50 

AAAAAGTAAA ATCGTGCTTG AAGCTGCTTT AGAGCTAGGA GATGGCATTA 100 

TTCGTACGAT TGCAATGGAA TCAACTGATG GATTGCAACG TGGAATGGAA 150 

GTTTTCGATA CAGGTAAGCC AATTTCAGTA CCAGTAGGTC GTGAAACATT 200 

40 AGGTCGTGTA TTTAATGTTT TAGGTGATAC CATTGATACG CAAGAAGCTT 250 

TTCCTGCTGA TGCGAATCGT GATGCGATTC ATAAATCAGC TCCAGCTTTT 300 

GAAGAATTAA GTACAAGTAC TGAAATCCTA GAAACAGGGA TTAAAGTTAT 350 

CGACTTACTA GCACCATACT TAAAAGGTGG GAAAGTTGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACC GTATTAATTC AAGAATTAAT TCATAATATC 450 

45 GCCCAAGAAC ATGGGGGTAT TTCAGTATTT ACCGGTGTTG GTGAACGTAC 500 

ACGTGAAGGA AATGACTTGT ATCACGAAAT GCGTGATTCA GGCGTTATCG 550 

AAAAAACTGC TATGGTGTTT GGGCAAATGA ACGAACCACC TGGAGCACGT 600 

ATGCGTGTTG CGCTAACTGG ACTAACTATT GCGGAATACT TCCGTGATGT 650 

TGAAGGCCAA GACGTATTGC TATTTATTGA TAATATCTTC CGTTTTACTC 700 

50 AAGCAGGTTC TGAAGTTTCT GCCTTACTTG GTCGTATGCC TTCTGCGGTA 750 

GGTTATCAAC CTACTTTGGC TACTGAAATG GGTCAATTGC AAGAACGGAT 800 

TACATCAACG AAGAAAGGTT CGA 823 



971 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



2) INFORMATION FOR SEQ ID NO: 1856 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii). MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1856 

TTACCAGACA TTAATAATGC CTTGGTTGTC TATAAAAATG ACGAACAAAA 50 

AACCAAGATT GTATTAGAAG CTGCCTTAGA ACTAGGAGAT GGTGTGATTC 100 

GAACTATCGC CATGGAATCT ACTGATGGCT TACAACGGGG AATGGAAGTT 150 

20 GTCGATACTG GCAGTTCCAT TTCTGTACCG GTAGGAAAAG AAACATTGGG 200 

TCGTGTATTT AACGTTTTAG GAAATACAAT TGACTTAGAA GAACCTTTTC 250 

CAGCGGATGC TAAACGTAGT GGTATCCATA AAAAAGCGCC TGATTTTGAT 300 

GAATTAAGCA CTAGTACAGA AATTTTAGAA ACAGGGATTA AAGTTATTGA 350 

CCTATTAGCC CCTTATTTAA AAGGTGGTAA AGTCGGATTA TTCGGTGGTG 4 00 

25 CCGGAGTTGG TAAAACCGTT TTAATTCAAG AATTAATTCA TAATATTGCC 450 

CAAGAACATG GTGGGATTTC TGTTTTTACT GGTGTTGGTG AAAGAACACG 500 

TGAAGGTAAT GACTTGTATT ATGAAATGAA AGAATCTGGC GTTATCGAAA 550 

AAACTGCCAT GGTATTTGGT CAAATGAATG AGCCACCTGG TGCCCGGATG 600 

CGGGTTGCTT TAACCGGACT TACCATTGCG GAATACTTCC GGGACGTTGA 650 

30 AGGACAAGAT GTATTGCTCT TTATCGATAA TATTTTCCGT TTTACCCAAG 700 

CTGGTTCAGA AGTATCTGCC TTATTAGGAC GGATGCCCTC TGCCGTTGGT 750 

TATCAACCAA CTTTGGCTAC TGAAATGGGA CAACTTCAAG AACGGATTAC 800 

CTCAACGAAA AAAGGTTCTA TTACAT 826 
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2) INFORMATION FOR SEQ ID NO: 1857 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1857 

TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTACAAAA AAAATAAAAC 50 
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AAAAGTTGTT CTTGAAGCTG CTTTGGAACT 

CGATCTCTAT GGAATCAACA GATGGCTTGC 

GATACAGGCA AACCAATCTC AGTTCCCGTT 

TGTGTTTAAC GTATTAGGTG AAACAATCGA 

5 AAGATGCAGT AAAAAGCGGT ATTCATAAAA 

CTTAGTACCA GTAATGAAAT TTTAGAAACA 

ATTAGCTCCT TACTTAAAGG GTGGTAAAGT 

GTGTTGGTAA AACCGTCTTG ATCCAAGAAT 

. GAACACGGTG GTATTTCAGT GTTTACGGGT 

10 AGGGAACGAC CTTTATTATG AAATGAAGGA 

CTGCCATGGT GTTTGGACAA ATGAACGAGC 

GTTGCCTTGA CTGGTTTGAC ATTGGCTGAA 

ACAAGATGTG CTGTTGTTTA TCGACAACAT 

GTTCTGAAGT TTCTGCCTTG CTTGGCCGGA 

15 CAACCAACTT TGGCAACTGA AATGGGTCAA 

AACGAAGAAG GGCT 



PCT/CA00/01150 

TGGTGATGGT GTTATCCGCA 100 

AACGTGGAAT GGAAGTTGTC 150 

GGTAAAGAAA CTTTAGGTCG 200 

CAAAGAAGCG CCTTTTCCAG 250 

AAGCGCCGGC TTTTGAAGAA 300 

GGGATCAAAG TTATCGACTT 350 

CGGACTATTT GGTGGTGCCG 400 

TGATTCATAA TATCGCCCAA 450 

GTTGGTGAAC GTACTCGTGA 500 

ATCAGGCGTT ATTGAGAAAA 550 

CGCCAGGTGC GCGTATGCGT 600 

TATTTCCGAG ATGAAGAAGG 650 

CTTCCGTTTC ACTCAAGCCG 700 

TGCCTTCAGC CGTTGGCTAC 750 

TTGCAAGAAC GAATCACTTC 800 

814 



20 2) INFORMATION FOR SEQ ID NO: 1858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1858 



CGCATTAGTT GTTTATAAAA ATGATGAGCA AAAATCAAAA GTTGTTCTTG 50 

AAGCAGCATT AGAATTAGGT GACGGTGTGA TCCGTACGAT CGCAATGGAA 100 

TCGACGGATG GACTACAACG TGGAATGGAA GTCATCGACA CAAGCAAAGC 150 

GATCTCTGTA CCAGTTGGAA CAGAAACATT AGGTCGTGTG TTCAACGTGT 200 

40 TAGGTGAAAC AATCGATTTG GAAGCACCAT TTCCAGAGGA TGCCCAAAGA 250 

AGCGAGATCC ACAAGAAAGC ACCAAATTTT GATGAATTAA GCACAAGTAC 300 

AGAGATTCTT GAAACTGGGA TCAAAGTCAT TGACTTATTA GCACCTTATT 350 

TAAAAGGTGG GAAAGTTGGA TTGTTTGGGG GTGCCGGTGT TGGTAAAACC 400 

GTACTGATCC AAGAATTGAT CCATAATATC GCCCAAGAAC ATGGGGGAAT 450 

45 CTCAGTGTTT ACCGGTGTAG GGGAACGTAC CCGTGAAGGA AACGATCTGT 500 

ATTACGAAAT GAAAGATTCA GGCGTAATCG AAAAAACAGC GATGGTGTTT 550 

GGACAAATGA ATGAGCCACC AGGTGCTCGT ATGCGTGTCG CACTAACTGG 600 

ATTGACGATT GCGGAATATT TCCGTGATGT CGAAGGACAA GACGTGCTCT 650 

TATTTATTGA TAATATTTTC CGTTTCACCC AAGCAGGTTC AGAAGTATCT 700 

50 GCCTTACTAG GACGTATGCC ATCAGCGGTT GGTTATCAAC CAACCTTAGC 750 

GACTGAAATG GGACAACTCC AAGAACGGAT CACTTCAACG A 791 
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2) INFORMATION FOR SEQ ID NO: 1859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10" _ _ _ 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1859 



TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTATAAAA AAGATAAAAC 50 

AAAAGTTGTT CTTGAAGCTG CTTTGGAACT TGGTGATGGT GTTATTCGCA 100 

CAATCGCCAT- GGAATCAACG GATGGATTAC AACGTGGAAT GGAAGTTGTC 150 

20 GATACTGGCA AGCCTATTTC TGTTCCAGTA GGAAAAGAAA CTCTAGGTCG 200 

TGTATTTAAT GTATTAGGTG AAACAATCGA CAAGGAAGCG CCTTTTCCAG 250 

AAGATGCAGA AAAAAGTGGT ATTCACAAGA AAGCACCAAC TTTCGAAGAA 300 

CTTAGCACAA GTAATGAGAT CTTAGAAACA GGAATCAAAG TTATTGACTT 350 

GTTAGCTCCT TACTTAAAAG GTGGTAAAGT TGGATTATTT GGTGGTGCCG 400 / 

25 GTGTTGGTAA AACAGTCTTG ATTCAAGAGC TAATTCATAA TATCGCTCAA 450 

GAACATGGTG GTATTTCTGT GTTTACTGGT GTTGGTGAAC GTACTCGTGA 500 

AGGGAACGAC CTTTATTATG AAATGAAAGA TTCTGGTGTT ATTGAGAAAA 550 

CTGCTATGGT GTTCGGTCAA ATGAACGAGC CGCCAGGTGC ACGTATGCGT 600 

GTTGCCTTAA CTGGTTTAAC CTTAGCCGAA TACTTCCGTG ATGAAGAAGG 650 

3 0 ACAAGATGTG TTGCTATTTA TTGACAACAT TTTCCGTTTC ACTCAAGCCG 700 

GATCAGAAGT TTCTGCCTTA CTTGGCCGTA TGCCGTCAGC AGTTGGTTAC 750 

CAACCGACTT TAGCAACTGA AATGGGTCAA TTACAAGAAC GTATTACGTC 800 

GACGAAAAAA GGTTCAA 817 
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2) INFORMATION FOR SEQ ID NO: 1860 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 

(B) STRAIN: ATCC 51173 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1860 



CCTGACATTC ATAATGCATT AATTGTAACG AACGCTGATA TGGCGGATGT 50 
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AATGCAAGAA AATATTTCGG ATGAAGAAAA 
CACTGGATTT AGGTCATGGA ATGGTCCGGA 
GATGGTTTGG AACGCGGCAT GACAGTTGTG 
AGTGCCAGTA GGCGAAGCCA CTTTAGGTAG 
5 AGACAATTGA TGAACTAGAA CCGGTTGGCG 
ATTCATCGTG AAGCCCCTAA ATATGAGGAC 
TTTAGAAACC GGAATTAAGG TCATCGATTT 
GGGGAAAAAT CGGTTTATTC GGTGGTGCCG 
ATTCAAGAAT TAATTCATAA TATTGCAGAA 

10 -TTTCACTGGG -GTTGGAGAAC- GTACCCGTGA 
AAATGCGAGA GTCAGGTGTA AGCAAGAAGA 
ATGAATGAAC CACCTGGAGC ACGTATGCGT 
AATGGCGGAA TATTTCCGTG ACGAATTGAA 
TTGATAATAT TTATCGTTTT ACTCAAGCAG 

15 TTAGGTCGTA TGCCTTCAGC AGTAGGGTAT 
AATGGGACAA ATGCAAGAAC GTATTACGTC 
CA 



ATTATTAACC TTAGAAGTTG 100 

CAATTGCGAT GGAATCAACC 150 

GATTATTTAA CACCGATTAA 200 

AGTATTCAAT GTTTTAGGTG 250 

ACGATGTTGA ACTCAAAAGT 300 

TTAGATAATA GTTTTCATGT 350 

ATTAGCTCCT TATATTAAAG 400 

GAGTGGGTAA AACGGTCTTA 4 50 

CAATTAGGAG GTATCTCAGT 500 

AGGGAATGAC • CTCGTTTTTG .550 

CGGCCATGGT TTTCGGTCAA 600 

GTTGTCTTAA CAGGACTTAC 650 

ACAAGACGTC TTATTATTTA 700 

GTTCCGAAGT GTCAGCCTTA 750 

CAACCAACTT TAGCAAGTGA 800 

WACGAAGCRC GGTTCCATTA 850 

852 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1861 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1861 

GCGCGACTCT TCCTGAGATT AATAACGCAC TCATCGTTTA CAAAGATGTA 50 

GACGGCGTTA AAACTAAAAT CGTCCTTGAA GTGGCGTTGG AACTTGGTGA 100 

TGGTGCCGTA CGTACCATCG CTATGGAATC AACTGATGGC TTGACACGTG 150 

40 GACTTGAAGT TCTCGATACA GGTAAAGCAA TCAGCGTACC TGTTGGTCAA 200 

GAAACACTTG GACGTGTCTT CAATGTACTT GGAGATGCTA TTGATGGAGG 250 

GGAAGCATTT GCTGAAAATG CAGAACGCAG CCCTATCCAT AAAAAAGCCC 300 

CATCTTTTGA TGAACTTTCA ACAGCAAATG AAATTCTGGT GACAGGGATT 350 

AAAGTTATTG ACTTGCTTGC CCCATACCTT AAAGGTGGTA AGATTGGGTT 400 

45 GTTCGGTGGT GCCGGAGTTG GTAAAACCGT CCTTATCCAA GAGTTGATTC 450 

ACAATATTGC CCAAGAACAC GGTGGTATTT CCGTATTTAC TGGTGTTGGG 500 

GAACGTACAC GTGAAGGGAA TGACCTTTAC TGGGAAATGA AAGAATCAGG 550 

CGTTATCGAA AAAACAGCCA TGGTCTTCGG TCAAATGAAT GAACCACCTG 600 

GAGCACGTAT GCGTGTTGCT CTTACTGGTT TGACAATTGC TGAATATTTC 650 

50 CGTGATGTAG AAAAACAAGA CGTTTTGCTT TTCATTGATA ATATCTTCCG 700 

TTTCACCCAA GCCGGTTCAG AAGTATCTGC CCTCTTAGGA CGTATGCCAT 750 

CAGCCGTTGG TTACCAACCT ACGCTTGCAA CTGAAATGGG TCAACTTCAA 800 

GAACGTATCA CTTCAACAAA ACAAGGTT 828 
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2) INFORMATION FOR SEQ ID NO: 1862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(B) STRAIN: ATCC 11454 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1862 

AATTGCCTGA RATTAACAAT GCCTTGATTG TCTACAAAGA TGTCAATGGC 50 

CTAAAAACAA AAATTACTCT TGAAGTTGCT TTGGAACTTG GTGATGGTGC 100 

AGTTCGTACA ATCGCTATGG AATCTACTGA TGGCTTGACT CGTGGACTTG 150 

20 AAGTCCTTGA TACAGGTAAA GCAGTCAGCG TTCCTGTTGG GGAAGCCACT 200 

CTTGGTCGTG TTTTTAACGT TCTTGGTGAT GTTATTGACG GTGGGGAAGA 250 

ATTTGCTGCT GATGCAGAAC GTAATCCTAT CCATAAAAAA GCTCCAACAT 300 

TTGACGAATT GTCAACTGCA AACGAAGTTC TCGTAACTGG GATTAAAGTT 350 

GTCGATTTGC TTGCACCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

25 TGGTGCCGGA GTTGGTAAAA CCGTCCTTAT TCAAGAATTG ATTCACAACA 450 

TCGCCCAAGA ACACGGAGGT ATTTCTGTGT TTACCGGTGT TGGGGAACGT 500 

ACTCGTGAAG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

TGAAAAAACT GCCATGGTCT TTGGTCAAAT GAATGAACCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCTGAATA TTTCCGTGAT 650 

30 GTTCAAGGTC AAGACGTACT GCTTTTCATT GACAACATCT TCCGTTTCAC 700 

ACAAGCTGGT TCAGAAGTTT CTGCCCTTTT GGGACGTATG CCTTCTGCCG 750 

TTGGTTACCA ACCAACACTT GCTACTGAAA TGGGGCAATT GCAAGAACGT 800 

ATCACTTCTA CTAAGAAGGG TTCTGTTA 828 



2) INFORMATION FOR SEQ ID NO: 1863 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1863 

CTTACCTGAA ATCTACAACG CCCTAGTTAT TGAATATAAA TCTGATGCAG 50 
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AAGAAGCACC AACTAGCCAA CTTACTTTAG 
GATGACGTTG TTCGTACAAT TGCAATGGCA 
AGGTATGGAA GTTATTGATA CTGGGAGCCC 
CAGTAACACT TGGTCGTGTA TTTAACGTAT 
5 GATGAGCCAC TTCCAAGCGA TATCAAGCGT 
ACCAACATTT GACCAATTAG CAACAACTAC 
TAAAAGTTGT AGACTTGCTA GCCCCATATT 
TTGTTCGGCG GAGCGGGTGT TGGTAAAACC 
TCATAATATC GCTCAAGAAC ATGGTGGTAT 

1 0 GAGAACGTAC TCGTGAAGGG "AACGATCTTT 
GGTGTAATTG AAAAAACTGC CATGGTATTC 
AGGTGCTCGT ATGCGTGTAG CTTTAACAGG 
TCCGTGATGA AGAACACCAA GATGTACTTC 
CGCTTTACTC AAGCTGGTTC AGAGGTTTCG 

15 ATCTGCAGTA GGTTATCAAC CAACTCTAGC 
AAGAACGTAT TACTTCTACT AATGT 



AAGTAGCCAT TCAATTAGGT 100 

TCAACTGATG GTGTTCAAAG 150 

AATCACAGTT CCAGTTGGTA 200 

TAGGAAACAC TATCGATTTG 250 

AATAAAATTC ACCGTGAAGC 300 

TGAAATTCTT GAAACAGGAA 350 

TAAAAGGTGG TAAAATTGGT 4 00 

GTTTTAATCC AAGAACTTAT 450 

TTCTGTGTTC GCTGGTGTTG 500 

ACTTTGAAAT GAAAGACTCT 550 

GGTCAAATGA ACGAACCACC 600 

TCTTACAATC GCTGAATATT 650 

TATTCATTGA TAATATTTTC 700 

GCTTTACTAG GTCGTATGCC 750 

TACCGAAATG GGACAATTAC 800 

825 



20 2) INFORMATION FOR SEQ ID NO: 1864 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1864 



GTCATATTCC AGAGCTTTAT GACGCTCTGG AGGTTAAGGG CGATGGTAAG 50 

CATCGTTCAG ACCTAGTTCT TGAGGTTCAA CAGCAGATTG GCGGTGGTGT 100 

GGTACGCTGC ATTGCCATGG GTTCTTCTGA CGGTTTGAGC AGAGGAATTG 150 

AGGCTGTAAA TACTGGTGCC GGTGTTAAGG TTCCAGTTGG TCGTGAGACC 200 

4 0 CTAGGACGTA TTATGAACGT TTTAGGTCAG CCTGTAGATG AGAGAGGTCC 250 

TATCGGACAG AAAGAGGATT GGGAAATTCA CCGTCCAGCT CCTACCTATG 300 

CTGAGCAGTC ATCAACTACA GAAATTCTAG AAACCGGTAT TAAGGTTATG 350 

GACCTTATCT GCCCATTTGC TAAGGGTGGT AAAGTTGGTC TGTTCGGTGG 400 

TGCCGGTGTG GGTAAGACAG TTAACATGAT GGAGCTTATC AATAACATTG 4 50 

45 CTAAGGCTCA CTCAGGTCTA TCTGTATTTA CCGGTGTTGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG CAGGAATCAA AGGTTATCGA 550 

TAAGGTATCA ATGATTTACG GTCAGATGAA CGAGCCTCCA GGGAACCGTC 600 

TACGTGTTGC TCTGACAGGT CTGACTGTTG CTGAGAAGTT CCGTGACGAA 650 

GGTCTGGATG TGCTTCTGTT CATCGATAAC ATCTATCGTT ATACACTGGC 700 

50 TGGTACAGAG GTATCTGCTC TGTTAGGCCG TATGCCTTCA GCTGTGGGTT 7 50 

ACCAGCCTAC ACTGGCTGAG GAAATGGGTG TATTACAGGA GCGTATTGCT 800 

TCAACTAAGA AAGGTTCTAT T 821 
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2) INFORMATION FOR SEQ ID NO: 1865 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

. (ii). MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1865 

TGATACTTTA CCAGATATCA ATAATGCATT AGCCGTATAT AAAAATGATG 50 

AGAACAAGAC GCGTGTTGTA TTGGAAGCTA CTTTAGAACT TGGAGATGGG 100 

GTAATTCGTG CCATTTCTAT GGGGTCTACT GACGGCTTGC AACGTGGCAT 150 

GGAAGTTGTG GATACACAAG AACCTATTTC TGTTCCGGTA GGAAATGATA 200 

CTTTAGGTCG TGTATTTAAT GTGTTAGGAG AAACAATAGA TAATCAGGAG 250 

CCATTTCCTG AAGATGCTGA AAAAAGTGGT ATTCACAAAA AAGCCCCTAG 300 

TTTTGATGAA TTAAGTACTA GTTCGGAAAT ATTAGAAACA GGGATCAAAG 350 

TGATTGATTT ATTAGAACCT TATCTAAGAG GCGGTAAAGT CGGATTGTTT 400 

GGAGGCGCCG GTGTTGGAAA AACGGTGCTA ATTCAAGAAT TGATCAATAA 450 

TGTTGCCCAA GAACACGGGG GTATTTCCGT GTTTAATGGT GTAGGTGAAC 500 

GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGCAGGA TTCAGGCGTT 550 

ATCGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC CACCAGGTGC 600 

TCGTATGCGT GTTGCTTTAA CTGGCCTAAC ACTGGCAGAA TATTTTCGAG 650 

ATGTTGAAGG TCAAGACGTA TTATTATTTA TTGATAATAT TTTCCGTTTT 700 

ACACAAGCAG GTACCGAAGT TTCCGCTTTA CTTGGTAGAA TGCCATCTGC 750 

TGTTGGCTAT CAACCCACAC TAGCAACTGA AATGGGGCAA CTGCAAGAAC 800 

GGATTACGTC AACGGATAAG GG 822 



2) INFORMATION FOR SEQ ID NO: 1866 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1866 

ATCTCCTCAG GATCKATAGG ACTTGATATA GCTCTTGGTA TAGGCGGCGT 50 
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ACCAAAAGGA AGAATAGTCG AAATTTATGG 
CAACTCTTAC TTTGCATTTA ATAGCAGAAT 
TGCGCGTTTG TAGATGCAGA GCATGCACTT 
TTTAGGCGTT GATACGGATA ACTTATATAT 
5 AGCAAGCTCT TGATATAGTA GAAACTCTAG 
CTTATAGTAA TAGATAGCGT AGCAGCTYTA 
AGGCGATATG GGAGATCAGC ACGTAGGGCT 
AAGCACTTAG AAAATTAACC GGAGTTGTCC 
GTATTTATAA ACCAAATTCG TATGAAAATC 

10 TCCTGAAACT ACTACTGGCG GAAATGCGCT 
GACTTGACGT ACGTAAAATA GCTACTTTAA 
GGAAACCGCG TAAAAGTAAA AGTAGTAAAA 
TAGACAAGCC GAATTTGATA TCATGTTTGG 
GAGAGATAAT AGATTACGGC GTAAAACTTG 

15 GCTTGGTTTA GCTATGATAA TTCAAAATTA 
AAAAGCGTTT TTAAAAGA ■ 



GCCAGAAAGC 


TCTGGTAAAA 


100 


CTCAAAAAGT 


CGGCGGAGTT 


150 


GATGTTAAAT 


ATGCTAAAAA 


200 


TTCTCAACCG 


GACTTCGGAG 


250 


CTAGAAGCGG 


CGCCGTTGAT 


300 


ACACCAAAAA 


GCGAAATAGA 


350 


GCAAGCAAGA 


CTCATGAGTC 


400 


ATAAAATGGG 


AACTACAGTT 


450 


GGCGCTATGG 


GCTATGGCAC 


500 


TAAATTTTAC" 


"GCTTCAGTTA - 


- 550 


AACAGAGCGA 


TGAGCCAATC 


600 


AACAAAGTCG 


CTCCTCCTTT 


650 


AGAAGGTATC 


AGCAAAGAAG 


700 


ATATTATCGA 


TAAAAGCGGC 


750 


GGTCAAGGCA 


GAGAAAACTC 


800 






818 



20 2) INFORMATION FOR SEQ ID NO: 1867 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1867 



TCTCCTCAGG ATCKATAGGA CTTGATATAG CTCTTGGTAT AGGCGGCGTA 50 

CCAAAAGGAA GAATAGTCGA AATTTATGGG CCAGAAAGCT CTGGTAAAAC 100 

AACTCTTACT TTGCATTTAA TAGCAGAATC TCAAAAAGTC GGCGGAGTTT 150 

GCGCGTTTGT AG AT GC AG AG CATGCACTTG ATGTTAAATA TGCTAAAAAT 200 

40 TTAGGCGTTG ATACGGATAA CTTATATATT TCTCAACCGG ACTTCGGAGA 250 

GCAAGCTCTT GATATAGTAG AAACTCTAGC TAGAAGCGGC GCCGTTGATC 300 

TTATAGTAAT AGATAGCGTA GCAGCTYTAA CACCAAAAAG CGAAATAGAA 350 

GGCGATATGG GAGATCAGCA CGTAGGGCTG CAAGCAAGAC TCATGAGTCA 400 

AGCACTTAGA AAATTAACCG GAGTTGTCCA TAAAATGGGA ACTACAGTTG 450 

45 TAT T TAT AAA CCAAATTCGT ATGAAAATCG GCGCTATGGG CTATGGCACT 500 

CCTGAAACTA CTACTGGCGG AAATGCGCTT AAATTTTACG CTTCAGTTAG 550 

ACTTGACGTA CGTAAAATAG CTACTTTAAA ACAGAGCGAT GAGCCAATCG 600 

GAAACCGCGT AAAAGTAAAA GTAGTAAAAA ACAAAGTCGC TCCTCCTTTT 650 

AGACAAGCCG AATTTGATAT CATGTTTGGA GAAGGTATCA GCAAAGAAGG 700 

50 AGAGATAATA GATTACGGCG TAAAACTTGA TATTATCGAT AAAAGCGGCG 750 

CTTGGTTTAG CTATGATAAT TCAAAATTAG GTCAAGGCAG AGAAAACTCA 800 

AAAGCGTTTT TAAA 814 
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2) INFORMATION FOR SEQ ID NO: 1868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1868 

GATAGCATAG GCWCAGGTTC AGTTGGACTT GATCTTGCTT TAGGTATAGG 50 

CGGTGTTCCA AAAGGAAGAA TTATAGAAAT TTATGGGCCT GAAAGTTCAG 100 

GTAAAACTAC TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT 150 

GGGGTTTGTG CTTTTATCGA TGCAGAACAT GCACTTGATG TGAAATATGC 200 

TAAAAATTTA GGTGTAAATA CAGATGATTT GTATGTTTCT CAACCTGATT 250 

TTGGAGAGCA AGCCTTAGAA ATTGTAGAAA CTATAGCWAG AAGTGGTGCA 300 

GTAGATCTTA TWGTAGTAGA TAGCGTTGCA GCWCTTACCC CAAAAGCAGA 350 

AATTGAAGGC GATATGGGCG ATCARCATGT AGGACTTCAA GCAAGACTTA 400 

TGTCTCAAGC TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC 450 

ACAGTAATTT TCATCAACCA AATTCGTATG AAAATCGGTG CTATGGGTTA 500 

TGGTACTCCT GAAACCACAA CAGGTGGAAA TGCATTAAAA TTTTATGCTT 550 

CTGTGCGTTT AGATGTTAGA AAAGTAGCAA CCTTAAAMCA AAACGWAGAM 600 

CCTATAGGAA ACCGCGTTAA AGTAAAAGTA GTTAAAAATA AAGTTGCTCC 650 

TCCATTCAGM CAAGCTGAAT TTGATGTGAT GTTTGGAGAG GGTTTAAGCC 700 

GTGAAGGTGA ATTGATCGAT TATGGTGTAA AACTTGATAT CGTAGATAAA 750 

AGTGGTGCGT GGTTTTCTTA TAAAGATAAA AAACTTGGAC AAGGTAGAGA 800 

AAATTCAAAA GCTTTCTTAA AAGA 824 



2) INFORMATION FOR SEQ ID NO: 1869 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1869 

AGAAAGTTCT GGTAAAACAA CGGTTGCACT GCATGCGATT GCAGAAGTTC 50 
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AAAAACATGG CGGGACGGCA GCCTTTATTG 

CCTCAATACG CACAACGTCT AGGTGTAAAC 

ACAACCAGAT ACTGGGGAAC AAGGCTTAGA 

CAAGTGGCGC AGTCGATATT ATCGTTATTG 

5 CCCCGTGCTG AAATCGATGG CGAGATGGGT 

GGCTCGTTTG ATGTCACAAG CATTGCGCAA 

AAACAAAGAC TATCGCCGTC TTTATTAACC 



PCT/CA00/01150 

ATGCCGAGCA CGCGTTGGAC 100 

ATTGATGAAT TGCTGCTATC 150 

AATTGCAGAT GCTTTAGTTT 200 

ACTCGGTGGC CGCGCTAGTC 250 

GATGCGCACG TTGGTCTGCA 300 

GCTGTCAGGC TCTATCAACA 350 

AAATTCGT 388 



10 



20 



15 



2) INFORMATION FOR SEQ ID NO: 1870 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1870 

TGAAAGTTCA GGTAAAACAA CAGTTGCACT ACACGCTATT GCAGAAGTAC 50 

AAAAAAATGG CGGAACGGCC GCTTTCATTG ATGCTGAGCA TGCGTTAGAT 100 

CCGCAATATG CACAAAAATT AGGTGTGAAT ATCGATGAAC TACTTCTTTC 150 

30 ACAGCCTGAC ACAGGAGAAC AAGGTCTAGA GATCGCTGAT GCTTTAGTAT 200 

CAAGTGGGGC TGTAGATATC GTAGTAGTCG ATTCAGTTGC TGCTTTAGTT 250 

CCACGAGCAG AAATCGACGG CGAAATGGGT GACTCACATG TCGGGTTACA 300 

AGCACGTTTG ATGTCTCAAG CATTGCGTAA ACTCTCTGGT TCGATCAACA 350 

AAACAAAAAC AATCGCTATT TTCATCAACC AAATCCGT 388 

35 



2) INFORMATION FOR SEQ ID NO: 1871 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 
50 (B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1871 
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AGAGAGTTCC GGTAAAACAA CTGTTGCGCT 
AAGCACAAGG CGGAACAGCA GCATTTATCG 
CCGGCTTATG CTAAAAACCT AGGTGTAAAT 
TCAACCAGAT ACAGGAGAAC AAGCTTTAGA 
GAAGTGGTGC AGTTGATATG TTAGTAATTG 
CCACGTGCTG AAATCGAAGG CGAGATGGGC 
AGCACGTTTA ATGTCCCAAG CATTGCGTAA 
AATCAAAAAC CATTGCTATT TTCATTAACC 



TCATGCAATT GCGGAAGTAC 50 

ATGCTGAGCA TGCGTTGGAT 100 

ATTGATGAAT TATTACTATC 150 

GATTGCTGAA GCTTTAGTTA 200 

ACTCCGTTGC AGCACTTGTA 250 

GATGCTCATG TTGGATTACA 300 

ACTTTCTGGT GTTATTAATA 350 

AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1872 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1872 



AGAGTCATCT GGTAAGACAA CGGTTGCCCT 
AAAAAGAAGG TGGTATTGCT GCCTTTATCG 
CCAGCTTATG CTGCGGCCCT TGGTGTCAAC 
ACAACCAGAC TCAGGAGAGC AAGGTCTTGA 
ACTCAGGAGC CGTGGATCTT GTCGTAGTCG 
CCTCGTGCGG AAATTGATGG AGATATCGGT 
GGCTCGTATG ATGAGCCAGG CTATGCGTAA 
AAACCAAAAC AATTGCCATC TTTATCAACC 



TCATGCAGTT GCGCAAGCAC 50 

ATGCGGAACA TGCCCTTGAT 100 

ATTGACGAAT TGCTCTTGTC 150 

GATTGCAGGA AAATTGATTG 200 

ACTCAGTTGC GGCCCTTGTC 250 

GATAGCCACG TTGGTTTGCA 300 

ACTTGGTGCT TCTATCAATA 350 

AATTGCGT 388 



2) INFORMATION FOR SEQ ID NO: 1873 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1873 
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GAACATGACG CCGACTTTTT CACGCAATTG 
TTTTGGTTTT ATTGATAGAA GCTCCGAGTT 
ATTCGAGCCT GCAAACCAAC GTGACTGTCT 
5 CGCACGAGGT ACAAGGGCCG CAACTGAGTC 
CACCTGAGTC AATCAATTTT CCAGCAATTT 
TCTGGTTGTG ACAAGAGCAA TTCGTCAATA 
ATAGGCTGGG TCAAGAGCAT GTTCCGCATC 
CTTCTTTCTG.TGCTTGCGCA ACAGCGTGAA 
LO GATGATTCTG GCGCRTACAY TTCGATGATA 



GTTGATAAAG ATGGCAATTG 50 

TACGCATGGC CTGGCTCATC 100 

CCAATATCCC CATCAATTTC 150 

GATAACGACA AGGTCAACTG 200 

CAAGACCTTG TTCACCTGAG 250 

TTCACACCAA GGGCTGCAGC 300 

GATAAAGGCT GCAATACCAC 350 

GGGCAACCGT TGTCTTACCA 400 

430 



2) INFORMATION FOR SEQ ID NO: 1874 



L5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 947 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1874 

JO TTGTCGTTGT TGCTGCCTCC GACGGTCAGA TGTAGGTGGA ACATCTTGGG 50 

AAATACGTCG TAAAACACGT CGCTTACGTT TTCGCGAATA GGCCCCAGAC 100 

TCGTGAGCAT TTGCTGCTCG CCCGCCAGGT TGGTGTCCAG AAGATCGTTG 150 

TCTTCGTCAA CAAAATCGAT GCTATTGATG ATCCGGAGAT GCTGGAACTG 200 

GTCGAACTCG AGATGCGTGA GCTGCTGAAC AGCTACGGTT TCGAGGGTGA 250 

)5 AGAGACTCCG ATCATTTTCG GTTCCGCTCT CTGTGCTCTC GAAGGACGCC 300 

GTGACGACAT CGGTAAAGAC AGAATTGAGC AGCTTATGAA CGCTGTCGAC 350 

ACCTGGATCC CCACTCCTCA GCGTGACCTC GACAAACCTT TCTTGATGTC 400 

TGTCGAGGAA GTGTTCTCTA TCGCCGGCCG TGGTACCGTG GCTTCTGGTC 4 50 

GTGTCGAGCG TGGTATCTTG AAGAAGGACT CTGAGGTTGA GATTGTTGGA 500 

10 GGCTCCTTCG AACCCAAGAA GACCAAAGTC ACCGACATTG AAACCTTCAA 550 

GAAGAGCTGT GATGAATCGC GTGCTGGTGA CAACTCTGGT CTCCTCCTGC 600 

GTGGTATCCG ACGTGAAGAC GTCAAGCGTG GTATGGTCAT TGCTGTTCCC 650 

GGCAGCACCA AGGCTCACGA CAAGTTCCTC GTCTCCATGT ACGTCCTGAC 700 

CGAGGCGGAG GGTGGTCGTC GTACTGGCTT CGGTGCCAAC TACCGTCCCC 750 

15. AAGTCTTCAT CCGTACTGCA GGTAAGTTCC CGCACACCGT GTCCAGATCT 800 

TCCGAGAGAT TAGCGATATA TGCTAATGAT TCATCAGACG AGGCTGCTGA 850 

CCTCAGCTTC CCTGACGGCG ACCAATCTCG CAGAGTTATG CCTGGTGACA 900 

ACGTCGAGAT GATCCTGAAG ACCCACCACC CTGTTGCTGC TGAGGCT 947 



50 



2) INFORMATION FOR SEQ ID NO: 1875 
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- - 10 



15 



20 



25 



30 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) - ORGANISM: Aspergillus- versicolor- 

(B) STRAIN: WSA-175 



(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1875 



GCTGCTTCCG 
TGACAGAATG 
GCACTTGCTG 
TCAACAAGGT 
CTGGAAATGC 
CCCTATCATC 
ATATCGGTAC 
ATCCCTACCC 
GGAGGTCTTC 
AGCGTGGTCT 
CAGGTCATGA 
TGACGAATCC 
GCCGTGAGGA 
AAGGCCCACA 
AGGTGGCCGT 
TTCGCACTGC 
AGAATACCTC 
GATGGCGACA 
CCTCAACCTC 



ATGGTCAAAT 
TCGCCTAACA 
CTTGCCCGCC 
TGATGCCGTC 
GTGAGCTTCT 
TTCGGTTCCG 
TGAGCGAATT 
CTCAGCGTGA 
TCCATTGCCG 
CCTTAAGAAG 
AGACCAAGGT 
CGTGCTGGTG 
TGTCAAGCGC 
AGAAGTTCAT 
CGCAGTGGCT 
TGGTAAGTTT 
TAACTTGTAT 
TGAGCCGCCG 
AACAACCCTG 



GTACGTCAAC 
GCACGCGTGA 
AGGTCGGTGT 
GATGACCCTG 
CAGCACTTAC 
CCCTATGCGC 
GACAGCCTTC 
CCTGGACAAG 
GTCGTGGTAC 
GACAGCGAGG 
CACTGACATT 
ACAACTCCGG 
GGTATGGTTA 
GGTCTCCATG 
TCGGTGTCAA 
TCAAAACATT 
TTAGACGAGG 
TGTCATGCCT 
TCG 



CTTATATACA 
ACTAGGCCCC 
CCAGAAGATT 
AGATGTTGGA 
GGCTTCGAAG 
CCTCGAGGGT 
TTGAGGCCGT 
CCTTTCCTGA 
CGTTGCCTCT 
TCGAGATTCT 
GAGACATTCA 
TCTTCTTCTC 
TTGCTGCTCC 
TACGTCCTCA 
CTATCGTCCC 
CGACCTCTCG 
CTTGCGACCT 
GGTGACAACG 



CCCTCTGATA 
AAACCCGTGA 
GTTGTGTTCG 
ACTTGTTGAG 
GCGAGGAGAC 
CGCCGCCCCG 
TGACACCTGG 
TGTCTGTCGA 
GGCCGTGTTG 
CGGAGGTGGT 
AGAAGCACTG 
CGTGGTATCC 
CGCCTCTATC 
CTGAGGCAGA 
CAGGCTTACA 
CCTTAGAAGA 
TTCTTTCCCC 
TGGAAATGAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
923 



2) INFORMATION FOR SEQ ID NO: 1876 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24670 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1876 

CATCATTGTT GTGTCCGCCA CTGATGGTCA AATGCCTCAA ACTCGTGAGC 50 
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ATTTGTTGTT GGCTCGCCAA GTTGGTGTCC 
AACAAGGTTG ATGCCGTTGA TGATCCTGAA 
GGAAATGCGT GATTTGCTTT CCCAATACGG 
CCATTATCCA GGGTTCCGCT CTCTGCGCTC 
5 ATCGGCCGTA ACGCCATCAT GAAGTTGATG 
CCCCACCCCT GCCCGTGATT TGGACAAACC 
ATGTGTTCTC CATCTCTGGC CGTGGTACTG 
CGTGGTATGG TCACCAAGGG TACTGAAGTT 
GCACTTCAAG ACCACCTTGA CCGGTATTGA 

LO ACAAGGGTAT GGCTGGTGAC AACATGGGTT 
CGTGAGCAAG TCCGTCGTGG TATGGTTATC 
GCCACATAAG AAGTTCATGG CTCAGCTCTA 
GAGGCCGCCA CACTCCCTTC GTCAACAACT 
AGAACTGTTG ATGTTACCGC CATCCTTAAG 

L5 TGCTGATGAG AAGATGGTCA TGCCCGGAGA 
AGCTCTT 
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AACACTTGGT TGTCTTCATC 100 

ATGTTGGAGT TGGTCGAGAT 150 

TTTCCCCGGA GACAACGTCC 200 

TTGAGGACCG CAACCCCGAG 250 

GAGGCTGTTG ATAGCAGCAT 300 

TTTCCTCATG CCCGTTGAAG 350 

TTGCCACTGG ACGTGTTGAG 400 

GAAATCGTCG GTATGGGCGA 450 

AATGTTCCAC AAGGAATTGG 500 

GCTTGCTTCG TGGTGTGAAG - 550 

TGTGCCCCCG GATCCGTCAA 600 

CATTCTCACC AAGGATGAGG 650 

ACCGCCCACA AATGTTCTTC 700 

CACCCCCCTG GTACCCCCGA 750 

CAACGTTCAA CTCGAGTGCG 800 

807 



20 2) INFORMATION FOR SEQ ID NO: 1877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 

(B) STRAIN: ATCC 33236 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1877 



AGTTGTTTCT GCTGCGGATG GTCCTATGCC TCAAACTCGC GAGCATATCT 50 

TGCTTTCTCG TCAAGTAGGC GTTCCATACA TCGTAGTTTT CCTAAACAAA 100 

ACCGATATGG TCGATGATCC GGATCTTTTA GAGTTAGTTG AAGAGGAAGT 150 

TAGAGATCTT TTAAAAGAGT ATAAATTCCC TGGCGACGAA ACCCCAATCA 200 

10 TTAAGGGTTC TGCTCTTAAG GCTCTTGAGG AAGCTAAGGC CGGACAAGAC 250 

GGCGAATGGT CTGCAAAGAT TATGGAGCTT ATGGACGCGG TTGATAGCTA 300 

TATTCCAACT CCTGTTCGCG ATACTGATAA AGATTTCCTT CTTCCGATCG 350 

AAGATATTTT CTCGATTTCC GGTCGCGGTA CCGTTGTAAC CGGTAGAATC 400 

GAAAAAGGTA TCGTTAAAGT TGGTGATACT ATCGAGATCG TAGGTATTAA 450 

15 ACCTACTCAG ACTACTACCG TCACTGGCGT TGAGATGTTT AGAAAAGAGA 500 

TGGATCAAGG TGAAGCCGGC GATAATGTAG GTGTTTTATT GCGCGGTACT 550 

AAGAAAGAGG AAGTAGAGCG CGGTATGGTT TTATGCAAAC CAAAATCGAT 600 

CACTCCTCAT ACTAAATTTG AGGGCGAGGT TTATATCCTA ACTAAAGAAG 650 

AAGGCGGACG CCATACTCCA TTCTTTAATA ATTATAGACC GCAGTTTTAC 700 

50 GTTCGTACGA CAGATGTTAC CGGTTCGATT ACTCTTCCTG AAGGAACCGA 750 

GATGGTTATG CCGGGCGACA ACGTTAAAAT CACCGTTGAG CTAATCGCTC 800 

CGATCG 806 
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2) INFORMATION FOR SEQ ID NO: 1878 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33292 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1878 

GCTGCAGATG GCCCTATGCC ACAAACTAGA GAGCACATTC TTCTTTCTCG 50 

TCAAGTAGGC GTTCCATATA TTGTTGTTTT TATGAATAAA GCAGATATGG 100 

TTGATGATGC TGAACTTTTA GAGTTAGTTG AAATGGAAAT TAGAGAATTA 150 

20 TTAAGCTCTT ATGATTTCCC AGGCGATGAT ACACCTATTA TTTCTGGTTC 200 

TGCTTTAAAA GCTCTTGAAG AAGCTAAAGC TGGACAAGAT GGTGAATGGT 250 

CAGCAAAAAT TATGGATCTT ATGGCTGCAG TTGATAGCTA TATTCCAACT 300 

CCAACTCGTG ATACTGAAAA AGACTTCTTG ATGCCAATTG AAGATGTTTT 350 

CTCAATTTCA GGTCGTGGTA CTGTTGTTAC AGGTAGAATT GAAAAAGGTG 400 

25 TTGTAAAAGT AGGTGATACT ATCGAAATCG TTGGTATTAA AGATACTCAA 450 

ACAACAACTG TAACAGGTGT TGAAATGTTC AGAAAAGAAA TGGATCAAGG 500 

CGAAGCAGGA GATAACGTAG GTGTTCTTCT TCGTGGTACT AAAAAAGAAG 550 

AAGTTATCCG TGGTATGGTT CTTGCTAAAC CAAAATCAAT TACTCCACAC 600 

ACTGACTTCG AAGCTGAAGT TTATATCTTA AATAAAGATG AAGGTGGTAG 650 

30 ACATACTCCA TTCTTTAACA ACTATAGACC ACAGTTTTAT GTAAGAACAA 700 

CTGATGTTAC AGGTTCGATT AAATTAGCTG ATGGTGTTGA AATGGTTATG 750 

CCAGGTGAAA ATGTGAGAAT TACTGTAAGC TTGATCGCTC CAGTAGCACT 800 

TGAAGA 806 



2) INFORMATION FOR SEQ ID NO: 1879 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1879 

ATGTATGCAA CCGAGAGCAC TCCCGGATCT TGGTTTAAAT GGCACTAATA 50 

986 
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TAAGACAGGC CTCAAACTCG AGAGCATTTA CTTCTCGCCC GTCAGATCGG 100 

TATCCAAAAA ATCGTCGTCT TCGTGAACAA GGTTGATGCC ATCGAGGACA 150 

AAGAGATGTT GGAGCTTGTT GAATTGGAGA TGCGTGAACT CCTAACCAGC 200 

TACGGTTTCS AGGGTGAAGA AACTCCCATC ATTTTTGGCT CTGCTCTCTG 250 

5 TGCCCTCSAA GGAAGACAAC CCGAGATCGG TGTTACCAAG ATTGATGAGC 300 

TCTTGCAGGC CGTCGACACC TGGATTCCCA CTCCTCAGCG TGAGACTGAC 350 

AAGCCCTTCT TGATGTCCAT TGAGGAAGTG TTCTCTATTT CCGGACGAGG 400 

AACCGTTGTC TCCGGCCGTG TGGAGCGTGG TATCCTCAAG AAGGACTCCG 450 

AAGTTGAAAT TGTCGGCGGT TCGCCCGAGC CAATCAAAAC CAAGGTTACC 500 

LO GATATCGAAA CCTTTAAGAA" GTCTTGCGAC GAGTCTCGCG CTGGTGAT-AA 550 

CTCCGGCTTG CTCCTACGAG GCGTTAAGCG TGAAGATATT AGCCGTGGCA 600 

TGGTCGTCGC TGTACCAGGA AGTGTCAAGG CCCATACTGA ATTCTTAGTT 650 

TCGCTTTACG TCCTCACCGA AGCTGAGGGT GGGCGCAAAT CTGGATTCAG 700 

CAGCAAGTAC CGCCCACAGA TGTTCATTCG CACTGCCGGT ATGTAATACT 750 

L5 GTGATAATTT CGTTGACATG GTACTGATTG AATTCTATAG ACGAAGCGGC 800 

TCAGCTCAGC TGGCCCGGAG> AAGATCAAGA CAAGATGGCT ATGCCAGGAG 850 

ACAATATCGA AATGATTTGC ACCACCTTGC ACCCAGTTGC CGCCGA 896 



20 



2) INFORMATION FOR SEQ ID NO: 1880 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14976 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1880 



CTGGTAGTTG 
CCTGCTGGGT 
AATGTGACAT 

10 GTMCGTGACC 
CGTGCRYGGT 
CGAAGATCAT 
GAGCGTGCGA 
CATCTCTGGC 

15 TTAAAGTGGG 
TCAACCTGTA 
TGCGGGTGAG 
TCCAGCGTGG 
AAGTTCGAGT 

50 TACTCCGTTC 
ACGTGACCGG 
GGCGACAACA 



CTGCGACTGA 
CGCCAGGTTG 
GGTTGATGAT 
TGCTGTCACA 
TCTGCGCTGA 
CGAACTGGCT 
TTGACAAACC 
CGTGGTACCG 
TGAAGAAGTT 
CCGGCGTTGA 
AACTGTGGTA 
TCAGGTTCTG 
CAGAAGTTTA 
TTCAAAGGCT 
GACTATCGAA 
TTCAGATGGT 



CGGCCCAATG 
GCGTGCCATA 
GAAGAGCTGC 
GTACGACTTC 
AAGCGCTGGA 
GGCCATCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTTG 
GATGTTCCGT 
TCCTGCTGCG 
GCGAAGCCAG 
TATTCTGTCT 
ACCGTCCACA 
CTGCCAGAAG 
TGTGACCCTG 



CCTCAGACYC 


GTGAGCACAT 


50 


CATCATCGTG 


TTCCTGAACA 


100 


TGGAGCTGGT 


TGAAATGGAA 


150 


CCAGGCGACG 


ACACGCCAAT 


200 


RGGCGAAGCA 


GAGTGGGAAG 


250 


ATAACTACAT 


CCCGGAACCA 


300 


CCAATTGAAG 


ACGTGTTCTC 


350 


TCGTGTAGAG 


CGCGGTRTSG 


400 


GTATCAAAGA 


TACCGTGAAA 


450 


AAGCTGCTGG 


ACGAAGGCCG 


500 


CGGTATCAAG 


CGCGAAGATA 


550 


GCACCATCAA 


GCCACACACC 


600 


AAAGACGAAG 


GCGGCCGTCA 


650 


GTTCTACTTC 


CGTACTACTG 


700 


GCGTTGAGAT 


GGTGATGCCA 


750 


ATCCACCCGA 


TCGCCATG 


798 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1881 



ATCCTGGTTG TTGCTGCGAC TGACGGYCCG ATGCCGCAGA CCCGTGAGCA 50 

CATCCTGCTG GGTCGTCAGG TAGGCGTTCC GTACATCATC GTGTTCCTGA 100 

20 ACAAATGCGA CATGGTTGAT GACGAAGAGC TGCTGGAACT GGTTGAAATG 150 

GAAGTTCGYG AACTGCTGTC TCAGTACGAC TTCCCGGGCG ACGACACTCC 200 

GATCGTTCGT GGTTCTGCTC TGAAAGCGCT GGAAGGCGAC GCAGAGTGGG 250 

AAGCGAAAAT CATCGAACTG GCTGGCTTCC TGGATTCTTA CATYCCGGAA 300 

CCAGAGCGTG CGATTGACAA GCCGTTCCTG CTGCCGATCG AAGACGTATT 350 

25 CTCCATCTCC GGTCGTGGTA CCGTTGTTAC CGGTCGTGTA GARCGCGGTA 400 

TCATCAAAGT GGGCGAAGAA GTTGAAATCG TTGGTATCAA AGAGACTCAG 450 

AAGTCTACCT GTACTGGCGT TGAAATGTTC CGCAAACTGC TGGACGAAGG 500 

CCGTGCCGGT GAGAACGTAG GTGTTCTGCT GCGTGGTATC AAACGTGAAG 550 

AAATCGAACG TGGTCAGGTA CTGGCTAAGC CGGGCACCAT CAAGCCGCAC 600 

30 ACCAAGTTCG AATCTGAAGT GTACATTCTG TCCAAAGATG AAGGCGGCCG 650 

TCATACTCCG TTCTTCAAAG GCTACCGTCC GCAGTTCTAC TTCCGTACTA 700 

CTGACGTGAC TGGCACCATC GAACTGCCGG AAGGCGTAGA GATGGTAATG 7 50 

CCGGGCGACA ACATCAAAAT GGTTGTTACC CTGATCCACC CGATCGCGAT 800 

GGACGACGGT 810 

35 



2) INFORMATION FOR SEQ ID NO: 1882 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
50 (B) STRAIN: BM10711 

(C) ACCESSION NUMBER : AF015628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1882 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ATGAATTTTT ATTTAGAGGA GTTTAACTTG TCTATTCCCG ATTCAGGTCC 50 

ATACGGTATA ACTTCATCAG AAGACGGAAA GGTATGGTTC ACACAACATA 100 

AGGCAAACAA AATCAGCAGT CTAGATCAGA GTGGTAGGAT AAAAGAATTC 150 

5 GAAGTTCCTA CCCCTGATGC TAAAGTGATG TGTTTAATTG TATCTTCACT 200 

TGGAGACATA TGGTTTACAG AGAATGGTGC AAATAAAATC GGAAAGCTCT 250 

CAAAAAAAGG TGGCTTTACA GAATATCCAT TGCCACAGCC GGATTCTGGT 300 

CCTTACGGAA TAACGGAAGG TCTAAATGGC GATATATGGT TTACCCAATT 350 

GAATGGAGAT CGTATAGGAA AGTTGACAGC TGATGGGACT ATTTATGAAT 400 

10" ATGATTTGCC AAATAAGGGA TCTTATCCTG CTTTTATTAC TTTAGGTTCG - 450 

GATAACGCAC TTTGGTTCAC GGAGAACCAA AATAATTCTA TTGGAAGGAT 500 

TACAAATACA GGGAAATTAG AAGAATATCC TCTACCAACA AATGCAGCGG 550 

CTCCAGTGGG TATCACTAGT GGTAACGATG GTGCACTCTG GTTTGTCGAA 600 

ATTATGGGCA ACAAAATAGG TCGAATCACT ACAACTGGTG AGATTAGCGA 650 

15 ATATGATATT CCAACTCCAA ACGCACGTCC ACACGCTATA ACCGCGGGGA 700 

AAAATAGCGA AATATGGTTT ACTGAATGGG GGGCAAATCA AATCGGCAGA 750 

ATTACAAACG ACAAAACAAT TCAAGAATAT CAACTTCAAA CAGAAAATGC 800 

GGAACCTCAT GGTATTACCT TTGGAAAAGA TGGATCCGTA TGGTTTGCAT 850 

TAAAATGTAA AATTGGGAAG CTGAATTTGA ACGAATGA 888 

20 



2) INFORMATION FOR SEQ ID NO: 1883 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1883 
35 AGCCGCTTGA GCAAATTAAA CTA 23 



2) INFORMATION FOR SEQ ID NO: 1884 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1884 

50 

GTATCCCGCA GATAAATCAC CAC 23 
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2) INFORMATION FOR SEQ ID NO: 1885 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1885 
AGCGAAAAAC ACCTTGCCGA C 

2) INFORMATION FOR SEQ ID NO: 1886 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1886 
GACGCCCGCG CCACCACT 

30 



2) INFORMATION FOR SEQ ID NO: 1887 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1887 
45 GACGCCCGCG ACACCACTA 



2) INFORMATION FOR SEQ ID NO: 1888 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

990 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1888 
GACGCCCGCA ACACCACTA 19 



2) INFORMATION FOR SEQ ID NO: 1889 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1889 
GTTCGCAACT GCAGCTGCTG 20 

25 

2) INFORMATION FOR SEQ ID NO: 1890 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1890 
40 TTCGCAACGG CAGCTGCTG 19 



2) INFORMATION FOR SEQ ID NO: 1891 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

991 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1891 

CCGGAGCTGC CGAICGGG 18 
2) INFORMATION FOR SEQ ID NO: 1892 



(i) SEQUENCE CHARACTERISTICS: 
10 " " " " (A) ' "LENGTH: 18 bases " ' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1892 
CGGAGCTGCC AARCGGGG 18 

20 

2) INFORMATION FOR SEQ ID NO: 1893 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1893 
35 GGAGCTGGCG ARCGGGGT 18 



40 



2) INFORMATION FOR SEQ ID NO: 1894 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1894 

50 

GACCGGAGCT AGCGARCG 18 
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2) INFORMATION FOR SEQ ID NO: 1895 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

L0 _ (ii) .MOLECULE. TYPE: DN A - _. _ . - -- - • - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1895 
CGGAGCTAGC AARCGGGGT 19 

15 

2) INFORMATION FOR SEQ ID NO: 1896 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1896 
30 GAAACGGAAC TGAATGAGGC G 21 



35 



2) INFORMATION FOR SEQ ID NO: 1897 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1897 

15 

CATTACCATG GGCGATAACA G 21 



50 2) INFORMATION FOR SEQ ID NO: 1898 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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1G 



15 



20 



25 



30 



35 



40 



45 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1898 
CCATTACCAT GAGCGATAAC AG 22 



2) INFORMATION FOR SEQ ID NO: 189? 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: 15571 

(C) ACCESSION NUMBER: AF124 984 



(xi) SEQUENCE DESCRIPTION: SEQ 

ATGCGTTATA TTCGCCTGTG TATTATCTCC 
GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 
GCCAGCTGTC GGGCCGCGTA GGCATGATAG 
CGCACGCTGA CCGCCTGGCG CGCCGATGAA 
CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 
ACGAACAGCT GGAGCGAAAG ATCCACTATC 
TACTCGCCGG TCAGCGAAAA ACACCTTGCC 
ACTCTGCGCC GCCGCCATTA CCATGAGCGA 
TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 
ATCGGCGACA ACGTCACCCG CCTTGACCGC 
GGCGCTTCCC GGCGACGCCC GCGACACCAC 
CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 
CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 
CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 
GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC 
GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 
CGAGCGAAAT CAGCAAATCG CCGGGATCGG 
GGCAACGCTA A 



ID NO: 1899 

CTGTTAGCCA CCCTGCCGCT 50 

GCAAATTAAA CTAAGCGAAA 100 

AAATGGATCT GGCCAGCGGC 150 

CGCTTTCCCA TGATGAGCAC 200 

GGCGCGGGTG GATGCCGGTG 250 

GCCAGCAGGA TCTGGTGGAC 300 

GACGGCATGA CGGTCGGCGA 350 

TAACAGCGCC GCCAATCTGC 400 

TGACTGCCTT TTTGCGCCAG 450 

TGGGAAACGG AACTGAATGA 500 

TACCCCGGCC AGCATGGCCG 550 

GTCTGAGCGC CCGTTCGCAA 600 

CGGGTCGCCG GACCGTTGAT 650 

CGCCGATAAG ACCGGAGCTG 700 

TGCTTGGCCC GAATAACAAA 750 

GATACCCCGG CGAGCATGGC 800 

CGCGGCGCTG ATCGAGCACT 850 

861 



2) INFORMATION FOR SEQ ID NO: 1900 

(i) SEQUENCE CHARACTERISTICS: 

994 
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(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 
10 " ■■ " (B) STRAIN:- SL-K-4 7 

(C) ACCESSION NUMBER: Y11069 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1900 

15 CTGTTAGCCA CCCTGCCGCT GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 50 

GCAAATTAAA CTAAGCGAAA GCCAGCTGTC GGGCCGCGTA GGCATGATAG 100 

AAATGGATCT GGCCAGCGGC CGCACGCTGA CCGCCTGGCG CGCCGATGAA 150 

CGCTTTCCCA TGATGAGCAC CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 200 

GGCGCGGGTG GATGCCGGTG ACGAACAGCT GGAGCGAAAG ATCCACTATC 250 

20 GCCAGCAGGA TCTGGTGGAC TACTCGCCGG TCAGCGAAAA ACACCTTGCC 300 

GACGGCATGA CGGTCGGCGA ACTCTGCGCC GCCGCCATTA CCATGAGCGA 350 

TAACAGCGCC GCCAATCTGC TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 400 

TGACTGCCTT TTTGCGCCAG ATCGGCGACA ACGTCACCCG CCTTGACCGC 450 

TGGGAAACGG AACTGAATGA GGCGCTTCCC GGCGACGCCC GCGCCACCAC 500 

25 TACCCCGGCC AGCATGGCCG CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 550 

GTCTGAGCGC CCGTTCGCAA CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 600 

CGGGTCGCCG GACCGTTGAT CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 650 

CGCCGATAAG ACCGGAGCTG GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC 700 

TGCTTGGCCC GAATAACAAA GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 7 50 

30 GATACCCCGG CGAGCATGGC CGAGCGAAAT 780 



35 



2) INFORMATION FOR SEQ ID NO: 1901 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: U92041 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1901 

50 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

995 
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CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 

ACGAACAGCT GGAGCGAAAG ATCCACTATC 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA 

5 TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 

ATCGGCGACA ACGTCACCCG CCTTGACCGC 

GGCGCTTCCC GGCGACGCCC GCAACACCAC 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 

10" CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG 
GGCAACGCTA A 



PCT/CA00/01150 

GGCGCGGGTG GATGCCGGTG 250 

GCCAGCAGGA TCTGGTGGAC 300 

GACGGCATGA CGGTCGGCGA 350 

TAACAGCGCC GCCAATCTGC 400 

TGACTGCCTT TTTGCGCCAG 4 50 

TGGGAAACGG AACTGAATGA 500 

TACCCCGGCC AGCATGGCCG 550 

GTCTGAGCGC CCGTTCGCAA 600 

CGGGTCGCCG GACCGTTGAT 650 

CGCCGATAAG ACCGGAGCTG -700 

TGCTTGGCCC GAATAACAAA 750 

GATACCCCGG CGAGCATGGC 800 

CGCGGCGCTG ATCGAGCACT 850 

861 



2) INFORMATION FOR SEQ ID NO: 1902 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 
30 (B) STRAIN: 803 

(C) ACCESSION NUMBER: AF164577 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1902 

35 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CAAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

40 ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

45 . GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

50 GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA A 861 
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2) INFORMATION FOR SEQ ID NO: 1903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii). MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 700603 

(C) ACCESSION NUMBER: AF132290 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1903 



ATGCGTTATT TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

20 GCCAGCTGTC GGGCAGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

2 5 ACTCTGTGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

30 CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCAAACGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGGA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

35 GGCAACGCTA ACCCGGCGGT GGCCGCGCGC GTTATCCGGC TCGTAG 896 



2) INFORMATION FOR SEQ ID NO: 1904 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Escherichia coli 

(B) STRAIN: JC2926 

(C) ACCESSION NUMBER: AF148851 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1904 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

5 GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

10 ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

15 CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTA 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

20 GGCAACGCTA A 861 



2) INFORMATION FOR SEQ ID NO: 1905 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Pseudomonas aeruginosa 

(C) ACCESSION NUMBER: AF096930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1905 

40 ATGCGTTATA TTCGCCTGTG TATTATCTCC 
GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 
GCCAGCTGTC GGGCCGCGTA GGCATGATAG 
CGCACGCTGA CCGCCTGGCG CGCCGATGAA 
CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 

45 ACGAACAGCT GGAGCGAAAG ATCCACTATC 
TACTCGCCGG TCAGCGAAAA ACACCTTGCC 
ACTCTGCGCC GCCGCCATTA CCATGAGCGA 
TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 
ATCGGCGACA ACGTCACCCG CCTTGACCGC 

50 GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC 
CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 
CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 
CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 
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CTGTTAGCCA 


CCCTGCCGCT 


50 


GCAAATTAAA 


CTAAGCGAAA 


100 


AAATGGATCT 


GGCCAGCGGC 


150 


CGCTTTCCCA 


TGATGAGCAC 


200 


GGCGCGGGTG 


GATGCCGGTG 


250 


GCCAGCAGGA 


TCTGGTGGAC 


300 


GACGGCATGA 


CGGTCGGCGA 


350 


TAACAGCGCC 


GCCAATCTAC 


400 


TGACTGCCTT 


TTTGCGCCAG 


450 


TGGGAAACGG 


AACTGAATGA 


500 


TACCCCGGCC 


AGCATGGCCG 


550 


GTCTGAGCGC 


CCGTTCGCAA 


600 


CGGGTCGCCG 


GACCGTTGAT 


650 


CGCCGATAAG 


ACCGGAGCTA 


700 
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GCAAACGGGG TGCGCGCGGG ATTGTCGCCC 
GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 
CGAGCGAAAT CAGCAAATCG CCGGGATCGG 
GGCAACGCTA A 
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TGCTTGGCCC GAATAACAAA 750 
GATACGCCGG CGAGCATGGC 800 
CGCGGCGCTG ATCGAGCACT 850 

861 



2) INFORMATION FOR SEQ ID NO: 1906 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1906 
20 CCTTATTCCC TTTTTTGCGG 20 



2) INFORMATION FOR SEQ ID NO: 1907 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1907 

35 

CACCTATCTC AGCGATCTGT CT 22 



40 2) INFORMATION FOR SEQ ID NO: 1908 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1908 

AACAGCGGTA . AGATCCTTGA GAG 23 
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2) INFORMATION FOR SEQ ID NO: 1909 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 22 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



10 



15 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1909 
ATGACTTGGT TAAGTACTCA CC 

2) INFORMATION FOR SEQ ID NO: 1910 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1910 
ATGACTTGGT TGAGTACTCA CC 

30 



2) INFORMATION FOR SEQ ID NO: 1911 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1911 
45 CCATAACCAT GGGTGATAAC AC 



2) INFORMATION FOR SEQ ID NO: 1912 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

1000 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1912 
CCATAACCAT GAGTGATAAC AC 22 



10 



2) INFORMATION FOR SEQ ID NO: 1913 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1913 
CGCCTTGATC ATTGGGAACC 20 

25 

2) INFORMATION FOR SEQ ID NO: 1914 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1914 
40 CGCCTTGATC GTTGGGAACC 20 



45 



2) INFORMATION FOR SEQ ID NO: 1915 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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CGCCTTGATA GTTGGGAACC 
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2) INFORMATION FOR SEQ ID NO: 1916 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1916 
CGTGGGTCTT GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1917 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1917 
CGTGGGTCTG GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1918 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1918 
GTGGGTCTCA CGGTATCATT G 
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2) INFORMATION FOR SEQ ID NO: 1919 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1919 
CGTGGGTCTC TCGGTATCAT T 



2) INFORMATION FOR SEQ ID NO: 1920 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1920 
CGTGGNTCTC GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1921 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1921 
CGTGGGTCTA GCGGTATCAT T 



2) INFORMATION FOR SEQ ID NO: 1922 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1922 
GTTTTCCAAT GATTAGCACT TTTA 24 



2) INFORMATION FOR SEQ ID NO: 1923 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1923 
GTTTTCCAAT GATAAGCACT TTTA 24 

25 

2) INFORMATION FOR SEQ ID NO: 1924 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1924 
40 GTTTTCCAAT GCTGAGCACT TTT 23 



2) INFORMATION FOR SEQ ID NO: 1925 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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CGTTTTCCAA TGATGAGCAC TTT 
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2) INFORMATION FOR SEQ ID NO: 1926 

(i) SEQUENCE CHARACTERISTICS: 
(A) .LENGTH:. 23_ bases _ . _ . _ .. . . 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 1 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1926 
GTTTTCCAAT GGTGAGCACT TTT 23 



2) INFORMATION FOR SEQ ID NO: 1927 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: MC9690-129 

(C) ACCESSION NUMBER: AF126482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1927 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG > 700 
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GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



750 
800 
850 
861 
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2) INFORMATION FOR SEQ ID NO: 1928 

SEQUENCE. CHARACTERISTICS:. . . 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

. (C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:, Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: HB251 

(C) ACCESSION NUMBER: X57972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1928 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG . TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCATTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 7 00 

GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 1929 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

5 (B) STRAIN: 26W 

(C) ACCESSION NUMBER: U09188 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1929 

10 - ATGAGTATTC AACATTTCCG -TGTCGCCCTT ATTCCCTTTT. TTGCGGCATT . . 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

15 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACCCGCCTT GATAGTTGGG AACCGGAGCT 500 

20 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

25 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATG 808 



30 2) INFORMATION FOR SEQ ID NO: 1930 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: AF190692 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1930 



ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

50 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 
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ACTTACTTCT 
CACAACATGG 
GAATGAAGCC 
TGGCAACAAC 
TCCCGGCAAC 
ACTTCTGCGC 
GAGCCGGTGA 
GGTAAGCCCT 
TATGGATGAA 
AGCATTGGTA 



GACAACGATC 
GGGATCATGT 
ATACCAAACG 
GTTGCGCAAA 
AATTAATAGA 
TCGGCCCTTC 
GCGTGGGTCT 
CCCGTATCGT 
CGAAATAGAC 
A ' 



GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 
AACTCGCCTT GATCGTTGGG AACCGGAGCT 
ACGAGCGTGA CACCACGATG CCTGCAGCAA 
CTATTAACTG GCGAACTACT TACTCTAGCT 
CTGGATGGAG GCGGATAAAG TTGCAGGACC 
CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GGCGGTATCA TTGCAGCACT GGGGCCAGAT 
AGTTATCTAC ACGACGGGGA GTCAGGCAAC 
AGATCGCTGA GATAGGTGCC TCACTGATTA 



450 
500 
550 
600 
650 
700 
750 
800 
850 
861 
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2) INFORMATION FOR SEQ ID NO: 1931 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2728 

(C) ACCESSION NUMBER: AF104442 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1931 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT CTCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 1932 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: AF190695 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1932 

15 ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGCT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

20 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA. AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

25 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT AGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

30 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1933 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 861 bases 
HO (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AF190693 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1933 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 
TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 
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CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGGT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

5 GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACCCGCCTT GATCGTCGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

10 TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

15 TATGGATGAA CGAGATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



20 2) INFORMATION FOR SEQ ID NO: 1934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1934 

GAACGCCAGC GCGAAATTCA AAAAG 25 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1935 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1935 
AGCTCGGCAT ACTTCGACAG G 21 

2) INFORMATION FOR SEQ ID NO: 1936 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
5 (D) TOPOLOGY-: Linear 

(ii) MOLECULE TYPE: DNA 

(x y SEQUENCE DESCRIPTION: SEQ ID NO: 

10 

TACCACCCGC ACGGC 



2) INFORMATION FOR SEQ ID NO: 1937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID 
CGGAGTCGCC GTCGATG 



30 2) INFORMATION FOR SEQ ID NO: 1938 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 bases 
35 ( B ) TYPE: Nucleic acid 

(O STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1938 
CCGCGCACCA TTGCTTCGTA CACTGAGGAG TCTCCGCGCG G 

2) INFORMATION FOR SEQ ID NO: 1939 



(i) SEQUENCE CHARACTERISTICS: 
50 ( A ) LENGTH: 4 6 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1939 
CGACCCGGAT GGTAGTATCG ATAATGATCC GCCAGCGGCC GGGTCG 



25. 



30 



10 " 2) INFORMATION FOR "SEQ ID NO: - 1940 - - 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases _ 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS: Single 

(D) TOPOLOGY:' Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1940 
GTATCGTTGG TGACGTAAT 

2) INFORMATION FOR SEQ ID NO: 1941 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi , SEQUENCE DESCRIPTION: SEQ ID NO: 1941 
GCAATGGTCC GTTTAAGT 

2) INFORMATION FOR SEQ ID NO: 1942 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1942 
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GACTGGAACA AAGCCTATAA AAAATCA 27 

5 2) INFORMATION FOR SEQ ID NO: 1943 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

L0- - (C) STRANDEDNESS:" Single - - 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1943 

. GCTGGTGGAC GGCCAG 16 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1944 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1944 
TTTCGCCGCC ATGCGTTAC 19 

2) INFORMATION FOR SEQ ID NO: 1945 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

i5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1945 
CGGCGACTAC GCGGTAT 17 

50 



2) INFORMATION FOR SEQ ID NO: 1946 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) - SEQUENCE DESCRIPTION: -SEQ ID-NO:- -1946 

CGGCGACTTC GCGGTAT 



2) INFORMATION FOR SEQ' ID NO: 1947. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1947 
CGGTATACGG CACCATCGT 



2) INFORMATION FOR SEQ ID NO: 1948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1948 
GCGGTATACA ACACCATCG 



2) INFORMATION FOR SEQ ID NO: 1949 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1949 

CGGTATACGC CACCATCGT 19 



10 



20 



25 



50 



2} INFORMATION FOR SEQ ID NO: 1950 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1950 
GGCGACATCG CCTGC 15 

2) INFORMATION FOR SEQ ID NO: 1951 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1951 
GGCGACAGAG CCTGCTA 17 

40 



2) INFORMATION FOR SEQ ID NO: 1952 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1952 
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CCTGCTATGG AGCGATGGT 19 



2) INFORMATION FOR SEQ ID NO: 1953 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
_ L 0" " (B) TYPE: ~ Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



L5 



JO 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1953 
CGCCTGCTAT AAAGCGATGG T 21 

2) INFORMATION FOR SEQ ID NO: 1954 



(i) SEQUENCE CHARACTERISTICS: 
>5 (A) LENGTH: 589 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

JO (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

55 (C) ACCESSION NUMBER: AF052258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1954 

ACACCGGTCA ACATTGAGGA AGAGCTTAAG 
10 GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 

AGCCGGTACA CCGTCGCGTA CTTTACGCCA 

TGGAACAAAG CCTATAAAAA ATCAGCCCGT 

TAAATACCAC CCGCACGGCG ACTCCGCGGT 

TGGCGCAGCC GTTCTCGCTG CGTTACATGC 
15 TTTGGTTCCA TCGACGGCGA CTCCGCCGCG 

TCGTCTGGCG AAAATCGCTC ATGAGCTGAT 

CGGTCGATTT CGTCGACAAC TATGACGGTA 

ATGCCGACCA AAATTCCTAA CCTGCTGGTG 

CGTAGGGATG GCCACCAACA TACCGCCACA 
>0 ACGGCTGTCT GGCGTATGTT GACGATGAAG 



AACTCTTATC 


TGGATTATGC 


50 


GGATGTCCGA 


GATGGCCTGA 


100 


TGAACGTATT 


GGGCAATGAC 


150 


GTCGTTGGTG 


ACGTAATCGG 


200 


ATACGACACC 


ATCGTGCGTA 


250 


TGGTGGACGG 


CCAGGGTAAC 


300 


GCGATGCGTT 


ATACCGAAAT 


350 


GGCCGATCTT 


GAAAAAGAGA 


400 


CGGAGCGTAT 


TCCGGACGTC 


450 


AACGGCGCCT 


CCGGGATCGC 


500 


TAACCTGACG 


GAAGTGATTA 


550 


ACATCAGCA 
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2) INFORMATION FOR SEQ ID NO: 1955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii). MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1955 
CGCGCAACCA TTGCTTCGTA CACTGAGGAG , TCTGCGCG 



2) INFORMATION FOR SEQ ID NO: 1956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Candida inconspicua 

(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1956 

CGGTAAAACC ACTTTAACTG CTGCAATCAC CAAAGTTTTA TCTGAAGAAG 50 

GTGGTGCAGA TTTCTTAGAT TATTCATCTA TTGATAAAGC ACCAGAAGAG 100 

AGAGCTAGAG GTATTACCAT TTCTACTGCT CATGTTGAAT ATGAAACTCC 150 

AAACAGACAT TATTCACATG TTGATTGTCC AGGTCATCAA GATTATATTA 200 

AGAACATGAT TACTGGTGCA GCTCAAATGG ATGGTGCTAT TATTGTTGTT 250 

GCAGCTACTG ATGGTCAAAT GCCACAAACT AAGGAACATT TATTATTAGC 300 

TAGACAAGTT GGTGTTCAAC ATTTAGTTGT TTTTGTTAAC AAGTGTGATA 350 

CCGTTGATGA TCCAGAAATG TTRGAATTAG TTGAAATGGA AATGAGAGAA 400 

TTATTGACCG AATATGGATT YGATGGTGAT AACACACCAG TTATTATGGG 450 

TTCTGCATTA ATGGCATTAG AAGGTAAGAG ACCAGAAGTT GGTAAAGAAT 500 

CTATTGTTAA GTTAATGGAA GCTGTTGATA CTTGGATTCC AACTCCACAA 550 

AGAGACTTAG AAAAACCATT CTTATTACCA ATTGATGAAG TTTTCTCTAT 600 

TTCTGGTAGA GGTACTGTTG TTTCTGGTAC TGTTGATAGA GGTACATTAA 650 

AGAAGGGTGA AGAAGTTGAA ATTGTTGGTG GTAAAGAAGG TGTTATTAAG 700 

ACTACTGTTA CCGGTATTGA AATGTATCAC AAGGAATTAG ATCAAGCACA 750 

AGCTGGTGAT ACTCCAGGTA TTTTGTTAAG AGGTGTTAAG AGAGATCAAA 800 

TTGCAAGAGG TCAAATTCTT GCAAAGCCAG GWTCTGTTAA GGCATACAAG 850 

AAGTTCTTAT CATCATTATA CATTTTAACA AAGGAAGAAG GTGGTAGACA 900 

TACTCCATTT TCTGAAAATT ACAGACCTCA AATGTACATT AGAACTTCCA 950 

ATGTTAATGT TACTTTGAAG TTCCCAGAAA CTGAAGAAG 989 
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2) INFORMATION FOR SEQ ID NO: 1957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



-(ii). .MOLECULE. TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: ATCC 22023 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1957 

GGTAAGACCA CCCTTACTGC CGCCATCACC AAGTGCCTTG CTGAGAAGGG 50 

AGGTGCCTCG TTCTTGGACT ACAGTGCCAT CGACAAGGCG CCAGAGGAGA 100 

GAGCAAGAGG TATCACCATC TCCACTGCGC ACGTTGAGTA TGAAACTGCC 150 

20 AACAGACACT ACTCGCACGT TGACTGTCCA GGTCACGCTG ATTACATCAA 200 

GAACATGATT ACCGGTGCTG CGCAGATGGA CGGTGCTATC ATTGTCGTTG 250 

CAGCCACTGA CGGTCAGATG CCACAGACCA GAGAACACTT GTTGCTTGCC 300 

AGACAAGTTG GTGTCCAGCA CATTGTTGTC TTCGTCAACA AGGTTGACAC 350 

CATCGACGAC CCTGAGATGC TTGAGCTTGT TGAAATGGAG ATGAGAGAGT 400 

25 TGCTTACTTC GTATGGATTT GACGGTGATA ACACCCCAGT TATCATGGGT 450 

TCTGCTTTGT GTGCTTTGGA AGGCCGTGAG CCAGAGATTG GTGCTAAGGC 500 

CATTGACAAG TTGATGGAGG CCATTGATGA GTACATCCCA ACTCCTCAGA 550 

GAGACCTGGA AAAGCCATTC YTGATGGGTG TTGAAGACGT GTTCTCGATC 600 

TCTGGTAGAG GTACCGTTGT CACRGGCCGT GTTGAGCGTG GTAACTTGAA 650 

30 GAAAGGTGAT GAAATTGAAC TTGTTGGCTA CAACAAGAAC CCAATCAAGA 700 

CCACCGTCAC CGGTATCGAA ATGTTCAAGA AGGAGTTGGA ATCTGCCATG 750 

GCTGGTGACA ACTGTGGTAT CTTGTTGCGT GGTATCAAGA GAGATGACGT 800 

CAAGAGAGGT ATGGTTGCTG CTAAGCCAGG CTCCGTCTCT GCACACACCA 850 

AGTTCCTCGC TTCCTTGTAC ATCCTGACRA AGGAGGAAGG TGGTCGTCAC 900 

35 AGTGCCTTTG CTGAGAACTA CAGACCACAG ATGTTCATCA GAACCGGAGA 950 

TGTCACCACC ATCTTGACAT GGCCAGAGGA GCACGCTGAC C 991 



40 2) INFORMATION FOR SEQ ID NO: 1958 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 
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CGGTAAGACC ACTTTGACCG CCGCCATCAC CAAGGTGTTG AGCGCCAAAG 50 

GTGGTGCTTC CTTCTTGGAC TACGGGTCCA TCGACAGAGC CCCTGAGGAG 100 

5 AGAGCCAGAG GTATTACTAT CTCGACTGCC CACGTTGAGT ACGAGACCGA 150 

TAAGAGACAC TACGCCCACG TTGATTGCCC TGGTCACGCT GATTACATCA 200 

AGAACATGAT CACTGGTGCC GCCCAAATGG ACGGTGCCAT TATTGTCGTT 250 

GCTGCTTCTG ATGGCCAAAT GCCGCAGACC AGAGAGCACT TGTTGCTTGC 300 

CAGACAGGTT GGTGTGCAGA ACTTGGTTGT GTTTGTTAAC AAGGTGGACA 350 

L0" " CCATCGACGA ' CCCCGAAATG TTGGAGTTGG TGGAGATGGA AATGAGAGAAr - 4 00 

TTGTTGACCC ACTACGGCTT TGACGGTGAC AACACCCCTG TCATCATGGG 450 

TTCGGCGTTG TGTGCCTTGG AAGACAGGCA GCCTGAGATT GGCGAGCAAG 500 

CCATCATGAA GTTGTTGGAC GCTGTCGACG t AGTACATTCC CACTCCTCAG 550 

AGAGACTTGG AGCAACCATT TTTGATGCCC GTTGAGGATG TTTTCTCCAT 600 

L5 CTCTGGCAGA GGTACTGTTG TCACCGGTCG TGTTGAGAGA GGCTCATTGA 650 

AGAAGGGTGA GGAGATTGAG ATTGTTGGCG ACTTCCCCAA GCCCTTCAAG 700 

ACTACCGTCA CCGGCATTGA GATGTTCAAG AAGGAGTTGG ATGCCGCGAT 750 

GGCGGGCGAC AACGCCGGGA TCTTGTTGAG AGGTGTCAAG AGAGACGAGG 800 

TCTCGAGAGG TATGGTTTTG GCCAAGCCCG GTACTGTCAC TTCGCACACC 850 

20 AAGGTGTTGG CGTCGCTTTA CATCTTGACC AAAGAGGAAG GTGGCCGCCA 900 

CTCGCCCTTT GGTGAGAACT ACAAGCCACA GTTATTCATC AGAACCTCCG 950 

ATGTCACTGG TACTTTGAGG TTCCCCGCCG GTGAG 985 



25 



35 



10 



2) INFORMATION FOR SEQ ID NO: 1959 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 973 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1959 



CGGTAAGACC ACCTTGACTG CCGCCATCAC CAAGGTTCTC TCCGAGAAGG 50 

GTGGTGCCGA CTTCTTGGAC TACGGTGCCA TTGACAGAGC CCCCGAGGAG 100 

45 CGTGCCCGTG GTATCACCAT CTCCACTGCC CACGTTGAGT ACGAGACTGA 150 

CAACCGTCAC TACGCCCACA TTGACTGTCC CGGTCACGCT GATTACATCA 200 

AGAACATGAT TACCGGTGCC GCCCAGATGG ACGGTGCCAT TATTGTCCTT 250 

GCTGCTACTG ACGGTGCCAT GCCCCAGACC CGCGAGCACT TGCTTCTCGC 300 

CCGTCAGGTT GGTATCCAGG AATTGGTTGT GTTTGTGAAC AAGGTTGACA 350 

50 CCATCGACGA CCCCGAGATG TTGGAGCTCG TTGAGATGGA GATCCGCGAG 4 00 

TTGTTGTCTG AGTTCGGTTT TGACGGTGAC AACACCCCCG TCATCATGGG 450 

TTCCGCTTTG TGCGCTTTGG AGGGCAAGCA GCCCGAGATT GGTGAGCAGG 500 

CTATCACCAA GTTGATGGCC GCCGTTGACG AGCACATCCC CACCCCCCAG 5 50 
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CGTGACTTGG AGCAGCCTTT 
CTCTGGCCGT GGTACCGTGG 
AGAAGGGTGA GGAGATTGAG 
GTGACTGTTA CTGGTATTGA 
GGCTGGTGAC AACGCCGGTA 
TGTCTCGTGG TATGGTTTTG 
AAGGTTTTGG CTTCGCTTTA 
GACCGGCTTC GGCTCCAACT 
ACGTCACTGG TACCCTCACC 



CTTGATGCCT GTTGAGGGTG TTTTCTCTAT 
TGACTGGTAA GGTTGCCCGT GGTGTCCTCA 
ATTGTTGGCA ACTTTGACAA GCCCTACAAG 
GATGTTCAAG AAGGAGTTGG ACCAGGCCAT 
TCTTGTTGCG TGGTGTCAAG CGTGACGAGG 
GCCAAGCCCG GCACTGTTGT CTCGCACAAG 
CATCTTGACC CAGGAGGAGG GTGGCCGTAA 
ACAAGCCCCA GTTGTTCTTG CGCACTACCG 
TTC 



600 
650 
700 
750 
800 
850 
900 
950 
973 



LO 



L5 



20 



15 



50 



15 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1960 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 28870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1960 

AAGACTACCT TGACTGCTGC AATCACCAAG GTCTTAGCTG ATCAAGGTGG 50 

TGCTGATTTC TTAGATTATG CATCTATTGA CAAGGCTCCT GAAGAAAGAG 100 

CAAGAGGTAT TACTATCTCT ACTGCTCACG TTGAGTATGA AACCCCAAAC 150 

AGACATTATT CTCATGTCGA TTGTCCTGGC CATCAAGATT ATATTAAGAA 200 

TATGATTACT GGTGCTGCAC AAATGGATGG TGCTATTATT GTTGTTGCTG 250 

CTACTGATGG TCAAATGCCA CAAACTAAGG AACATTTATT ATTAGCAAGA 300 

CAAGTTGGTG TTCAACATTT AGTTGTCTTT GTTAATAAAT GTGACACCAT 350 

TGATGACCCA GAAATGTTGG AATTAGTTGA AATGGAAATG AGAGAACTAT 400 

TGTCTGAATA TGGTTTTGAT GGTGATAACA CTCCAGTTAT TATGGGTTCT 450 

GCATTGATGG CTTTAGAAGA CAAGAGACCT GAAGTTGGTA AGGAATCTAT 500 

TTTAAAGTTA ATGGAAGCYG TTGACACATG GATTCCAACC CCAGAGAGAG 550 

ATTTAGAAAA ACCATTTTTG TTACCTATTG ATGAAGTTTT CTCAATCTCT 600 

GGTAGAGGTA CTGTCGTTTC TGGTACTGTC GAAAGAGGTA CTTTGAAGAA 650 

GGGTGAAGAA GTTGAAATTG TTGGTGGTAA GGATGGTTCT ATTAAAACTA 700 

CTGTCACAGG TATTGAAATG TATCACAAGG AATTAGACCA AGCGCAAGCA 750 

GGTGATACTC CAGGTATTTT ATTAAGAGGT GTCAAGAGAG ACCAAATCAA 800 

GAGAGGTCAA ATTTTAGCAA AGCCAGATTC CGTTAAGGCA TACAAGAAGT 850 

TCTTGGCTTC CCTTTATATC TTAACCAAGG AAGAAGGTGG TAGACATACA 900 

CCATTCTCTG AAAACTACAG ACCACAAATG TACATCAGAA CTACCAATGT 950 

TAACGTTACT TTGAAGTTCC CAGACACTGA AGAAG 985 



2) INFORMATION FOR SEQ ID NO: 1961 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1961 
GCTCAAGGCA GATGGCATTC CC 22 



15 2) INFORMATION FOR SEQ ID NO: 1962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1962 

GGACAAGGCG GTTGCGTTTG AT 22 



30 



10 



35 



2) INFORMATION FOR SEQ ID NO: 1963 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1963 
CATTCCCGTC TCGCTCGACA GT 22 

2) INFORMATION FOR SEQ ID NO: 1964 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1964 
ATCTGCCTGC CCGTCTTGC 19 



1 0 2) IN FORMAT ION " FOR "SEQ ID "NOT 1965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pGS05 

(C) ACCESSION NUMBER: M36657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1965 



ATGAATAAAT CGCTCATCAT TTTCGGCATC GTCAACATAA CCTCGGACAG 50 

TTTCTCCGAT GGAGGCCGGT ATCTGGCGCC AGACGCAGCC ATTGCGCAGG 100 

CGCGTAAGCT GATGGCCGAG GGGGCAGATG TGATCGACCT CGGTCCGGCA 150 

TCCAGCAATC CCGACGCCGC GCCTGTTTCG TCCGACACAG AAATCGCGCG 200 

30 TATCGCGCCG GTGCTGGACG CGCTCAAGGC AGATGGCATT CCCGTCTCGC 250 

TCGACAGTTA TCAACCCGCG ACGCAAGCCT ATGCCTTGTC GCGTGGTGTG 300 

GCCTATCTCA ATGATATTCG CGGTTTTCCA GACGCTGCGT TCTATCCGCA 350 

ATTGGCGAAA TCATCTGCCA AACTCGTCGT TATGCATTCG GTGCAAGACG 400 

GGCAGGCAGA TCGGCGCGAG GCACCCGCTG GCGACATCAT GGATCACATT 450 

35 GCGGCGTTCT TTGACGCGCG CATCGCGGCG CTGACGGGTG CCGGTATCAA 500 

ACGCAACCGC CTTGTCCTTG ATCCCGGCAT GGGGTTTTTT CTGGGGGCTG 550 

CTCCCGAAAC CTCGCTCTCG GTGCTGGCGC GGTTCGATGA ATTGCGGCTG 600 

CGCTTCGATT TGCCGGTGCT TCTGTCTGTT TCGCGCAAAT CCTTTCTGCG 650 

CGCGCTCACA GGCCGTGGTC CGGGGGATGT CGGGGCCGCG ACACTCGCTG 700 

10 CAGAGCTTGC CGCCGCCGCA GGTGGAGCTG ACTTCATCCG CACACACGAG 750 

CCGCGCCCCT TGCGCGACGG GCTGGCGGTA TTGGCGGCGC TGAAAGAAAC 800 

CGCAAGAATT CGTTAA 816 



15 

2) INFORMATION FOR SEQ ID NO: 1966 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1966 
5 CATGCCAGTC TTGCCAACG 19 



2) INFORMATION FOR SEQ ID NO: 1967 

to ----- " - - — - - - 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1967 

20 

CAGCAATAAG TAATCCAGCG ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1968 

GGAGAGATTT CACCGCATAG 20 



10 



50 



2) INFORMATION FOR SEQ ID NO: 1969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
i5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1969 

AGCCAACCAT CATGCTATTC CA 22 

1023 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1970 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



LO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Transposon TnlO 
(C) ACCESSION NUMBER: J01830 

L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1970 

ATGAATAGTT CGACAAAGAT CGCATTGGTA ATTACGTTAC TCGATGCCAT 50 

GGGGATTGGC CTTATCATGC CAGTCTTGCC AACGTTATTA CGTGAATTTA 100 

TTGCTTCGGA AGATATCGCT AACCACTTTG GCGTATTGCT TGCACTTTAT 150 

20 GCGTTAATGC AGGTTATCTT TGCTCCTTGG CTTGGAAAAA TGTCTGACCG 200 

ATTTGGTCGG CGCCCAGTGC TGTTGTTGTC ATTAATAGGC GCATCGCTGG 250 

ATTACTTATT GCTGGCTTTT TCAAGTGCGC TTTGGATGCT GTATTTAGGC 300 

CGTTTGCTTT CAGGGATCAC AGGAGCTACT GGGGCTGTCG CGGCATCGGT 350 

CATTGCCGAT ACCACCTCAG CTTCTCAACG CGTGAAGTGG TTCGGTTGGT 400 

15 TAGGGGCAAG TTTTGGGCTT GGTTTAATAG CGGGGCCTAT TATTGGTGGT 450 

TTTGCAGGAG AGATTTCACC GCATAGTCCC TTTTTTATCG CTGCGTTGCT .500 

AAATATTGTC ACTTTCCTTG TGGTTATGTT TTGGTTCCGT GAAACCAAAA 550 

ATACACGTGA TAATACAGAT ACCGAAGTAG GGGTTGAGAC GCAATCGAAT 600 

TCGGTATACA TCACTTTATT TAAAACGATG CCCATTTTGT TGATTATTTA 650 

JO TTTTTCAGCG CAATTGATAG GCCAAATTCC CGCAACGGTG TGGGTGCTAT 700 

TTACCGAAAA TCGTTTTGGA TGGAATAGCA TGATGGTTGG CTTTTCATTA 750 

GCGGGTCTTG GTCTTTTACA CTCAGTATTC CAAGCCTTTG TGGCAGGAAG 800 

AATAGCCACT AAATGGGGCG AAAAAACGGC AGTACTGCTC GAATTTATTG 850 

CAGATAGTAG TGCATTTGCC TTTTTAGCGT TTATATCTGA AGGTTGGTTA 900 

\5 GATTTCCCTG TTTTAATTTT ATTGGCTGGT GGTGGGATCG CTTTACCTGC 950 

ATTACAGGGA GTGATGTCTA TCCAAACAAA GAGTCATGAG CAAGGTGCTT 1000 

TACAGGGATT ATTGGTGAGC CTTACCAATG CAACCGGTGT TATTGGCCCA 1050 

TTACTGTTTA CTGTTATTTA TAATCATTCA CTACCAATTT GGGATGGCTG 1100 

GATTTGGATT ATTGGTTTAG CGTTTTACTG TATTATTATC CTGCTATCGA 1150 

10 TGACCTTCAT GTTAACCCCT CAAGCTCAGG GGAGTAAACA GGAGACAAGT 1200 

GCTTAG 1206 



15 2) INFORMATION FOR SEQ ID NO: 1971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1971 
CYGACTGYGC CATCCTYATC A 21 

5 

2) INFORMATION FOR SEQ ID NO: 1972 

(i) SEQUENCE CHARACTERISTICS: 
10 "(A) LENGTH: 21 bases" - ■- 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1972 
MGICAGCTCA TYITTGCWKS C 21 

20 

2) INFORMATION FOR SEQ ID NO: 1973 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1973 
35 RACACCRGIY TTGGWITCCT T 21 



$0 



2) INFORMATION FOR SEQ ID NO: 1974 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1974 

50 

ACAAGGGITG GRMSAAGGAG AC 22 
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2) INFORMATION FOR SEQ ID NO: 1975 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



LO 



L5 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1975 
TGRCCRGGGT GGTTRAGGAC G 21 

2) INFORMATION FOR SEQ ID NO: 1976 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1976 
GATGGAYTCY GTYAAITGGG A 21 

30 



2) INFORMATION FOR SEQ ID NO: 1977 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1977 
15 GATGGAYTCY GTYAARTGGG A 21 



2) INFORMATION FOR SEQ ID NO: 1978 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1978 
CATCITGYAA TGGYAATCTY AAT 23 



2) INFORMATION FOR SEQ ID NO: 1979 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1979 
CATCYTGYAA TGGYAASCTY AAT 23 

25 

2) INFORMATION FOR SEQ ID NO: 1980 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1980 
40 TCRATGGCIT CIAIRAGRGT YT 22 



2) INFORMATION FOR SEQ ID NO: 1981 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1981 
TGGACACCIS CAAGIGGKCY G 21 

5 

2) INFORMATION FOR SEQ ID NO: 1982 

(i) .SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH": 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1982 
TGGACACYIS CAAGIGGKCY G 21 

20 

2) INFORMATION FOR SEQ ID NO: 1983 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1983 
35 CYGAYTGCGC YATICTCATC A 21 



2) INFORMATION FOR SEQ ID NO: 1984 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1984 

50 

CYGAYTGYGC YATYCTSATC A 21 
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L5 



2) INFORMATION FOR SEQ ID NO: 1985 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi). ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Ml-106 

(C) ACCESSION NUMBER: U81804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1985 



ATGGGTAAGG ACAAGCTGCA CGTCAACGTC GTTGTTATCG GTCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

20 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 150 

AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 200 

AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCTA 250 

AGTACCAGGT TACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 300 

AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 350 

25 CACCGGTATC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCC 400 

GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 4 50 

GCTTGCAACA AGATGGACAC CTGCAAGTGG TCTGAGGACC GATTCAACGA 500 

AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGT TACAACCCCA 550 

AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 600 

30 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 650 

GTCCGGTGTT TCCAAGGGTA AGACCCTTCT CGAGGCCATC GACGCCAGTA 700 

GGCCCCCTAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 750 

GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 800 

CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 850 

35 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 900 

CTCCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 950 

CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 1000 

AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 1050 

ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 1100 

10 TTGCAAGTTC TCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 1150 

TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 1200 

AAGCTTGTTT CCCAGAAGCC TCTCTGTGTT GAGACCTACG CCGACTACCC 1250 

CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 1300 

GTGTTATTAA GAGTGTGGAG AAGTCCGATG GGAAGAGCGG CAAGGTTACC 1350 

15 AAGGCCGCCG AGAAGGCTGC TAAGAAGAAG TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1986 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 
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15 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: B3501 

(C) ACCESSION NUMBER: U81803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1986 



ATGGGTAAGG ACAAGCTGCA CGTCAACGTC t GTTGTTATCG GTCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

L5 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 150 

AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 200 

AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCCA 250 

GGTACCAGGT CACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 300 

AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 350 

2 0 CACCGGTATC GGTGAGTTCG AGGCCGGTAT CTCCAAGGAC GGTCAGACCC 4 00 

GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 450 

GCTTGCAACA AGATGGACAC CTGCAAGTGG TCCGAGGACC GATTCAACGA 500 

AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGC TACAACCCCA 550 

AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 600 

25 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 650 

GTCTGGTGTT TCCAGGGGTA AGACCCTTCT CGAGGCCATC AGCGCCAGTA 700 

GGCCCCATAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 750 

GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 800 

CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 850 

3 0 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 900 

CTTCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 950 

CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 1000 

AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 1050 

ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 1100 

3 5 CTGCAAGTTT GCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 1150 

TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 1200 

AAGCTTGTTG CCCAGAAGCC CCTCTGTGTT GAGACCTACG CCGACTACCC 1250 

CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 1300 

GTGTTATCAA GAGCGTGGAC AAGACCGAGA AGGGTGGCAA GGTCACCAAG 1350 

JO GCTGCTGAGA AGGCTGCCAA GAAGAAGTAA 1380 



2) INFORMATION FOR SEQ ID NO: 1987 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: X01638 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1987 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

10 AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG -200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

15 GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTO CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

20 GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TCGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTCGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

25 GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

30 AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 

35 



2) INFORMATION FOR SEQ ID NO: 1988 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
50 (C) ACCESSION NUMBER: M10992 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1988 
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ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

5 AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

10 GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA" GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG, GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

15 AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

20 AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

25 AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 

30 

2) INFORMATION FOR SEQ ID NO: 1989 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 

(B) STRAIN: ATCC 10895 

45 (C) ACCESSION NUMBER: X73978 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1989 

ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

50 CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 200 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 
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AGTACCACGT CACTGTCATT GACGCCCCAG GCCACAGAGA CTTCATCAAG 300 

AACATGATTA CCGGTACTTC TCAAGCTGAC TGTGCCATCT TGATCATTGC 350 

TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 400 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 450 

5 GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 

AGACTGTTCC ATTCGTTCCA ATCTCCGGCT GGAACGGTGA CAACATGATT 600 

GAGGCCACCA CCAACGCCCC ATGGTACAAG GGCTGGGAGA AGGAGACCAA 650 

GGCTGGTGCC GTCAAGGGTA AGACCTTGTT GGAGGCCATT GACGCCATTG 700 

10 AGCCACCTGT CAGACCAACT GACAAGGCAT TGAGATTGCC ATTGCAGGAT 750 

GTCTACAAGA TCGGTGGTAT TGGTACGGTT CCAGTCGGCA GAGTCGAGAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC CTTCGCCCCA TCCGGTGTCA 850 

CCACTGAAGT CAAGTCCGTC GAGATGCACC ACGAGCAATT GGAGGAGGGT 900 

GTCCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTCT CCGTCAAGGA 950 

L5 GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 

CTGCTGAGTC CTTCAACGCT ACCGTCATTG TCTTGAACCA CCCAGGTCAA 1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG CCGGTGACGC TGCCATGGTC 1200 

20 AAGTTTGTCC CATCCAAGCC AATGTGTGTT GAGGCTTTCA CCGACTACCC 1250 

ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 1300 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 

25 

2) INFORMATION FOR SEQ ID NO: 1990 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 
(C) ACCESSION NUMBER: A29820 



ao 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1990 



ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

a5 TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 200 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 

AGTACCACGT CACTGTCATT GACCCCCCAG GCCACAGAGA CTTCATCAAG 300 

AACATGATTA CCGGTACTTC TCAAGCTGAC TGTGCCATCT TGATCATTGC 350 

50 TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 400 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 450 

GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 
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WO 01/23604 



PCT/CAOO/01150 



AGACTGTTCC ATTCGTTCCA 
GAGGCCACCA CCAACGCCCC 
GGCTGGTGCC GTCAAGGGTA 
AGCCACCTGT CAGACCAACT 
GTCTACAAGA TCGGTGGTAT 
CGGTGTCATC AAGCCAGGTA 
CCACTGAAGT CAAGTCCGTC 
GTCCCAGGTG ACAACGTTGG 



ATCTCCGGCT GGAACGGTGA 
ATGGTACAAG GGCTGGGAGA 
AGACCTTGTT GGAGGCCATT 
GACAAGGCAT TGAGATTGCC 
TGGTACGGTT CCAGTCGGCA 
TGGTTGTTAC CTTCGCCCCA 
GAGATGCACC ACGAGCAATT 
TTTCAACGTC AAGAACGTCT 



CAACATGATT 
AGGAGACCAA 
GACGCCATTG 
ATTGCAGGAT 
GAGTCGAGAC 
TCCGGTGTCA 
GGAGGAGGGT 
CCGTCAAGGA 



600 
650 
700 
750 
800 
850 
900 
950 



LO 



L5 



20 



25 



30 



35 



40 



45 



GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 
CTGCTGAGTC CTTCAACGCT ACCGTCATTG TCTTGAACCA CCGAGGTCAA - -1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG , CCGGTGACGC TGCCATGGTC 1200 

AAGTTTGTCC CATCCAAGCC AATGTGTGTT GAGGCTTTCA CCGACTACCC 1250 

ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 1300 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 



2) INFORMATION FOR SEQ ID NO: 1991 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1646 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Aspergillus oryzae 

(B) STRAIN: KBN616 

(C) ACCESSION NUMBER: AB007770 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1991 

TGGGGTAAGT TTATCAACCC GTCGAGTTGT GTTGCATCTC AGATCATGGC 50 

TGACAAGTAC TTTCCTCCTT ACAGTAAGGA AGACAAGCAG CACATCAACA 100 

TCGTCGTTAT CGGCCACGTC GATTCCGGCA AGTCCACCAC CACTGGTCAC 150 

TTGATCTACA AGTGTGGTGG TATCGACCAG CGTACCATCG AGAAGTTCGA 200 

GAAGGAAGCC GCTGAGCTCG GTAAGGGTTC CTTCAAGTAC GCCTGGGTTC 250 

TTGACAAGCT CAAGTCCGAG CGTGAGCGTG GTATCACCAT CGATATCGCC 300 

CTCTGGAAGT TCCAGACCTC CAAGTATGAG GTCACCGTCA TTGGTAAGCA 350 

TTTGAGTTCC AACCTACGTT GCCCAACATT TACAGTCATC TAACAAAGTT 400 

CAATAGATGC CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT 450 

ACTTCCCAGG CTGACTGCGC TATCCTCATC ATTGCCTCCG GTACTGGTGA 500 

ATTCGAGGCT GGTATCTCCA AGGATGGTCA GACCCGTGAG CACGCTCTGC 550 

TCGCTTTCAC CCTCGGTGTC CGTCAGCTCA TCGTTGCCCT CAACAAGATG 600 

GACACCTGCA AGTGGTCTCA GGATCGTTAC AACGAAATCG TTAAGGAGAC 650 

TTCCAACTTC ATCAAGAAGG TCGGATACAA CCCCAAGAGC GTTCCTTTCG 700 

TCCCCATCTC CGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCACCAAC 750 

TGCCCCTGGT ACAAGGGCTG GGAGAAGGAG ACCAAGGCTG GCAAGTCCAC 800 
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CGGTAAGACC 
CCACCGACAA 
GGTATCGGTA 



CTTCTCGAGG 
GCCTCTCCGT 
CTGTGCCCGT 



CCATCGATGC 
CTTCCCCTCC 
CGGTCGTGTC 



CATCGAGCCC 
AGGATGTCTA 
GAGACTGGTG 



CCCGTCCGTC 
CAAGATCTCT 
TCATCAAGCC 



850 
900 
950 



10 



15 



20 



25 



30 



35 



40 



45 



TGGTATGGTC GTTACTTTCG CTCCTGCCAA CGTGACCACT GAAGTCAAGT 1000 

CCGTTGAAAT GCACCACCAG CAGCTCCAGG CCGGTAACCC CGGTGACAAC 1050 

GTTGGTTTCA ACGTCAAGAA CGTCTCCGTC AAGGAAGTCC GCCGTGGTAA 1100 

CGTTGCCGGT GACTCCAAGA ACGACCCCCC TGCTGGCTGC GATTCCTTCA 1150 

ACGCCCAGGT CATCGTCCTT AACCACCCCG GTCAGGTCGG CAACGGTTAC 1200 

GCTCCCGTCC TGGACTGCCA CACCGCTCAC ATTGCTTGCA AGTTCGCTGA 1250 

GCTCCTTGAG AAGATTGACC GCCGTACCGG TAAATCTGTT GAGGACAAGC- 1300 

CCAAGTTCAT CAAGTCTGGT GATGCTGCCA TCGTCAAGAT GATTCCCTCC 1350 

AAGCCCATGT GTGTGGAGTC TTTCACTGAC TTCCCCCCTC TTGGTCGTTT 1400 

CGCTGTCCGT GACGTAAGTT TTTCCCTCTT GACTATCTTC ACAATTTTTC 1450 

ACATATTTTC ACGCCTCGTC CCACTCTTTT TCCTCCCTTC CTCTTTGGTT 1500 

CCCCTTTTTG CCTGCAAGTT CTCTATAGCT AACATGATGT CTAGATGCGT 1550 

CAAACTGTTG CCGTCGGAGT. TATCAAGTCG GTTGAGAAGA ACACTGGCGG 1600 

TTCTGGCAAG GTCACCAAGG CCGCCCAGAA GGCTGGCAAG AAATAA 1646 



2) INFORMATION FOR SEQ ID NO: 1992 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: R106 

(C) ACCESSION NUMBER: U19723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1992 

ATGGGTAAGG AAAAGTCCCA CATCAACGTC GTCGTTATCG GCCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TCGACAAGCG TACCATCGAG AAGTTCGAGA AGGAAGCCGC CGAACTCGGC 150 

AAGGGTTCCT TCAAGTACGC CTGGGTCCTC GACAAGCTGA AGTCTGAGCG 200 

TGAGCGTGGT ATCACTATCG ATATCGCTCT GTGGAAGTTC GAGACCCCCA 250 

AGTACATGGT CACCGTCATC GATGCCCCCG GTCACCGTGA TTTCATCAAG 300 

AACATGATCA CTGGTACCTC CCAGGCTGAC TGCGCCATTC TCATCATTGC 350 

CGCCGGTACT GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGCCAGACTC 400 

GTGAGCACGC CCTTCTCGCC TACACCCTTG GTGTCAAGCA GCTCATCGTC 450 

GCTATCAACA AGATGGACAC CACCAAGTGG TCTGAGGCCC GTTACCAGGA 500 

GATCATCAAG GAGACCTCCG GTTTCATCAA GAAGGTCGGC TACAACCCCA 550 

AGCACGTTCC CTTTGTCCCC ATCTCGGGTT TCAACGGTGA CAACATGATT 600 

GAGGTTTCTT CCAACTGCCC CTGGTACAAG GGTTGGGAGA AGGAGACCAA 650 

GGCCAAGGCC ACTGGTAAGA CTCTCCTCGA GGCCATTGAC GCCATCGACC 700 

CTCCTTCGCG CCCCACCGAC AAGCCCCTCC GTCTTCCCCT CCAGGATGTC 750 

TACAAGATCG GTGGTATTGG CACGGTGCCC GTCGGCCGTG TCGAGACCGG 800 
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TACCATCAAG 
CTGAGGTCAA 
CCCGGTGACA 



GGTGGTATGG 
GTCCGTCGAG 
ACGTCGGCTT 



TCGTCACCTT 
ATGCACCACG 
CAACGTCAAG 



CGCCCCCGCT GGTGTCACCA 
AGCAGCTCTC CGAGGGTCTC 
AACGTCTCCG TCAAGGAGAT 



850 
900 
950 



10 



15 



20 



25 



30 



35 



40 



45 



CCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCC CCCAAGGGTT 1000 

GCGACTCCTT CAACGCCCAG GTCATCGTCC TCAACCACCC CGGTCAGGTC 1050 

GGTGCTGGTT ACGCACCCGT CCTCGATTGC CACACTGCCC ACATCGCCTG 1100 

CAAGTTCTCC GAGCTTGTTG AGAAGATTGA CCGCCGTACC GGCAAGTCCG 1150 

TTGAGGCCGC CCCCAAGTTC ATCAAGTCTG GTGACGCCGC CATCGTCAAG 1200 

ATGGTTCCCT CCAAGCCTAT GTGTGTTGAG GCCTTCACTG ACTACCCTCC 1250 

TCTCGGTCGT TTCGCCGTCC GTGACATGAG ACAGACCGTC - GCTGTCGGTG 1300 

TCATCAAGTC CGTCGCCAAG TCCGACAAGC AGGGTGCCGG TAAGGTTACC 1350 

AAGGCCGCTG TCAAGGCTGG CAAGAAGTAA 1380 



2) INFORMATION FOR SEQ ID NO: 1993 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: 186AS 

(C) ACCESSION NUMBER: U14100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1993 

ATGGGTAAGG AAGACAAGAC TCACATTAAC CTCGTCGTCA TCGGCCACGT 50 

CGATTCGGGC AAATCTACCA CCACTGGTCA TTTGATCTAC AAATGCGGTG .100 

GTATTGACAG CCGTACCATT GAGAAGTTCG AAAAGGAAGC CGAAGAGTTG 150 

GGCAAGAAAT CCTTCAAATA TGCGTGGGTC CTTGACAAAC TGAAGTCTGA 200 

GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAA TTCGAGACTC 250 

CGAAGTACAG TGTCACTGTC ATTGATGCTC CCGGCCATCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GACTGCGCTA TCCTCATCAT 350 

TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CTCGTGAGCA CGCTCTGCTT GCTTTCACCC TTGGTGTGAG GCAACTCATC 450 

GTTGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTTCAA 500 

CGAAATCATC AAGGAGGTTT CCAACTTCAT CAAGAAGGTC GGATATAACC 550 

CCAAGGCTGT TCCCTTCGTG CCAATCTCTG GTTTCGAGGG TGACAACATG 600 

ATTGAACCCT CCCCCAACTG CACATGGTAC AAGGGCTGGA ACAAGGAGAC 650 

TGCCTCTGGC AAGTCTTCTG GTAAAACCCT TCTCGATGCC ATTGACGCCA 700 

TTGAACCCCC AACCCGTCCT ACCGATAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTGTACA AAATCTCTGG TATTGGCACT GTTCCCGTCG GACGTGTTGA 800 

GACTGGTGTC ATCAAGCCCG GTATGGTCGT GACTTTCGCT CCCTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAAATGC ACCACCAACA ACTCCAGGCT 900 

GGTTACCCTG GCGACAACGT CGGCTTCAAC GTCAAGAACG TTTCAGTCAA 950 

GGAAGTCCGC CGTGGCAACG TTGCTGGCGA CTCCAAAAAT GATCCCCCTA 1000 

AGGGCTGCGA ATCCTTCAAT GCCCAGGTCA TCGTCCTTAA CCACCCCGGC 1050 
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CAGGTTGGCG CTGGTTATGC 
TGCTTGCAAG TTCTCTGAGC 
AGTCTGTTGA GAACAACCCC 
GTCAAGATGG TTCCCTCCAA 
5 TCCCCCTCTT GGACGTTTCG 
TCGGTGTCAT CAAGTCCGTC 
ACCAAGGCCG CGCAGAAGGC 



CCCAGTCCTC GACTGCCACA 
TTATTGAGAA GATCGACCGC 
AAGTTCATCA AGTCTGGTGA 
GCCCATGTGC GTGGAGCCCT 
CTGTCCGTGA CATGAGACAA 
ATCAAGTCTG ACAAGACTGC 
CACCAAGAAA TAA 



PCT/CA00/01150 

CTGCCCACAT 1100 

CGTACCGGAA 1150 

TGCTGCTATC 1200 

TCACTGACTA 1250 

ACCGTCGCTG 1300 

TGGCAAGGTC 1350 
1383 



LO 



2) INFORMATION FOR SEQ ID NO: 1994 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1383 bases 
L5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neurospora crassa 
(C) ACCESSION NUMBER: D45837 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1994 



CGATTCCGGC AAGTCTACCA CTACCGGTCA 
GTATCGACAA GCGTACCATC GAGAAGTTCG 

30 GGTAAGGGTT CCTTCAAGTA TGCCTGGGTT 
GCGTGAGCGT GGTATCACCA TCGATATCGC 
CCAAGTACTA CGTCACCGTC ATCGATGCCC 
AAGAACATGA TCACTGGTAC CTCCCAGGCT 
TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG 

35 CCCGTGAGCA CGCCCTGCTC GCCTACACCC 
GTTGCCATCA ACAAGATGGA CACCACCCAG 
GGAGATCATC AAGGAGACCA AGAACTTCAT 
CCGCTGGTGT CGCTTTCGTC CCCATCTCCG 
CTTGAGCCCT CCACCAACTG CCCCTGGTAC 

10 CAAGGCCGGC AAGGCCACTG GCAAGACCCT 
TTGAGCCCCC CAAGCGTCCT ACCGACAAGC 
GATGTCTACA AGATCGGTGG TATCGGCACA 
GACTGGTGTC CTCAAGCCCG GTATGGTCGT 
TCACCACTGA AGTCAAGTCC GTCGAGATGC 

15 GGTGTCCCCG GTGACAACGT CGGCTTCAAC 
GGATATCCGC CGTGGTAACG TTGCCGGTGA 
CTGGCGCCGC CTCTTTCACC GCCCAGGTCA 
CAGGTCGGTG CCCGCTACGC CCCCGTCCTC 
TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA 

50 AGGCTGTTGA GGCCTCCCCC AAGTTCATCA 
GTCAAGATGA TTCCCTCCAA GCCCATGTGC 
CCCTCCCCTC GGCCGTTTCG CCGTCCGTGA 
TCGGTGTCAT CAAGGCCGTC GACAAGTCCA 
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GTCGTCGTTA 


TCGGCCACGT 


50 


CTTGATCTAC 


AAGTGCGGTG 


100 


AGAAGGAAGC 


CGCTGAGCTC 


150 


CTTGACAAGC 


TCAAGGCCGA 


200 


CCTCTGGAAG 


TTCGAGACTC 


250 


CCGGTCATCG 


TGATTTCATC 


300 


GATTGCGCTA 


TCCTCATCAT 


350 


TATCTCCAAG 


GATGGCCAGA 


400 


TCGGTGTCAA 


GCAGCTCATT 


450 


TGGTCCCAGA 


CTCGTTTCGA 


500 


CAAGAAGGTT 


GGCTACAACC 


550 


GCTTCAACGG 


CGACAACATG 


600 


AAGGGTTGGG 


AGAAGGAGAC 


650 


CCTCGAGGCC 


ATCGACGCCA 


700 


CCCTCCGTCT 


TCCCCTCCAG 


750 


GTGCCCGTCG 


GCCGTATCGA 


800 


TACCTTCGCT 


CCTTCCAACG 


850 


ACCACGAGCA 


GCTTGCTCAG 


900 


GTGAAGAACG 


TTTCCGTCAA 


950 


CTCCAAGAAC 


GACCCCCCTG 


1000 


TCGTTCTCAA 


CCACCCCGGT 


1050 


GACTGCCACA 


CTGCCCACAT 


1100 


GATCGACCGC 


CGTACTGGTA 


1150 


AGTCTGGTGA 


TGCTGCCATC 


1200 


GTTGAGGCTT 


TCACCGACTA 


1250 


CATGCGTCAG 


ACCGTCGCCG 


1300 


CCGCTGCCGC 


TGGCAAGGTC 


1350 
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ACCAAGTCCG CTGCCAAGGC CGCCAAGAAG TAA T383 

5 2) INFORMATION FOR SEQ ID NO: 1995 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 
L0 (C) " STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora anserina 

(C) ACCESSION NUMBER: X74799 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1995 



ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

25 GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATTGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 450 

30 GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

35 TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCGCTGAG 900 

GGTGTTCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

10 GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC CTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCTCTGAGC TCCTGCAGAA GATCGACCGC CGTACTGGTA 1150 

AGGCCGTTGA GGAGAGCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

45 GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTT 1350 

ACCAAGTCCG CTGCCAAGGC TGGCAAGAAA TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1996 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

"(A)" ORGANISM:' Podospora 'curvicolla ' - 

(B) STRAIN: VLV 

(C) ACCESSION NUMBER: X96614 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1996 

ATGGGCAAGG AGGACAAGAO TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TTGATATCGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 450 

GTCGCCATCA ACAAGATGGA CACCACCAAA TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCTACTG GCAAGACCCT CCTCGAGGCC ATCGACTCCA 700 

TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTTCAG 750 

GACGTTTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCC CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTCTGAG 900 

GGTGTCCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTC 1000 

TTGGCGCCGC TTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCGCTGAGC TCCTGCAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GGAGAGCCCT AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CTGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC TGGTGGCAAG AAATAA 1386 



2) INFORMATION FOR SEQ ID NO: 1997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

1039 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 



"10 



L5 



20 



15 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sordaria macrospora 

(B) STRAIN: OOO 

(C) ACCESSION NUMBER: X96615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1997 

ATGGGTAAGG AAGACAAGGC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCCACCA CTACCGGTCA CCTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC CGCTGAGCTC 150 

GGCAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TTCTCATCAT 350 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CTCGTGAGCA. CGCTCTTCTC GCCTACACCC TCGGTGTCAA GCAGCTCATC 450 

GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGG CTCGTTTCGA 500 

GGAGATCATC AAGGAGACCA AGAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCGCCACCGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGGCCT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 650 

CAAGGCCGGC AAGTCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 700 

TTGAGCAGCC CAAGCGCCCG ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGCGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

GGTGTTCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA 950 

GGATATCCGT CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

TCGGCGCTGC CTCTTTCACC GCCCAGGTCA TCGTCCTTAA CCACCCCGGT 1050 

CAGGTCGGTG CCGGCTACGC TCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GACTTCTCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTCGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGACCC AGGCTGTCGC TGGCAAGGTC 1350 

ACCAAGTCTG CTGCCAAGGC TGCCAAGAAG TAA 1383 



10 



2) INFORMATION FOR SEQ ID NO: 1998 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Trichoderma reesei 
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(B) STRAIN: QM9414 

(C) ACCESSION NUMBER: Z23012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1998 



ATGGGTAAGG 
CGACTCCGGC 
GTATCGACAA 
GGCAAGGGTT 

10 GCGTGAGCGT 
CCAAGTACTA 
AAGAACATGA 
CGCTGCCGGT 
CCCGTGAGCA 

15 GTCGCCATCA 
GGAAATCATC 
CCAAGGCCGT 
CTCACCCCCT 
CAAGGCTGGC 

20 TCGAGCCCCC 
GACGTCTACA 
GACTGGTGTC 
TCACCACTGA 
GGCCAGCCTG 

25 GGAAATCCGC 
TGGGCGCCGC 
CAGGTCGGTG 
TGCCTGCAAG 
AGGCTACCGA 

30 GTCAAGATGA 
CCCTCCCCTG 
TCGGTGTCAT 
ACCAAGTCCG 



AGGACAAGAC 
AAGTCTACCA 
GCGTACCATT 
CCTTCAAGTA 
GGTATCACCA 
TGTCACCGTC 
TCACTGGTAC 
ACTGGTGAGT 
CGCTCTGCTC 
ACAAGATGGA 
AAGGAGACTT 
TGCTTTCGTC 
CCACCAACTG 
AAGTTCACCG 
CAAGCGTCCC 
AGATCGGTGG 
CTCAAGCCCG 
AGTCAAGTCC 
GTGACAACGT 
CGTGGCAACG 
TTCTTTCACC 
CCGGCTACGC 
TTCGCCGAGC 
GTCTGCCCCC 
TCCCCTCCAA 
GGTCGTTTCG 
CAAGGCCGTC 
CTGCCAAGGC 



TCACATCAAC 
CCACTGGTCA 
GAGAAGTTCG 
CGCGTGGGTT 
TCGACATTGC 
ATTGACGCTC 
TTCCCAGGCC 
TCGAGGCTGG 
GCCTACACCC 
CACTGCCAAC 
CCAACTTCAT 
CCCATCTCCG 
CCCCTGGTAC 
GCAAGACCCT 
ACGGACAAGC 
TATCGGAACA 
GTATGGTCGT 
GTCGAGATGC 
TGGTTTCAAC 
TTGCCGGTGA 
GCCCAGGTCA 
CCCCGTCCTC 
TCCTCGAGAA 
AAGTTCATCA 
GCCCATGTGC 
CCGTCCGTGA 
GAGAAGTCCT 
CGCCAAGAAA 



GTGGTCGTCA 
CTTGATCTAC 
AGAAGGAAGC 
CTTGACAAGC 
CCTCTGGAAG 
CCGGCCACCG 
GACTGCGCTA 
TATCTCCAAG 
TGGGTGTCAA 
TGGGCCGAGG 
CAAGAAGGTC 
GCTTCAACGG 
AAGGGCTGGG 
CCTTGAGGCC 
CCCTGCGTCT 
GTTCCCGTCG 
TACCTTCGCT 
ACCACGAGCA 
GTGAAGAACG 
CTCCAAGAAC 
TCGTCATGAA 
GACTGCCACA 
GATCGACCGC 
AGTCTGGTGA 
GTTGAGGCTT 
CATGCGCCAG 
CTGCCGCCGC 
TAA 



TCGGCCACGT 50 

CAGTGCGGTG 100 

CGCCGAACTC 150 

TCAAGGCCGA 200 

TTCGAGACTC 250 

TGACTTCATC 300 

TCCTCATCAT 350 

GATGGCCAGA 400 

GCAGCTCATC 450 

CTCGTTACCA 500 

GGCTTCAACC 550 

TGACAACATG 600 

AGAAGGAGAC 650 

ATCGACTCCA 700 

TCCCCTCCAG 750 

GCCGTATCGA 800 

CCCTCCAACG 850 

GCTCGCTGAG 900 

TTTCCGTCAA 950 

GACCCCCCCA 1000 

CCACCCCGGC 1050 

CTGCCCACAT 1100 

CGTACCGGTA 1150 

CTCCGCCATC 1200 

TCACCGACTA 1250 

ACCGTCGCTG 1300 

CGCCAAGGTC 1350 
1383 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1999 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1999 
CATGTCAAYA TTGGTACTAT TGGTCATGT 



29 



50 



2) INFORMATION FOR SEQ ID NO: 2000 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) "SEQUENCE DESCRIPTION: SEQ ID NOT 2000 
CCACCYTCIC TCAMGTTGAA RCGTT 



2) INFORMATION FOR SEQ ID NO: 2001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2001 
ACYACITTRA CIGCYGCYAT YAC 



2) INFORMATION FOR SEQ ID NO: 2002 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2002 
CCIGARGARA GAGCIMGWGG T 



2) INFORMATION FOR SEQ ID NO: 2003 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2003 
CATYTCRAIR TTGTCACCTG G 
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2) INFORMATION FOR SEQ ID NO: 2004 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(_D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: SC5314 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2004 

GCTGCCTTCG ACCGTTCTAA ACCTCATGTC AACATTGGTA CTATTGGTCA 50 

TGTTGATCAT GGTAAAACTA CATTGACTGC TGCTATCACC AAAGTTTTAG 100 

CCGAACAAGG TGGTGCCAAC TTCTTGGATT ATGGTTCTAT TGATAGAGCT 150 

CCAGAAGAAA GAGCTAGAGG TATCACTATT TCCACTGCCC ACGTTGAATA 200 

CGAAACCAAG AACAGACACT ATGCCCACGT TGATTGTCCA GGACACGCTG 250 

ATTATATCAA AAATATGATT ACTGGTGCCG CTCAAATGGA TGGTGCTATC 3 00 

ATTGTTGTTG CTGCCACTGA TGGTCAAATG CCTCAAACCA GAGAACATTT 350 

GTTATTGGCC AGACAAGTTG GTGTTCAAGA CTTGGTTGTG TTTGTCAACA 4 00 

AAGTCGATAC TATTGATGAC CCTGAAATGT TGGAATTAGT CGAAATGGAA 4 50 

ATGAGAGAAT TGTTATCCAC CTACGGTTTT GATGGTGACA ACACTCCAGT 5 00 

TATTATGGGA TCTGCTTTAA TGGCTTTGGA AGACAAGAAA CCAGAAATTG 550 

GTAAGGAAGC TATCTTGAAA TTGTTAGATG CTGTCGATGA ACACATTCCA 600 

ACTCCATCAA GAGACTTGGA ACAACCATTT TTGTTACCAG TTGAAGACGT 650 

GTTCTCCATC TCCGGTAGAG GAACTGTTGT CACTGGTAGA GTTGAAAGAG 700 

GTGTTTTGAA GAAGGGTGAA GAAATCGAAA TTGTTGGTGG TTTTGACAAA 750 

CCTTACAAGA CTACTGTTAC CGGTATTGAA ATGTTCAAAA AAGAATTAGA 800 

CTCTGCTATG GCTGGTGACA ACTGTGGTGT TTTGTTAAGA GGTGTTAAAA 850 

GAGATGAAAT CAAGAGAGGT ATGGTTTTGG CCAAACCAGG TACTGCTACT 900 

TCTCACAAGA AGTTCTTGGC TTCCTTGTAT ATTTTGACTT CCGAAGAAGG 950 

TGGTCGTTCC ACTCCATTTG GTGAAGGTTA CAAGCCTCAA TGCTTCTTCA 10 00 

GAACTAACGA TGTCACTACC ACATTTTCAT TCCCAGAAGG AGAAGGTGTT 1050 

GATCATTCTC AAATGATCAT GCCAGGTGAC AACATTGAAA TGGTTGGTGA 1100 

ATTGATCAAA TCTTGTCCAT TAGAAGTCAA CCAACGTTTC AACTTGAGAG 1150 

AAGGTGGTAA AACTGTTGGT ACTGGTTTGA TTACCAGAAT CATCGAATAA 1200 

ACAGAATGTG CACTGTGAAT AATAAAAAGA AAAGAGGTAT ATATAGGTGA 1250 

CTTTGTATTT TGTATTGAAC AATAAAATTC TGTAAATAGT AAGGGCCTCA 13 00 

GAAGTTTTGA TTTGATTTAT GCCATGTGGA CTTGTAGAGA TATCCTTCTC 13 50 

AAACTTCTTG 13 60 



2) INFORMATION FOR SEQ ID NO: 2005 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 
(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2005 

AAGCCGCATG TCAATATTGG TACTATTGGT CATGTTGACC ACGGTAAAAC " 50 

GACGTTGACG GCTGCTATTA CTAAATGCCT TTCTGATCTT GGTCAAGCTA 100 

GTTTTATGGA TTATAGTCAA ATTGACAAGG CCCCCGAGGA AAAGGCACGT 150 

GGTATTACCA TTTCATCTGC CCATGTTGAA TACGAAACTG CTAATCGTCA 200 

CTATGCCCAT GTGGATTGTC CTGGTCACGC CGATTACATT AAGAATATGA 250 

TTACTGGTGC TGCTACAATG GATGGCGCTA TCATTGTTGT TTCTGCTACC 300 

GATGGTCAAA TGCCTCAAAC TCGTGAACAT TTGCTTCTGG CTCGTCAAGT 350 

CGGTGTAAAG CAAATTGTTG TATACATCAA TAAAGTCGAT ATGGTCGAGC 400 

CTGATATGAT CGAGCTTGTC GAAATGGAAA TGCGTGAGCT ACTCTCCGAA 450 

TACGGATTTG ATGGTGACAA TACTCCAATT GTTAGCGGCA GTGCTTTATG 500 

TGCCTTAGAG GGTCGTGAGC CTGAGATTGG TCTCAATAGT ATTACTAAAT 550 

TGATGGAAGC TGTTGATAGT TATATTACTC TTCCTGAAAG AAAAACGGAT 600 

GTCCCTTTCT TGATGGCCAT CGAGGACGTT TTTTCAATTT CAGGTCGCGG 650 

AACTGTAGTC ACTGGCCGTG TCGAGCGCGG TACTTTAAAG AAGGGTGCTG 700 

AAATCGAAAT CGTCGGTTAT GGTAGCCATT TAAAGACTAC CGTTACTGGA 750 

ATTGAAATGT TCAAAAAGCA GCTTGATGCC GCCGTTGCCG GTGACAATTG 800 

TGGCCTTTTA CTTCGTTCTA TCAAGCGAGA GCAATTAAAA CGTGGAATGA 850 

TTGTCGCTCA ACCAGGAACC GTTGCTCCTC ATCAGAAATT CAAGGCATCA 900 

TTCTATATTT TGACAAAAGA GGAAGGAGGT CGTCGTACCG GTTTCGTTGA 950 

CAAGTATCGT CCCCAACTGT ACAGTCGTAC TTCCGACGTT ACTGTCGAAC 1000 

TTACCCACCC TGATCCTAAC GACTCAGACA AAATGGTTAT GCCTGGAGAC 1050 

AATGTCGAGA TGATCTGTAC GCTTATTCAC CCCATTGTCA TCGAAAAAGG 1100 

ACAACGCTTC ACAGTTCGTG AGGGTGGAAG CACTGTAGGC ACAGCTTTGG 1150 

TTACTGAACT TTTGGATTAG TGCATTTATG AACTTATTGG CTTTAAAAAT 120 0 

TTTGCATGCT GAATACCAAT ATTATGTCCC TTCTCAGAAT TCTATAACTA 1250 

CAGTGTCATT ATTGTAATAA GACTTTTGCA TCCATTGACA ATGGTATTTG 13 00 

ATACTTTTAT AGTTTCTACT ATTGTTAGCC AAAGTTATAA AA 1342 



2) INFORMATION FOR SEQ ID NO: 2006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2006 
TGGAGCCGGT GAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2007 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

- - •- (xi)- SEQUENCE DESCRIPTION: _SEQ_ID NO: 2007 

TGGAGCCAGT GAGCGTGG 1 8 



2) INFORMATION FOR SEQ ID NO: 2008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2008 
TCTGGAGCCG ATGAGCGTG 19 



2) INFORMATION FOR SEQ ID NO: 2009 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2009 
CTGGAGCCAG TAAGCGTGG 19 



2) INFORMATION FOR SEQ ID NO: 2010 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: KMK107 

(C) ACCESSION NUMBER: AF027199 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2010 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGACG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCAGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 2011 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: CLSiS L-491 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2011 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCTTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACCCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 
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TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GAGCCAGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ATGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



650 
700 
750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 2012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 bases, 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2012 
CCGCGGATTA TTAAACCGCC CTTCCGCGG-MR-HEG-ATGTCAGAGG GATAGATCCA 49 



2) INFORMATION FOR SEQ ID NO: 2013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2013 

AGCTTAAGAA CTCTTATCTG GATTACGCGA TGTCGGTCAT TGTTGGCCGT 50 

GCGCTGCCGG ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT 100 

TTACGCCATG AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT 150 

CAGCCCGTGT CGTGGGTGAC GTGATCGGTA AATATCACCC GCATGGTGAT 200 

ACTGCCGTCT ATGACACTAT CGTCCGTATG GCACAGCCAT TCTCACTGCG 250 

ATACATGCTG GTAGATGGTC AAGGTAACTT CGGTTCTGTC GATGGCGACT 300 

CCGCCGCAGC GATGCGTTAT ACGGAAATCC GTATGTCGAA AATCGCCCAT 350 

GAG 353 



2) INFORMATION FOR SEQ ID NO: 2014 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2014 



AGCTCCTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

AGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

TGAACGTACT AGGCAATGAC TGGAACAAAG, CCTATAAAAA ATCTGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAT CCCCATGGTG ACTCGGCGGT 200 

CTATGACACG ATCGTCCGCA TGGCGCAGCC ATTCTCGCTG CGTTATATGC 250 

TGGTAGACGG TCAGGGTAAC TTCGGTTCTA TCGACGGCGA CTCTGCGGCG 300 

GCAATGCGTT ATACGGAAAT CCGTCTGGCG AAAATTGCCC ATGAACTGAT 350 

GGCCG 355 



2) INFORMATION FOR SEQ ID NO: 2015 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2015 
CCAAGAAGCT CAAAAACATC TG 



2) INFORMATION FOR SEQ ID NO: 2016 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2016 
TADCCTGTCC AWACAGCCAT 



2) INFORMATION FOR SEQ ID NO: 2017 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2017 
ACTTTGAATA AGGTCGGTCT AG 



2) INFORMATION FOR SEQ ID NO: 2018 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2018 
ACACTAAACA AGGTTGGTTT AG 



2) INFORMATION FOR SEQ ID NO: 2019 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases * 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2019 
ACACTAAACA AGGTCGGTCT AG 



2) INFORMATION FOR SEQ ID NO: 2020 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2020 
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2) INFORMATION FOR SEQ ID NO: 2021 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

. _ . . (C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 2021 
GTAGCTCCAG ACGAAATGTT TG 



2) INFORMATION FOR SEQ ID NO: 2022 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2022 
GTAGCTCCAG ATGAAACGTT TG 



2) INFORMATION FOR SEQ ID NO: 2023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2023 
GTAACTCCAG ATGAAATGTT TG 



2) INFORMATION FOR SEQ ID NO: 2024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS :• Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2024 
AGTGAAAAGA TGGCTGCTGC 



2) INFORMATION FOR SEQ ID NO: 2025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2025 
AGTGAGAAAA TGGCTGCTGC 



2) INFORMATION FOR SEQ ID NO: 2026 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2026 
TCCAAGCATG CATTATGCAA ACG 



2) INFORMATION FOR SEQ ID NO: 2027 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2027 
TCGGTCTAGA TAGAGCTAAA ACG 
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2) INFORMATION FOR SEQ ID NO: 2028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(U) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2028 
TATGCTCTTC AACAATCACG 



2) INFORMATION FOR SEQ ID NO: 2029 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2029 
AGCCGTTGAG ACTTTGAATA AG 



2) INFORMATION FOR SEQ ID NO: 2030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2030 
CTTAATGGTC TTGGTATCG 



2) INFORMATION FOR SEQ ID NO: 2031 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2031 
CGTGACTGGG GTTCTGCTAT GA 



2) INFORMATION FOR SEQ ID NO: 2032 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2032 
CGTGACTGGG GATCATCAAT GA 



2) INFORMATION FOR SEQ ID NO: 2033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2033 
CGTGACTGGG GTTCTGCCAT GA 



2) INFORMATION FOR SEQ ID NO: 2034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2034 
ATCAAGAACA CTGGCTATGT AG 
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2) INFORMATION FOR SEQ ID NO: 2035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2035 
ATCAAGAACA CTGGCTACGT AG 



2) INFORMATION FOR SEQ ID NO: 2036 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2036 
ATCAAGAACA CTGGTTACGT AG 



2) INFORMATION FOR SEQ ID NO: 2037 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2037 
ATCAAAAATA CTGGTTATGT AG 



2) INFORMATION FOR SEQ ID NO: 2038 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2038 

ATCAAGAATA CTGGCTACGT AG 

2) INFORMATION FOR SEQ ID NO: 2039 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : "22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear' 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2039 
ATCAAAAACA CTGGCTATGT AG 



2) INFORMATION FOR SEQ ID NO: 2040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2040 
TGTGACCCCA GACAAACCC 



2) INFORMATION FOR SEQ ID NO: 2041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2041 
GTTGAGCGGC AGCACTATCT 



2) INFORMATION FOR SEQ ID NO: 2042 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2042 
CACGGGGATT TCTCTATTTA " - 20 



2) INFORMATION FOR SEQ ID NO: 2043 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2043 
CACGGGGATT ACTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2044 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2044 
ACCGTAAGTC GGCCAAGTCA 20 



2) INFORMATION FOR SEQ ID NO: 2045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2045 
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19 



2) INFORMATION FOR SEQ ID NO: 2046 

(i) SEQUENCE CHARACTERISTICS: 
. . (A) .LENGTH: .20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 6 
ACGGGGATTT TTCTATCTAT 20 



2) INFORMATION FOR SEQ ID NO: 2047 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

(C) ACCESSION NUMBER: Z49094 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2047 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTATGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACCTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAATTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAAGTCATC AATCAAGTTG 850 

AAGAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 
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AATGTAGACC AAGAAGCTCA 
TCAATACGTC TCTTACCCTG 
TAGATGTTTC AAATGGTAAA 
GCAAGTAATG TTTCATTCGG 
CTGGGGATCA TCAATGAAAC 
ATGGAGTCTA TGACTCTACT 
TATCCTGGCA CTGATACTCC 
AAACATTACA ATCCAGTATG 
TTGAGACTTT GAATAAGGTC 
- . . GGTCTTGGTA .TCGACTATCC 
TAACACAACT GAATCCAACA 
CTGCTGCCTA CGCTGCTTTT 
TATATCAATA AAATCGTCTT 
TGCTGGTACA CGAGCTATGA 
TGATGAAAAC TGTTTTAACT 
TGGCTTCCAC AAGCAGGTAA 
AATTGAAAAG TATATCAAGA 
TTGTAGGGTA TACCCGTAAA 
AATCGTCTAA CTCCAATCAT 
CTATCGTTCA ATGATAACTT 
GGACAATGCC AGATGGCTTG 
GGTGCTCGTT CTACGTGGAG 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



AAAACATCTG TGGGATATCT 
ACGATGATTT GCAAGTCGCA 
GTCATCGCAC AACTTGGTGC 
TACCAACCAG GCCGTAGAAA 
CAATCACTGA CTATGCTCCC 
GCTTCTATTG TACATGATGT 
ACTCTACAAC TGGGATCATG 
CTCTTCAACA ATCACGAAAT 
GGTCTAGATA GAGCTAAAAC 
AAGCATGCAT TATGCAAACG 
AAAAATATGG TGCAAGTAGT 
GCTAATGGTG GTATTTATCA 
TAGTGATGGT AGCGAAAAAG 
AAGAGACTAC, TGCCTATATG 
TACGGAACAG GACGTGGAGC 
GACAGGTACT TCTAACTATA 
ACACTGGTTA CGTAGCTCCA 
TATGCAATGG CTGTTTGGAC 
CGGAGATGGT TTCCTTGTTG 
ACCTTTCTGA AGATGACCAA 
TATAGAAATG GAGAATTCGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 



PCT/CAOO/01150 

ACAACTCCGA 950 

TCTACGGTCG 1000 

TCGTCATCAA 1050 

CCAATCGTGA 1100 

GCTTTAGAAT 1150 

CCCTTATAAC 1200 

TCTACTTTGG 1250 

GTCACAGCCG 1300 

CTTCCTTAAT 1350 

CCATTTCAAG 1400 
GAAAAAATGG " 1450 

CAAACCAATG 1500 

AATTTTCTGA 1550 

ATGACTGAAA 1600 

CTACCTACCA 1650 

CTGACGAAGA 1700 

GATGAAATGT 1750 

AGGATACTCA 1800 

CTGGTAAAGT 1850 

CCTGGAGATT 1900 

ATTTAAAAAT 1950 

CCCCATCAAC 2000 

TCTAACTCAA 2050 

CAATAATACG 2100 

CTCAACCAGC 2150 
2160 



2) INFORMATION FOR SEQ ID NO: 2048 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: M90527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2048 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 350 

TTCCCTCCAA GGTGGATCAG CTCTCACTCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC TTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAGC AAGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGGAAC TATGGAATGC 550 
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AGACAGCAGC TCAAAACTAC TATGGTAAAG 
CCTCAGTTAG CCTTGCTGGC TGGAATGCCT 
CCCCTATTCA CATCCAGAAG CAGCCCAAGA 
CTGAAATGAA AAATCAAGGC TACATCTCTG 
GTCAATACAC CAATTACTGA TGGGCTACAA 
TTACCCTGCT TACATGGATA ATTACCTCAA 
AAGAAGAAAC AGGCTATAAC CTACTCACAA 
AATGTAGACC AAGAAGCTCA AAAACATCTG 
CGAATACGTT GCCTATCCAG ACGATGAATT 
TTGATGTTTC TAACGGTAAA GTCATTGCCC 
TCAAGTAATG TTTCCTTCGG AATTAACCAA 
CTGGGGATCA ACTATGAAAC CGATCACAGA 
ACGGTGTCTA CGAGTCAACT GCCACTATCG 
TACCCTGGGA CAAATACCCC TGTTTATAAC 
CAACATCACC TTGCAATACG CCCTGCAACA 
TGGAAACTCT AAACAAGGTC GGACTCAACC 
GGTCTCGGAA TCGACTACCC AAGTATTCAC 
TAACACAACC GAATCAGACA AAAAATATGG 
CTGCTGCTTA CGCTGCCTTT GCAAATGGTG 
TATATCCATA AAGTCGTCTT TAGTGATGGG 
TGTCGGAACT CGTGCCATGA AAGAAACGAC 
TGATGAAAAC AGTCTTGAGT TATGGAACTG 
TGGCTCCCTC AGGCTGGTAA AACAGGAACC 
AATTGAAAAC CACATCAAGA CCTCTCAATT 
TTGCTGGCTA TACGCGTAAA TATTCAATGG 
AACCGTCTGA CACCACTTGT AGGCAATGGC 
TTACCGCTCT ATGATGACCT ACCTGTCTGA 
GGAATATACC AGAGGGGCTC TACAGAAATG 
GGTGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 



PCT/CAOO/01150 

ACCTCAATAA TTTAAGTTTA 600 

CAGGCACCAA ACCAATATGA 650 

CCGCCGAAAC TTGGTCTTAT 700 

CTGAACAGTA TGAGAAAGCA 750 

AGTCTCAAAT CAGCAAGTAA 800 

GGAAGTCATC AATCAAGTTG 850 

CTGGGATGGA TGTCTACACA 900 

TGGGATATTT ACAATACAGA 950 

GCAAGTCGCT TCTACCATTG 1000 

AGCTAGGAGC ACGCCATCAG 1050 
GCAGTAGAAA CAAACCGCGA " 1100 

CTATGCTCCT GCCTTGGAGT 1150 

TTCACGATGA GCCCTATAAC 1200 

TGGGATAGGG GCTACTTTGG 1250 

ATCGCGAAAC GTCCCAGCCG 1300 

GCGCCAAGAC TTTCCTAAAT 1350 

TACTCAAATG CCATTTCAAG 1400 

AGCAAGTAGT GAAAAGATGG 1450 

GAACTTACTA TAAACCAATG 1500 

AGTGAAAAAG AGTTCTCTAA 1550 

AGCCTATATG ATGACCGACA 1600 

GACGAAATGC CTATCTTGCT 1650 

TCTAACTATA CAGACGAGGA 1700 

TGTAGCACCT GATGAACTAT 1750 

CTGTATGGAC AGGCTATTCT 1800 

CTTACGGTCG CTGCCAAAGT 1850 

AGGAAGCAAT CCAGAGGATT 1900 

GAGAATTCGT ATTTAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2000 

AACTTCACAG TCTAGCTCAA 2050 

CCAATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 2049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: URU-E159 

(C) ACCESSION NUMBER: AF139890 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2049 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 
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TTCTGAACGC CGCGTCAATG 
TTAAGGCAAT CGTTTCTATC 
ATTGATTCCA TCCGTATCCT 
TTCCCTCCAA GGTGGATCAA 
ACTTTTCAAC CTCGACTTCC 
GCTTGGTTAG CGATTCAGTT 
GACCTACTAT ATAAATAAGG 
AGACAGCAGC TCAAAACTAC 
CCTCAGTTAG CCTTGCTGGC 
CCCCTATTCA CATCCAGAAG 
CTGAAATGAA AAATCAAGGT 
GTCAATACAC CAATTACTGA 
TTACCCTGCT TACATGGATA 
AACAAGAAAC TGGCTATAAC 
AATGTAGACC AAGAAGCTCA 
TCAATACGTC TCTTACCCTG 
TAGATGTTTC AAATGGTAAA 
GCAAGTAACG TTTCATTTGG 
CTGGGGTTCT GCTATGAAAC 
ACGGTGTTTA TGATTCCACT 
TATCCGGGAA CAAGCACACC 
TAATATTACT CTGCAATATG 
TTGAGACTTT GAATAAGGTC 
GGTCTTGGTA TCGACTATCC 
TAATACAACA GAATCTAATA 
CTGCTGCTTA TGCTGCCTTT 
TATATCCATA AAGTCGTCTT 
TGTCGGAACT CGTGCCATGA 
TGATGAAAAC AGTCTTGACT 
TGGCTTCCTC AAGCTGGTAA 
AGTTGAAAAC CACATCAAGA 
TTGTTGGTTA TACTCGTAAG 
AATCGTTTAA CTCCTATCGT 
TTATCGCTCA ATGATAACGT 
GGACGATGCC AGACGGACTT 
GGTGCTCGTT CTACGTGGAA 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



CCCAAGCTAA TGATATTCCC 
GAAGACCATC GCTTCTTCGA 
GGGAGCTTTC TTGCGCAATC 
CTCTCACCCA ACAGTTGATT 
GACCAGACTA TTTCTCGTAA 
AGAACAAAAA GCAACCAAAC 
TCTACATGTC TAATGGCAAC 
TATGGTAAAG ACCTCAATAA 
TGGAATGCCT CAGGCACCAA 
CAGCCCAAGA CCGCCGAAAC 
TACATCTCTG CTGAACAGTA 
TGGACTACAA AGTCTCAAAT 
ATTACCTCAA GGAGGTCATC 
CTTCTAACTA, CTGGGATGGA 
AAAACATCTG TGGGATATCT 
ACGATGATTT GCAAGTCGCA 
GTCATCGCCC AACTTGGAGC 
TACCAACCAA GCTGTGGAAA 
CAATCACCGA TTATGCACCT 
GCAACTATGG TTAATGATAT 
TGTCTACAAC TGGGATAGAG 
CTCTTCAACA ATCACGAAAT 
GGTCTAGATA GAGCTAAAAC 
AAGCATGCAT TATGCAAACG 
AACAATACGG AGCAAGTAGT 
GCAAATGGTG GCACTTACTA 
CAGTGATGGA AGTAAAAAAG 
AGGAAACGAC AGCCTATATG 
TATGGAACTG GGCGTGGAGC 
AACAGGAACC TCTAACTATA 
ACACTGGCTA TGTAGCTCCA 
TATTCTATGG CTGTATGGAC 
TGGAGATGGT TTCCTAGTTG 
ATCTATCAGA AGATACTCAT 
TTCAGAAATG GAGAATTCGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 



ACAGATTTGG 


250 


CCACAGGGGG 


300 


TGCAAAGTAA 


350 


AAGTTGACTT 


400 


GGCTCAGGAA 


450 


AGGAAATCTT 


500 


TATGGAATGC 


550 


TTTAAGTTTA 


600 


ACCAATATGA 


650 


TTGGTCTTAT 


700 


TGAGAAAGCA 


750 


CAGCAAGTAA 


800 


AATCAAGTAG 


850 


TGTTTACACA 


900 


ACAACTCCGA 


950 


TCTACGGTCG 


1000 


TCGTCACCAA 


1050 


CCAATCGTGA 


1100 


GCCATAGAAT 


1150 


TCCTTATAAC 


1200 


CATATTTCGG 


1250 


GTCACAGCCG 


1300 


CTTCCTTAAT 


1350 


CCATTTCAAG 


1400 


GAAAAAATGG 


1450 


TAAACCAATG 


1500 


AGTTCTCTAA 


1550 


ATGACCGACA 


1600 


CTATCTTCCT 


1650 


CAGATGAGGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTTAAAAAT 


1950 


CCCCATCAAC 


2000 


TCTAGCTCAA 


2050 


CAATAATACG 


2100 


CTCAACCAGC 


2150 




2160 



2) INFORMATION FOR SEQ ID NO: 2050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 8303; 35193 

(C) ACCESSION NUMBER: AF046230 
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(xi) SEQUENCE DESCRIPTION: SEQ 

CTGTGGGATA TCTACAACTC CGATCAATAC 
TTTGCAAGTC GCATCTACGG TCGTAGATGT 
CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 
CAAGCTGTGG AAACCAATCG TGACTGGGGT 
CGATTATGCA CCTGCCATAG AATACGGTGT 
TGGTTAATGA TATTCCTTAT AACTATCCGG 
AACTGGGATA GAGCATATTT CGGTAATATT 
ACAATCACGA AATGTCACAG CCGTTGAGAC 
ATAGAGCTAA AACCTTCCTT AATGGTCTTG 
CATTATGCAA ACGCCATTTC AAGTAATACA 
CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 
GTGGCACTTA CTATAAACCA ATGTATATCC 
GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 
GACAGCCTAT ATGATGACCG ACATGATGAA 
CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 
ACCTCTAACT ATACAGATGA GGAAGTTGAA 
CTATGTAGCT CCAGATGAAA TGTTTGTTGG 
TGGCTGTATG GACAGGTTAT TCGAATCGTT 
GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 



ID NO: 2050 

GTCTCTTACC CTGACGATGA 50 

TTCAAATGGT AAAGTCATCG 100 

ACGTTTCATT TGGTACCAAC 150 

TCTGCTATGA AACCAATCAC 200 

TTATGATTCC ACTGCAACTA 250 

GAACAAGCAC ACCTGTCTAC 300 

ACTCTGCAAT ATGCTCTTCA 350 
TTTGAATAAG GTCGGTCTAG - 400 

GTATCGACTA TCCAAGCATG 450 

ACAGAATCTA ATAAACAATA 500 

TTATGCTGCC TTTGCAAATG 550 

ATAAAGTCGT CTTCAGTGAT 600 

ACTCGTGCCA TGAAGGAAAC 650 

AACAGTCTTG ACTTATGGAA 700 

CTCAAGCTGG TAAAACAGGA 750 

AACCACATCA AGAACACTGG 800 

TTATACTCGT AAGTATTCTA 850 

TAACTCCTAT CGTTGGAGAT 900 

930 



2) INFORMATION FOR SEQ ID NO: 2051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 63509; Mil 

(C) ACCESSION NUMBER: AF04 6238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2051 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGATGATGA 50 

CATGCAAGTA GCTTCGACTA TTGTAGATGT ATCTAATGGT AACGTTATTG 100 

CACAACTTGG TGCTCGTCAT CAAGCAAGTA ATGTTTCATT CGGCACCAAC 150 

CAGGCTGTGG AGACCAATCG TGACTGGGGT TCTTCTATGA AACCAATCAC 200 

TGACTATGCT CCCGCTTTAG AATATGGAGT CTATGACTCT ACTGCTTCTA 250 

TTGTACATGA TGTTCCTTAT AACTATCCTG GCACTGATAC TCCAGTCTAC 300 

AACTGGGATC ATGTCTACTT TGGAAACATT ACAATCCAGT ATGCTCTTCA 350 

ACAATCACGA AATGTCACAG CCGTTGAGAC TTTGAATAAG GTCGGTCTAG 400 

ATAGAGCTAA AACCTTCCTT AATGGTCTTG GTATCGACTA TCCAAGCATG 450 

CATTATGCAA ACGCCATTTC AAGTAACACA ACTGAATCCA ACAAACAGTA 500 

CGGTGCAAGT AGTGAAAAGA TGGCTGCTGC TTATGCCGCC TTTTCTAATG 550 

GTGGTATTTA CCACAAACCA ATGTATATCA ATAAAATCGT CTTCAGTGAT 600 

GGTAGTGAAA AAGAATTTTC TGATGCTGGT ACACGAGCTA TGAAAGAAAC 650 

TACTGCCTAT ATGATGACCG AAATGATGAA AACTGTTTTA GCTTACGGAA 700 

CCGGACGTGG TGCTTACCTA CCATGGCTTC CACAAGCAGG TAAGACAGGT 750 
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ACTTCTAACT ATACTGACGA CGAAATTGAA 
CTACGTAGCT CCAGATGAAA TGTTTGTTGG 
TGGCTGTATG GACTGGTTAC TCAAATCGTT 
GGTTTCCTAG TTGCTGCTAA GGTTTATCGC 



AAGTATATCA AGAACACTGG 800 
TTATACTCGT AAGTATTCTA 850 
TAACTCCAAT CGTAGGAGAT 900 

930 



2) INFORMATION FOR SEQ ID NO: 2052 

- (i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1195 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: #22/HA5 

(C) ACCESSION NUMBER: AB006877 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2052 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 50 

TTCCAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 100 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 150 

TCAACAATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 200 

ATATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 250 

GAACAAGCAC ACCTGTCTAC AACTGGGATC GAGCATATTT TGGTAATATT 300 

ACTCTGCAAT ATGCCCTTCA ACAATCACGT AATGTTACAG CAGTTGAAAC 350 

ATTAAACAAG GTTGGTTTAG ATCGAGCCAA AACTTTCCTA AATGGCTTAG 400 

GTATCGATTA CCCAAGCATA CACTATGCTA ACGCCATTTC AAGTAATACA 450 

ACAGAATCTA ACAAACATTA CGGTGCAAGT AGTGAAAAAA TGGCTGCTGC 500 

TTATGCCGCC TTTGCTAATG GTGGTATTTA CCACAAACCA ATGTATATCA 550 

ATAAAATCGT CTTTAGTGAT GGTAGTGAGA AAGAATTTTC TGATGCTGGT 600 

ACACGAGCTA TGAAAGAAAC TACTGCCTAT ATGATGACCG AAATGATGAA 650 

AACTGTTTTA GCTTACGGAA CCGGACGTGG TGCTTACCTA CCATGGCTTC 700 

CACAAGCAGG TAAGACAGGT ACTTCTAACT ATACTGATGA CGAAATCGAA 750 

AAACATATCA AGAATACTGG CTACGTAGCT CCAGATGAAA TGTTTGTAGG 800 

CTATACTCGT AAGTATTCTA TGGCTGTATG GACTGGTTAC TCAAATCGTT 850 

TAACTCCAAT TGTTGGAGAT GGTTTCCTAG TTGCTGCTAA GGTTTATCGC 900 

TCAATGATAT CGTATCTATC AGAAGATGAC CAACCTGGAG ATTGGACGAT 950 

GCCAGACGGC CTGTTCCGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1000 

GTTCTACGTG GAACTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1050 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1100 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1150 

CAAATACAAC CCCTGATCAA CAAAATCAGA ATCCTCAACC AGCAC 1195 



2) INFORMATION FOR SEQ ID NO: 2053 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 17619 

(C) ACCESSION NUMBER: AF046237 

- (xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 2053 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 50 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 100 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 150 

CAAGCAGTAG AAACAAACCG CGACTGGGGA TCAACTATGA AACCGATCAC 200 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 250 

TCGTTCACGA TGAGCCCTAT, AACTACCCTG GGACAAATAC TCCTGTTTAT 300 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 350 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 400 

ACCGCGCCAA GACTTTCCTA AATGGTCTCG GAATCGACTA CCCAAGTATT 450 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 500 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 550 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 600 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 650 

GACAGCCTAT ATGATGACCG AAATGATGAA AACAGTCTTG AGTTATGGAA 700 

CTGGACGAAA TGCCTATCTT GCTTGGCTTC CTCAAGCTGG TAAAACAGGT 750 

ACCTCTAACT ATACAGATGA AGAAATTGAA AAACACATCA AAAACACTGG 800 

CTATGTAGCT CCAGATGAAA CGTTTGTTGG TTATACTCGT AAGTATTCTA 850 

TGGCTGTATG GACAGGTTAC ACAAACCGTC TCACACCAAT TGTGGGGGAT 900 

GGCTTCACAG TTGCTGCCAA AGTTTACCGC 930 



2) INFORMATION FOR SEQ ID NO: 2054 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: Y07845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2054 

CGTCGCATTC TCTACGGAAT GAATGAATTG GGTGTGACCC CAGACAAACC 50 

CCATAAAAAA TCTGCTCGTA TTACAGGGGA TGTCATGGGT AAATACCACC 100 

CACACGGGGA TTCCTCTATT TATGAAGCCA TGGTCCGTAT GGCTCAATGG 150 

TGGAGCTACC GTTACATGCT TGTAGATGGT CATGGGAATT TTGGTTCCAT 200 

GGATGGAGAT AGTGCTGCCG CTCAACGTTA TACCGAGGCA CGTATGAGCA 250 

AGATTGCTCT GGAAATGCTT CGTGATATCA ACAAAAATAC AGTTGATTTC 300 

GTTGAT 306 
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2) INFORMATION FOR SEQ ID NO: 2055 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2472 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(Q) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 7785 

(C) ACCESSION NUMBER: Z67739 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2055 

ATGTCTAACA TTCAAAACAT GTCCCTGGAG GACATCATGG GAGAGCGCTT 50 

TGGTCGCTAC TCCAAGTACA TTATTCAAGA CCGGGCTTTG CCAGATATTC 100 

GTGATGGGTT GAAGCCGGTT CAGCGCCGTA TTCTTTATTC TATGAATAAG 150 

GATAGCAATA CTTTTGACAA GAGCTACCGT AAGTCGGCCA AGTCAGTCGG 2 00 

GAACATCATG GGGAATTTCC ACCCACACGG GGATTCTTCT ATCTATGATG 250 

CCATGGTTCG TATGTCACAG AACTGGAAAA ATCGTGAGAT TCTAGTTGAA 3 00 

ATGCACGGTA ATAACGGTTC TATGGACGGA GATCCTCCTG CGGCTATGCG 350 

TTATACTGAG GCACGTTTGT CTGAAATTGC AGGCTACCTT CTTCAGGATA 4 00 

TCGAGAAAAA GACAGTTCCT TTTGCATGGA ACTTTGACGA TACGGAGAAA 4 50 

GAACCAACGG TCTTGCCAGC AGCCTTTCCA AACCTCTTGG TCAATGGTTC 500 

GACTGGGATT TCGGCTGGTT ATGCCACAGA CATTCCTCCC CATAATTTAG 550 

CTGAGGTCAT AGATGCTGCA GTTTACATGA TTGACCACCC AACTGCAAAG 600 

ATTGATAAAC TCATGGAATT CTTACCTGGA CCAGACTTCC CTACAGGGGC 650 

TATTATTCAG GGTCGTGATG AAATCAAGAA AGCTTATGAG ACTGGGAAAG 700 

GGCGCGTGGT TGTTCGTTCC AAGACTGAAA TTGAAAAGCT AAAAGGTGGT 750 

AAGGAACAAA TCGTTATTAC TGAGATTCCT TATGAAATCA ATAAGGCCAA 800 

TCTAGTCAAG AAAATCGATG ATGTTCGTGT TAATAACAAG GTAGCTGGGA 8 50 

TTGCTGAGGT TCGTGATGAG TCTGACCGTG ATGGTCTTCG TATCGCTATC 9 00 

GAACTTAAGA AAGACGCTAA TACTGAGCTT GTTCTCAACT ACTTATTTAA 9 50 

GTACACCGAC CTACAAATCA ACTACAACTT TAATATGGTG GCGATTGACA 1000 

ATTTCACACC TCGTCAGGTT GGGATTGTTC CAATCCTGTC TAGCTACATC 1050 

GCTCACCGTC GAGAAGTGAT TTTGGCGCGT TCACGCTTTG ACAAAGAAAA 1100 

GGCTGAGAAA CGTCTCCATA TCGTCGAAGG TTTGATTCGT GTGATTTCGA 1150 

TTTTGGATGA AGTCATTGCT CTTATCCGTG CTTCTGAGAA TAAGGCGGAC 1200 

GCCAAGGAAA ACCTCAAAGT TAGCTATGAT TTTACGGAAG AACAGGCTGA 1250 

GGCTATCGTA ACTTTGCAAC TGTACCGTTT GACCAATACC GATGTGGTTG 13 00 

TCTTGCAGGA AGAAGAAGCA GAGCTTCGTG AGAAGATTGC TATGCTGGCG 13 50 

GCTATTATCG GTGATGAAAG GACTATGTAC AATCTCATGA AGAAAGAACT 14 00 

TCGTGAGGTC AAGAAGAAAT TTGCAACTCC TCGTTTGAGT TCTTTAGAAG 14 50 

ACACTGCGAA AGCAATTGAG ATTGATACAG CTAGTCTTAT CGCTGAGGAA 1500 

GATACCTACG TCAGCGTGAC CAAGGCAGGT TACATCAAGC GTACCAGTCC 1550 

ACGTTCCTTT GCGGCTTCCA CCTTGGAAGA AATTGGCAAG CGTGATGATG 1600 

ACCGTTTGAT TTTTGTTCAA TCTGCCAAGA CAACCCAGCA CCTCTTGATG 1650 

TTCACAAGTC TTGGAAATGT CATCTACAGA CCAATCCATG AGTTGGCAGA 1700 

TATTCGTTGG AAGGACATCG GAGAGCATCT GAGCCAAACC ATCACAAACT 175 0 

TTGAAACGAA TGAAGAAATC CTTTATGTGG AAGTACTGGA TCAGTTTGAC 1800 

GATGCGACAA CCTACTTTGC AGTGACTCGC CTTGGTCAAA TCAAACGGGT 1850 
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AGAGCGAAAA GAATTCACTC CATGGCGGAC CTATAGATCT AAGTCTGTCA 1900 

AGTATGCTAA GCTCAAAGAC GATACAGATC AGATTGTAGC AGTGGCTCCG 1950 

ATTAAACTAG ATGATGTTGT CTTGGTTAGT CAAAATGGTT ATGCCCTGCG 2000 

TTTCAATATC GAAGAGGTTC CGGTTGTCGG TGCTAAGGCA GCAGGTGTCA 2050 

AGGCTATGAA TTTGAAAGAA GATGATGTCC TCCAATCTGG CTTTATCTGT 2100 

AATACTTCGT CCTTCTACCT CTTGACCCAG CGTGGAAGCT TGAAACGTGT 2150 

TTCTATTGAG GAAATTCTAG CAACCAGCCG TGCCAAACGA GGATTACAAG 2200 

TCTTGCGTGA GTTGAAAAAC AAACCGCATC GTGTCTTCTT GGCAGGAGCA 2250 

GTTGCAGAGC AAGGATTTGT TGGCGATTTC TTCAGTACGG AAGTGGATGT 23 00 

. GAACGACCAA ACTCTGCTTG TCCAATCCAA TAAAGGAACA ATCTATGAAA 2350 

GCCGATTGCA AGACTTGAAC TTGTCAGAAC "GCACTAGCAA TGGAAGCTTC 2400 

ATTTCTGACA CGATTTCAGA TGAAGAAGTT TTTGACGCTT ATCTTCAGGA 2450 

AGTAGTTACT GAAGATAAAT AA 2472 



2) INFORMATION FOR SEQ ID NO: 2056 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2056 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATTGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGCAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCAACA 200 

ATGAAACCAA TCACAGACTA TGCTCCTGCC TTGGAATACG GTGTCTACGA 250 

TTCAACTGCT ACTATCGTTC ACGATGAGCC CTATAACTAT CCTGGGACAG 300 

ATACCCCTGT CTATAACTGG GATAGGGGCT ACTTTGGCAA TATCACCTTG 350 

CAATACGCCC TGCAACAATC GCGAAACGTC CCAGCCGTGG AAACACTAAA 400 

CAAGGTCGGG CTCAACCGCG CCAAGACTTT TCTAAATGGT CTCGGAATCG 450 

ACTACCCAAG TATTCACTAC TCAAATGCCA TTTCAAGTAA CACAACCGAG 500 

TCAGACAAAA AATATGGAGC AAGTAGTGAA AAGATGGCTG CTGCTTACGC 550 

TGCCTTTGCA AATGGTGGAA CTTACTATAA ACCAATGTAT ATCCATAAAG 6 00 

TCGTCTTTAG TGATGGAAGT GAAAAAGAGT TCTCTAATGT CGGAACTCGC 650 

GCCATGAAAG AAACGACTGC TTACATGATG ACAGAAATGA TGAAAACAGT 700 

CTTGACGTAC GGAATTGGTC GTGGTGCCTA CCTGCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGTACTTCT AACTATACTG ACGAAGAAAT TGAAAAGTAT 800 

ATCAAGAACA CTGGTTACGT AACTCCAGAT GAAATGTTTG TAGGGTATAC 850 

CCGTAAATAT GCAATGGCTG TATGGACAGG CTATTCTAAC CGTCTGACAC 900 

CACTTGTAGG CGATGGCCTT ACGGTCGCTG CCAAGGTTTA CCGCTCTATG 95 0 

ATGACCTACC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA TC 1212 
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2) INFORMATION FOR SEQ ID NO: 2057 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
.(D). TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2057 

ATGTAGACCA AGAAGCTCAA AAACATCTGT GGGATATTTA CAATACAGAC 50 

GAATACGTTG CCTATCCAGA CGATGAATTG CAAGTCGCTT CTACCATTGT 100 

TGATGTTTCT AACGGGAAAG TTATTGCTCA GTTAGGTTCT CGTCACCAAT 150 

CAAGCAATGT TTCCTTCGGA ATCAACCAAG CTGTTGAAAC CAACCGTGAC 200 

TGGGGTTCTG CCATGAAGCC AATCACAGAC TATGCTCCTG CCTTAGAGTA 250 

TGACATCTAC GACTCAACTG CTTCGATTGT ACATGATGTT CCTTATAACT 300 

ATCCAGGTAC TGATACTCCC CTCTACAACT GGGATAAAGT CTACTTTGGA 350 

AATATTACAA TCCAGTATGC ACTTCAACAG TCACGTAATG TCACAGCCGT 4 00 

TGAGACTTTG AATAAGGTCG GTCTAGATAG AGCTAAAACC TTTCTTAATG 450 

GTCTTGGTAT CGACTATCCA AGCATGCATT ATGCAAACGC CATTTCAAGT 500 

AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG AGAAAATGGC 550 

TGCTGCTTAC GCTGCTTTTG CTAACGGTGG TATCTACCAT AAACCAATGT 600 

ATATCAACAA AATCGTCTTT AGCGATGGTA GCTCAAAAGA ATACGCTGAT 650 

CCTGGTACTC GTGCCATGAA AGAGACGACC GCCTATATGA TGACAGAAAT 700 

GATGAAGACT GTCTTGGCAT ACGGAACGGG TCGTGGTGCT TATCTCCCTT 750 

GGCTACCTCA AGCTGGTAAG ACTGGTACAT CAAACTATAC AGATGATGAA 800 

ATTGAAAACT ACATCAAAAA TACTGGTTAT GTAGCTCCAG ACGAAATGTT 850 

TGTTGGTTAT ACTCGCAAAT ATTCAATGGC GGTATGGACA GGTTACTCAA 900 

ACCGCCTGAC TCCTATCGTT GGTGATGGCT TCTATGTTGC AGCTAAGGTT 950 

TACCGTTCAA TGATGACTTA TCTGTCTGAG GATAACAACC CTGGCGACTG 1000 

GACTATGCCA GAAGGTCTCT ATCGAAGTGG TGAGTTCGTC TTTAAAAAAG 1050 

GTGCTCGTTC TGCATGGACT GCTCCTGCTC CGCAACAGGC CCCAACACCA 1100 

GAAAGTTCGA GCTCGACATC AGAAAGTTCA ACTTCACAGT CAAGCTCAAC 1150 

TACTCCAAGC ACGAATAATA GTGCAAACAA TAATACCAAT AACCAGCAAC 1200 

CAAATACAAC GCCTGGTCAA CAAAACCAGA ACCAAAATCA GA 1242 



2) INFORMATION FOR SEQ ID NO: 2058 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2058 

GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG AATACGTTGC 50 

CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT GATGTTTCTA 100 

ACGGGAAAGT TATTGCTCAG TTAGGTTCTC GTCACCAATC AAGCAATGTT 150 

TCGTTCGGAA TCAACCAAGC TGTTGAAACC AACCGTGACT GGGGTTCTGC 200 

CATGAAGCCA ATCACAGACT ATGCTCCTGC CTTAGAGTAT GACATCTACG 250 " 

ACTCAACTGC TTCGATTGTA CATGATGTTC CTTATAACTA TCCAGGTACT 300 

GATACTCCCC TCTACAACTG GGATAAAGTC TACTTTGGAA ATATTACAAT 350 

CCAGTATGCA CTTCAACAGT CACGTAATGT CACAGCCGTT GAGACTTTGA 400 

ATAAGGTCGG TCTAGATAGA GCTAAAACCT TTCTTAATGG TCTTGGTATC 450 

GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA ATACGACTGA 500 

GTCAAACAAA AAGTACGGAG CAAGTAGTGA GAAAATGGCT GCTGCTTACG 550 

CTGCTTTTGC TAACGGTGGT ATCTACCATA AACCAATGTA TATCAACAAA 600 

ATCGTCTTTA GCGATGGTAG CTCAAAAGAA TACGCTGATC CTGGTACTCG 650 

TGCCATGAAA GAGACGACCG CCTATATGAT GACAGAAATG ATGAAGACTG 700 

TCTTGGCATA CGGAACGGGT CGTGGTGCTT ATCTCCCTTG GCTACCTCAA 750 

GCTGGTAAGA CTGGTACATC AAACTATACA GATGATGAAA TTGAAAACTA 800 

CATCAAAAAT ACTGGTTATG TAGCTCCAGA CGAAATGTTT GTTGGTTATA 850 

CTCGCAAATA TTCAATGGCG GTATGGACAG GTTACTCAAA CCGCCTGACT 900 

CCTATCGTTG GTGATGGCTT CTATGTTGCA GCTAAGGTTT ACCGTTCAAT 950 

GATGACTTAT CTGTCTGAGG ATAACAACCC TGGCGACTGG ACTATGCCAG 1000 

AAGGTCTCTA TCGAAGTGGT GAGTTCGTCT TTAAAAAAGG TGCTCGTTCT 1050 

GCATGGACTG CTCCTGCTCC GCAACAGGCC CCAACACCAG AAAGTTCGAG 1100 

CTCGACATCA GAAAGTTCAA CTTCACAGTC AAGCTCAACT ACTCCAAGCA 1150 

CGAATAATAG TGCAAACAAT AATACCAATA ACCAGCAACC AAATACAACG 1200 

CCTGGTCAAC AAAACCAGAA CCAAA 1225 



2) INFORMATION FOR SEQ ID NO: 2059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2059 

TGGCTACTTT GGAAACATTA CAGTCCAGTA TGCTCTTCAA CAATCACGAA 50 

ATGTCACAGC CGTTGAGACT TTGAATAAGG TCGGTCTAGA TAGAGCTAAA 100 

ACCTTCCTCA ATGGACTTGG TATCGATTAT CCAAGCATGC TTTATGCAAA 150 

CGCCATTTCA AGTAACACAA CTGAATCCAA CAAAAAGTAC GGAGCAAGTA 200 

GTGAAAAAAT GGCCGCTGCC TACGCAGCTT TTGCTAATGG TGGTACTTAC 250 

CACAAACCAA TGTATATCAA TAAAATCGTC TTTAGTGATG GTAGTGAAAA 3 00 

AGAATTTTCT GATGCCGGTA CTCGGGCTAT GAAAGAAACT ACTGCCTATA 350 
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TGATGACCGA AATGATGAAA ACTGTCTTAT 
GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 
TACTGACGAA GAAATTGAAA AGTATATCAA 
CAGATGAAAT GTTTGTTGGT TATACCCGCA 
ACAG 



TATACGGAAC CGGACGTGGA 400 
AAGACAGGTA CTTCTAACTA 450 
GAATGCTGGT TACGTAGCTC . 500 
AATATGCAAT GGCTGTTTGG 550 

554 



2) INFORMATION FOR SEQ ID NO: 2060 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-57 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2060 

CAACTGGGAT GGATGTCTAC ACAAATGTAG ACCAAGAAGC TCAAAAACAT 50 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 100 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 150 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 200 

CAAGCAGTAG AAACAAACCG CGACTGGGGA TCAACTATGA AACCGATCAC 250 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 300 

TCGTTCACGA TGAGCCCTAT AACTACCCTG GGACAAATAC TCCTGTTTAT 350 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 400 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 450 

ACCGCGCCAA GACTTTCCTA AATGGTCTAG GAATCGACTA CCCAAGTATT 500 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 550 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 600 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 650 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 700 

GACAGCCTAT ATGATGACCG ACATGATGAA AACAGTCTTG ACTTATGGAA 750 

CTGGACGAAA TGCCTATCTT GCTTGGCTCC CTCAGGCTGG TAAAACAGGA 800 

ACCTCTAACT ATACAGACGA GGAAATTGAA AACCACATCA AGACCTCTCA 850 

ATTTGTAGCA CCTGATGAAC TATTTGCTGG CTATACGCGT AAATATTCAA 900 

TGGCTGTATG GACAGGCTAT TCTAACCGTC TGACACCACT TGTAGGCAAT 950 

GGCCTTACGG TCGCTGCCAA AGTTTACCGC TCTATGATGA CCTACCTGTC 1000 

TGAAGGAAGC AATCCAGAAG ATTGGAATAT ACCAGAGGGG CTCTACAGAA 1050 

ATGGAGAATT CGTATTTAAA AATGGTGCTC GTTCTACGTG GAGCTCACCT 1100 

GCTCCACAAC AACCCCCATC AACTGAAAGT TCAAGCTCAT CATCAGATAG 1150 

TTCAACTTCA CAGTCTAGCT CAACCACTCC AAGCACAAAT AATAGTACGA 1200 

CTACCAATCC TAACAATAAT ACGCAACAAT CAAATACAAC CCCTGATCA 1249 



2) INFORMATION FOR SEQ ID NO: 2061 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 9 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2061 

GAAGTTGAAC AGACTGGTCA CCAAGCTCCT TCATATCCGC CAACACTGCA 50 

ATTTTCTTGC CACCTTCATT GGCTGGAATG GCAGAGAAAG TCTCTAAAAT 100 

CAGTTTCATA GCAGTTGGAT TGGCATTATA AACATCTGAC AGGATATCTG 150 

CTCCATTGGC TGCTTTCTTC CACTCGGTAC GGTTACGCGT CAATTCAAGA 2 00 

TGTTGGAAGG CCAAACGAAT TTGCTCCTCT GAAACTCCTT CTTGCAAGGC 2 50 

AACATAGGAT GCAATCATAG, CATTTGTCGC ATTGTACTTG CCAGTTACTG 3 00 

GCAAATCAAG GGCTTGCTCT AAGAAATTGA CCTTGAAGGT CAGACTATCT 350 

TTGCGCTCAA CCAAGTCGGT AATTCCCAGC TCTGCTCCTT GACCAAAACG 4 00 

AACCACCTTT TTATCAGTTG GCAAATAGTC CTCTACGATA GGGTCAGCCG 4 50 

GCGCTAAAAG CAAGGAACCT GAAGCCATTC CGTCTGCAAT TTGCATTTTT 500 

CCTTTAGCAA TCTCAGAACG GTCTTTGAAA AAGGCCAAAT GAGCTTCTCC 550 

AACCAAGGTC ACGATGGCTG TATGGACAG 579 



2) INFORMATION FOR SEQ ID NO: 2062 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-63 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2062 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA AGTAATGTTT 150 

CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG GGGATCATCA 200 

ATGAAACCAA TCACTGACTA TGCTCCCGCT TTAGAATATG GAGTCTATGA 250 

CTCTACTGCT TCTATTGTAC ATGATGTCCC TTATAACTAT CCTGGCACTG 300 

ATACTCCACT CTACAACTGG GATCATGTCT ACTTTGGAAA CATTACAATC 350 

CAGTATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA CACAACTGAA 500 

TCCAACAAAA AATATGGTGC AAGTAGTGAA AAAATGGCTG CTGCCTACGC 550 

TGCTTTTGCT AATGGTGGTA TTTATCACAA ACCAATGTAT ATCAATAAAA 600 

TCGTCTTTAG TGATGGTAGC GAAAAAGAAT TTTCTGATGC TGGTACACGA 650 

GCTATGAAAG AGACTACTGC CTATATGATG ACTGAAATGA TGAAAACTGT 700 

TTTAACTTAC GGAACAGGAC GTGGAGCCTA CCTACCATGG CTTCCACAAG 75 0 
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CAGGTAAGAC AGGTACTTCT AACTATACTG 
ATCAAGAACA CTGGTTACGT AGCTCCAGAT 
CCGTAAATAT GCAATGGCTG TTTGGACAGG 
CAATCATCGG AGATGGTTTC CTTGTTGCTG 
ATAACTTACC TTTCTGAAGA TGACCAACCT 
TGGCTTGTAT AGAAATGGAG AATTCGTATT 
CGTGGAGCTC ACCTGCTCCA CAACAACCCC 
TCATCATCAG ATAGTTCAAC TTCACAGTCT 
AAATAATAGT ACGACTACCA ATCCTAACAA 
CAACCCCTGA TCAACA 



ACGAAGAAAT TGAAAAGTAT 800 

GAAATGTTTG TAGGGTATAC 850 

ATACTCAAAT CGTCTAACTC 900 

GTAAAGTCTA TCGTTCAATG 950 

GGAGATTGGA CAATGCCAGA 1000 

TAAAAATGGT GCTCGTTCTA 1050 

CATCAACTGA AAGTTCAAGC 1100 

AGCTCAACCA CTCCAAGCAC 1150 

TAATACGCAA CAATCAAATA 1200 

1216 



2) INFORMATION FOR SEQ ID NO: 2063 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8,10 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2063 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC CACGGTCGTA 50 

GATGTTTCAA ATGGTAAAGT CATCGCCCAA CTTGGAGCTC GTCACCAAGC 100 

AAGTAACGTT TCATTTGGTA CCAACCAAGC TGTGGAAACC AATCGTGACT 150 

GGGGTTCAAC AATGAAACCC ATCACCGATT ATGCACCTGC CATAGAATAC 200 

GGTGTATATG ATTCCACTGC AACTATGGTT AATGATATTC CTTATAACTA 250 

TCCGGGAACA AGCACACCTG TCTACAACTG GGATCGAGCA TATTTTGGTA 3 00 

ATATTACTCT GCAATATGCC CTTCAACAAT CTCGTAACGT ACCCGCCGTT 350 

GAGACACTAA ACAAGGTTGG TTTAGATAGA GCTAAAAGTT TCCTAAATGG 4 00 

TTTAGGAATC GACTATCCTG TAATGCACTA TTCAAATGCT ATTTCAAGTA 450 

ATACTACCGA ATCTAGTAAA CAGTACGGGG CAAGTAGTGA AAAAATGGCC 500 

ACTGCCTATG CCGCATTCGC AAACGGCGGT ATTTACCACA AACCAATGTA 550 

CATCAATAAG GTTGTCTTTA GCGATGGTAG CGAAAAAGAA TTTTCTGACC 600 

CTGGCACAAG AGCCATGAAA GAAACGACTG CTTACATGAT GACAGAGATG 650 

ATGAAAACAG TCTGGACTTA CGGAACTGGT CGCGGTGCCT ACCTACCTTG 700 

GCTTCCACAA GCAGGTAAAA CAGGTACTTC TAACTATACT GACGAAGAAA 750 

TTGAAAAGTA TATCAAGAAC ACTGGTTACG TAGCTCCAGA TGAAATGTTT 800 

GTAGGGTATA 810 



2) INFORMATION FOR SEQ ID NO: 2064 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700678 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2064 

TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG TAGATGTTTC 50 

AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA GCAAGTAACG 100 

TTTCATTTGG" TACCAACCAAGCTGTGGAAA CCAATCGTGA CTGGGGATCA 150 

ACTATGAAAC CAATCACAGA CTATGCTCCT GCCTTGGAGT ACGGTGTCTA 200 

CGATTCAACT GCTACTATCG TTCACGATGA GCCCTATAAC TACCCTGGGA 250 

CAGATATCCC TCTCTATAAC TGGGATCGAG CATATTTCGG TAATATTACT 300 

CTGCAATATG CCCTTCAACA ATCTCGTAAC GTACCTGCCG TTGAAACACT 350 

AAACAAGGTC GGTCTAGATA AGGCTAAAAC CTTCCTTAAT GGTCTTGGTA 400 

TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG TAATACAACT 450 

GAATCCAACA AAAAATATGG TGCAAGTAGT GAAAAAATAG CTACCGCCTA 500 

TGCCGCATTC GCAAATGGTG GTATTTACCA CAAACCAATG TACATCAATA 550 

AAGTTGTCTT TAGCGATGGT AGCGAAAAAG AATTTTCTGA CCCTGGCACA 600 

AGAGCCATGA AAGAAACGAC TGCTTACATG ATGACAGAAA TGATGAAAAC 650 

AGTCTGGACG TACGGAACTG GTCGTGGTGC CTACCTGCCT TGGCTTCCTC 700 

AAGCTGGTAA AACAGGTACC TCTAACTATA CTGACGAAGA AATTGAAAAG 750 

TATATCAAGA ACACTGGTTA CGTAGCTCCA GA 782 



2) INFORMATION FOR SEQ ID NO: 2065 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2065 
CCAGGACGTG GAGGCGATCA CA 22 



2) INFORMATION FOR SEQ ID NO: 2066 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2066 
CACCGACAGC GAGCCGATCA GA 22 
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2) INFORMATION FOR SEQ ID NO: 2067 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear , 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2067 
AGCTGAGCCA ATTCATGG 



2) INFORMATION FOR SEQ ID NO: 2068 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 
ATTCATGGAC CAGAACAAC 



2) INFORMATION FOR SEQ ID NO: 2069 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2069 
CGCTGTCGGG GTTGACCC 



2) INFORMATION FOR SEQ ID NO: 2070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2070 
GTTGACCCAC AAGCGCCG 



2) INFORMATION FOR SEQ ID NO: 2071 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: ■ Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2071 
CGACTGTCGG CGCTGGGG 



2) INFORMATION FOR SEQ ID NO: 2072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Rv 

(C) ACCESSION NUMBER: L27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2072 

GTGCTGGAAG GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC 50 

TAGTCCTAGT CCGAGTCGCC CGCAAAGTTC CTCGAATAAC TCCGTACCCG 100 

GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC TGCGCGAACC ACTTGAGGTT 150 

CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC TGATCGGTTC 200 

GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 250 

GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 300 

GGGTCGATGT CGTTGTCGTT CTCTGACCCT CGTTTCGACG ATGTCAAGGC 350 

ACCCGTCGAC GAGTGCAAAG ACAAGGACAT GACGTACGCG GCTCCACTGT 400 

TCGTCACCGC CGAGTTCATC AACAACAACA CCGGTGAGAT CAAGAGTCAG 450 

ACGGTGTTCA TGGGTGACTT CCCGATGATG ACCGAGAAGG GCACGTTCAT 500 

CATCAACGGG ACCGAGCGTG TGGTGGTCAG CCAGCTGGTG CGGTCGCCCG 550 

GGGTGTACTT CGACGAGACC ATTGACAAGT CCACCGACAA GACGCTGCAC 600 

AGCGTCAAGG TGATCCCGAG CCGCGGCGCG TGGCTCGAGT TTGACGTCGA 650 

CAAGCGCGAC ACCGTCGGCG TGCGCATCGA CCGCAAACGC CGGCAACCGG 700 
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TCACCGTGCT GCTCAAGGCG CTGGGCTGGA CCAGCGAGCA GATTGTCGAG 750 

CGGTTCGGGT TCTCCGAGAT CATGCGATCG ACGCTGGAGA AGGACAACAC 800 

CGTCGGCACC GACGAGGCGC TGTTGGACAT CTACCGCAAG CTGCGTCCGG 850 

GCGAGCCCCC GACCAAAGAG TCAGCGCAGA CGCTGTTGGA AAACTTGTTC 900 

TTCAAGGAGA AGCGCTACGA CCTGGCCCGC GTCGGTCGCT ATAAGGTCAA 950 

CAAGAAGCTC GGGCTGCATG TCGGCGAGCC CATCACGTCG TCGACGCTGA 1000 

CCGAAGAAGA CGTCGTGGCC ACCATCGAAT ATCTGGTCCG CTTGCACGAG 1050 

GGTCAGACCA CGATGACCGT TCCGGGCGGC GTCGAGGTGC CGGTGGAAAC 1100 

CGACGACATC GACCACTTCG GCAACCGCCG CCTGCGTACG GTCGGCGAGC 1150 

TGATCCAAAA CCAGATCCGG GTCGGCATGT CGCGGATGGA GCGGGTGGTC .12 00. 

cgggagcgga"tga"ccaccca"ggacgtggag GCGATCACAC CGCAGACGTT 1250 

GATCAACATC CGGCCGGTGG TCGCCGCGAT CAAGGAGTTC TTCGGCACCA 1300 

GCCAGCTGAG CCAATTCATG GACCAGAACA ACCCGCTGTC GGGGTTGACC 1350 

CACAAGCGCC GACTGTCGGC GCTGGGGCCC GGCGGTCTGT CACGTGAGCG 14 00 

TGCCGGGCTG GAGGTCCGCG ACGTGCACCC GTCGCACTAC GGCCGGATGT 14 50 

GCCCGATCGA AACCCCTGAG GGGCCCAACA TCGGTCTGAT CGGCTCGCTG 1500 

TCGGTGTACG CGCGGGTCAA. CCCGTTCGGG TTCATCGAAA CGCCGTACCG 1550 

CAAGGTGGTC GACGGCGTGG TTAGCGACGA GATCGTGTAC CTGACCGCCG 1600 

ACGAGGAGGA CCGCCACGTG GTGGCACAGG CCAATTCGCC GATCGATGCG 1650 

GACGGTCGCT TCGTCGAGCC GCGCGTGCTG GTCCGCCGCA AGGCGGGCGA 1700 

GGTGGAGTAC GTGCCCTCGT CTGAGGTGGA CTACATGGAC GTCTCGCCCC 1750 

GCCAGATGGT GTCGGTGGCC ACCGCGATGA TTCCCTTCCT GGAGCACGAC 180 0 

GACGCCAACC GTGCCCTCAT GGGGGCAAAC ATGCAGCGCC AGGCGGTGCC 1850 

GCTGGTCCGT AGCGAGGCCC CGCTGGTGGG CACCGGGATG GAGCTGCGCG 1900 

CGGCGATCGA CGCGGCGACG TCGTCGTCGC AAGAAAGCGG CGTCATCGAG 1950 

GAGGTGTCGG CCGACTACAT CACTGTGATG CACGACAACG GCACCCGGCG 2000 

TACCTACCGG ATGCGCAAGT TTGCCCGGTC CAACCACGGC ACTTGCGCCA 2 050 

ACCAGTGCCC CATCGTGGAC GCGGGCGACC GAGTCGAGGC CGGTCAGGTG 2100 

ATCGCCGACG GTCCCTGTAC TGACGACGGC GAGATGGCGC TGGGCAAGAA 2150 

CCTGCTGGTG GCCATCATGC CGTGGGAGGG CCACAACTAC GAGGACGCGA 2200 

TCATCCTGTC CAACCGCCTG GTCGAAGAGG ACGTGCTCAC CTCGATCCAC 2250 

ATCGAGGAGC ATGAGATCGA TGCTCGCGAC ACCAAGCTGG GTGCGGAGGA 23 0 0 

GATCACCCGC GACATCCCGA ACATCTCCGA CGAGGTGCTC GCCGACCTGG 2350 

ATGAGCGGGG CATCGTGCGC ATCGGTGCCG AGGTTCGCGA CGGGGACATC 24 0 0 

CTGGTCGGCA AGGTCACCCC GAAGGGTGAG ACCGAGCTGA CGCCGGAGGA 24 50 

GCGGCTGCTG CGTGCCATCT TCGGTGAGAA GGCCCGCGAG GTGCGCGACA 2500 

CTTCGCTGAA GGTGCCGCAC GGCGAATCCG GCAAGGTGAT CGGCATTCGG 2550 

GTGTTTTCCC GCGAGGACGA GGACGAGTTG CCGGCCGGTG TCAACGAGCT 26 00 

GGTGCGTGTG TATGTGGCTC AGAAACGCAA GATCTCCGAC GGTGACAAGC 2650 

TGGCCGGCCG GCACGGCAAC AAGGGCGTGA TCGGCAAGAT CCTGCCGGTT 2700 

GAGGACATGC CGTTCCTTGC CGACGGCACC CCGGTGGACA TTATTTTGAA 2750 

CACCCACGGC GTGCCGCGAC GGATGAACAT CGGCCAGATT TTGGAGACCC 2800 

ACCTGGGTTG GTGTGCCCAC AGCGGCTGGA AGGTCGACGC CGCCAAGGGG 2850 

GTTCCGGACT GGGCCGCCAG GCTGCCCGAC GAACTGCTCG AGGCGCATGC 2900 

GAACGCCATT GTGTCGACGC CGGTGTTCGA CGGCGCCCAG GAGGCCGAGC 2950 

TGCAGGGCCT GTTGTCGTGC ACGCTGCCCA ACCGCGACGG TGACGTGCTG 3000 

GTCGACGCCG ACGGCAAGGC CATGCTCTTC GACGGGCGCA GCGGCGAGCC 3 050 

GTTCCCGTAC CCGGTCACGG TTGGCTACAT GTACATCATG AAGCTGCACC 3100 

ACCTGGTGGA CGACAAGATC CACGCCCGCT CCACCGGGCC GTACTCGATG 3150 

ATCACCCAGC AGCCGCTGGG CGGTAAGGCG CAGTTCGGTG GCCAGCGGTT 32 00 

CGGGGAGATG GAGTGCTGGG CCATGCAGGC CTACGGTGCT GCCTACACCC 3250 

TGCAGGAGCT GTTGACCATC AAGTCCGATG ACACCGTCGG CCGCGTCAAG 3300 

GTGTACGAGG CGATCGTCAA GGGTGAGAAC ATCCCGGAGC CGGGCATCCC 3 350 

CGAGTCGTTC AAGGTGCTGC TCAAAGAACT GCAGTCGCTG TGCCTCAACG 34 00 

TCGAGGTGCT ATCGAGTGAC GGTGCGGCGA TCGAACTGCG CGAAGGTGAG 34 50 

GACGAGGACC TGGAGCGGGC CGCGGCCAAC CTGGGAATCA ATCTGTCCCG 3500 

CAACGAATCC GCAAGTTTCG AGGATCTTGC GTAA 3534 
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2) INFORMATION FOR SEQ ID NO: 2073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2073 
CCGAGCAACA TGATTGAACC ATCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2074 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2074 
CCGAGCAACA TGATTGAAGC TTCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2075 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2075 
CCGAGCCAGG TTCTGAAGTC TCTGCATTAT TAGGTGCTCG G 41 



2) INFORMATION FOR SEQ ID NO: 2076 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 

CCGAGCYGAY AACATTTTCA GATTCACCCA 



PCT/CAOO/01150 
V. 

ID NO: 2076 

RGCGCTCGG 39 



2) INFORMATION FOR SEQ ID NO: 2077 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY:, Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2077 
CCGAGCAACC GATCCAGCTC CAGCTACGCT CGG 33 



2) INFORMATION FOR SEQ ID NO: 2078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2078 
CCGAGCCTTG GTCTTCGGCC AAATGAACGC TCGG 34 



2 ) INFORMATION FOR SEQ ID NO: 207 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2079 
CCGAGCGTTC AGTTACTTCA GTCCAAGCCG GCTCGG 36 
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2) INFORMATION FOR SEQ ID NO: 2080 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2080 
CCGAGCCGAA GAGGGCCAAG ATGTCGCTCG G 



2) INFORMATION FOR SEQ ID NO: 2081 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2081 
GRATYRTYAA AGTTGGTGAG GAAG 



2) INFORMATION FOR SEQ ID NO: 2082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2082 
CMACTTCATC YCGCTTCGTA CC 



2) INFORMATION FOR SEQ ID NO: 2083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2083 
CCGCCGATGT TCCGTAAATT ACTTGAIGAA GGTCGAGCCG GCGG 44 



2) INFORMATION FOR SEQ ID NO: 2084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY :. Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2084 
CACGCGTCAA CACCCGTACA AGTCGTCTTT TGCGCGTG 38 



2) INFORMATION FOR SEQ ID NO: 2085 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2085 
CAAACTAAAG AACATATCTT GCTA 24 



2) INFORMATION FOR SEQ ID NO: 2086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2086 
ATATAATTTG CATCACCTTC AAG 23 



1079 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



2) INFORMATION FOR SEQ ID NO: 2087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2087 
TCAGCTCGTG GGATTAGGAG AG 



2) INFORMATION FOR SEQ ID NO: 2088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2088 
AGGCTTCACG CTGTTAGGCT GA 



2) INFORMATION FOR SEQ ID NO: 2089 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2089 
ATGCTGAACT TATTGACCTT 



2) INFORMATION FOR SEQ ID NO: 2090 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2090 
CGTTACTGGA GTCGAAATG 



2)~INFORMATION FOR SEQ ID NO: 2091 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2091 
CGCGACTTGA GATGGAACTT AGTGAGCTTC TTGGTCGCG 



2) INFORMATION FOR SEQ ID NO: 2092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2092 
CGCGACGAAA GAACTTCCTG AAGGTCGTGC AGGTCCAG 



2) INFORMATION FOR SEQ ID NO: 2093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2093 
TGTTGGCAAT CGAAGACACC 
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2) INFORMATION FOR SEQ ID NO: 2094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(P) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2094 

TTCAATTTCT TGACCTACTT TCAA 24 
2) INFORMATION FOR SEQ ID NO: 2095 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2095 
CGGTCGGGTT GAACGTGG 18 



2) INFORMATION FOR SEQ ID NO: 2096 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2096 
CGCGACCGGT ACCACGGCCA GTAATCGTGT CGCG 34 
2) INFORMATION FOR SEQ ID NO: 2097 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Mycoplasma pneumoniae 

(B) STRAIN: ATCC 29342 

(C) ACCESSION NUMBER: AE000019 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2097 

ATGGCAAGAG AGAAATTTGA CCGATCTAAA CCCCACGTTA ATGTAGGTAC 50 

TATTGGCCAC ATTGACCACG GTAAAACAAC TTTAACAGCA GCTATTTGTA 100 

CTGTATTAGC AAAAGAAGGT AAATCAGCTG CTACTCGTTA CGACCAAATC 150 

GATAAGGCTC CGGAAGAAAA AGCACGGGGA ATTACGATTA ACTCCGCTCA 200 

CGTGGAGTAC TCCTCTGACA AGCGTCACTA TGCTCACGTT GACTGTCCAG 250 

GACACGCTGA CTACATTAAG AACATGATTA CTGGTGCTGC ACAAATGGAT 300 

GGTGCCATTC TAGTAGTTTC AGCAACTGAC AGTGTTATGC CCCAAACCCG 350 

TGAACACATT TTGTTGGCCC GCCAAGTGGG, TGTGCCACGC ATGGTAGTGT 400 

TCCTAAACAA GTGTGACATT GCAACTGATG AAGAAGTGCA AGAGTTAGTA 450 

GCAGAAGAGG TACGTGACTT ATTAACTTCT TACGGCTTTG ATGGCAAGAA 500 

CACCCCTATT ATTTATGGTT CTGCACTTAA AGCGCTTGAA GGTGATCCTA 550 

AGTGGGAAGC TAAGATCCAT GATTTAATGA ATGCAGTTGA TGAATGGATT 600 

CCAACTCCTG AACGTGAAGT GGACAAACCC TTCTTGTTGG CAATCGAAGA 650 

CACCATGACG ATTACTGGCC GTGGTACCGT GGTTACCGGT CGGGTTGAAC 700 

GTGGTGAATT GAAAGTAGGT CAAGAAATTG AAATCGTTGG TTTACGTCCA 750 

ATCCGTAAAG CAGTTGTTAC CGGAATCGAA ATGTTCAAAA AGGAACTTGA 800 

TTCAGCAATG GCTGGGGACA ACGCTGGGGT ATTACTCCGT GGTGTGGACC 850 

GTAAAGAAGT GGAACGTGGT CAAGTGTTAG CTAAACCAGG TTCGATTAAA 900 

CCGCACAAGA AATTTAAAGC GGAAATCTAT GCTTTAAAGA AGGAAGAAGG 950 

TGGTCGTCAC ACCGGTTTCT TAAACGGTTA CCGTCCCCAA TTCTACTTCC 1000 

GTACTACAGA CGTTACTGGT TCGATTTCCC TACCAGAAAA CACCGAAATG 1050 

GTGCTACCAG GTGACAATAC CTCGATTACA GTTGAACTAA TTGCACCAAT 1100 

TGCTTGTGAA AAAGGTAGTA AGTTCTCCAT CCGTGAAGGT GGTCGAACGG 1150 

TTGGTGCTGG TTCAGTCACG GAAGTGCTTG AATAG 1185 



2) INFORMATION FOR SEQ ID NO: 2098 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2098 
CTGAGTCACA CCGACAAACG TC 



2) INFORMATION FOR SEQ ID NO: 2099 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2099 
CCAGGACTGA ACGGGATACG AA 22 



2) INFORMATION FOR SEQ ID NO: 2100 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:. DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2100 
GCGAGACGAT AGGTTGTC 18 



2) INFORMATION FOR SEQ ID NO: 2101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2609 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37Rv 

(C) ACCESSION NUMBER: Z79701 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2101 

CAGCCCGCGA GCGTAACCTG GCTGCGATTT CCGGCGCGGA TTTTCGCAGT 50 

GCGGTTACGC TCGGAAAGCG CGGGCCTCGC CCACGCGGCG GATGATGTCA 100 

GCGGGGTGGT CCTCGGCGAC GACCCGGACC ACGATCCACC CGTAGCGGTG 150 

CTGGACTTTC TCGTGCCGGA GGATGTCTTT CCGGTAGTGG TAGCGACTGG 200 

TCAGATGGTG GTCGCCGTCA TACTCGGCCG CGACCTTGAT GTCTTGCCAG 250 

CCCATATCCA AATGGGCTTC CGCCCAGCCC CATTCGTTGC GCACCGCGAT 300 

CTGCGTCTGG GGGCGCGGAA AGCCGGCGCG GATCAACAAC AAGCGCAGCC 350 

AGGTTTCCTT GGGGGACTGG GCACCGCCGT CGACGAGGTC CAGAGCGGCT 400 

CTTGCGGCCT TCATGCCACG GCGGCCCCGA TAGCGCTCGA TCAGCGGCTC 450 

GACGTCGGCC ACCTTCAAAT CGGTGGCCTG TATCAGGGCG TCGACGGCCG 500 

CGACGGCGGG GTCCAATGGA AATCGACTGG TCAGGTCGAG CGCCGTTCGC 550 

TCCGGTGTGG TCACGCGCAT GCCCTCGATG ACGCAGATCT CGTCGGGCTC 600 

GATGCGCTCT TCCCAGACTT GCAGCCCCGG GGCACGGCGG CGGTTGGTGT 650 

CGATGATCGC GGCGGGAAGA TCCGCGTCGA TCCACTTGGC GCCATGGAAG 700 

GCAGAAGCCG AGTAGCCGGC CAGCACGCCG CGGCGGCGCG AGCGCAGCCA 750 

CAGCGCTTTT GCACGCAATT GCGCGGTCAG TTCCACACCC TGCGGCACGT 800 
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ACACGTCTTT ATGTAGCGCG ACATACCTGC 
AATACACCCG CAGCCAGGGC CTCGCTGCCC 
CGAAGTGTGC TGAGTCACAC CGACAAACGT 
GAAAGTTCCC GCCGGAAATC GCAGCCACGT 
TTTCGGCCCG GCCGCGGCGA GACGATAGGT 
CCACTGAAGG GGCCAAACCC CCATTCGTAT 
GGAGGAAACC GGGGGATCGG GCTGGCGATC 
CGGCCACAAG GTGGCCGTCA CCCACCGTGG 
TGTTTGGCGT CGAATGTGAC GTCACCGACA 
TTCACGGCGG TAGAAGAGCA CCAGGGTCCG 
CGCCGGCCTA TCCGCGGACG CATTCCTCAT 
TCGAGAAGGT CATCAACGCC AACCTCACCG 
CGGGCATCGC GCAGCATGCA GCGCAACAAA 
AGGTTCGGTC TCCGGCAGCT GGGGCATCGG, 
CCTCCAAGGC CGGAGTGATT GGCATGGCCC 
TCGAAGGCAA ACGTGACCGC GAATGTGGTG 
CGATATGACC CGCGCGCTGG ATGAGCGGAT 
TTATCCCAGC GAAGCGGGTC GGCACCCCCG 
AGCTTCCTGG CTTCCGAGGA TGCGAGCTAT 
GGTCGACGGC GGCATGGGTA TGGGCCACTG 
TGACAGGACT GCTGGACGGC AAACGGATTC 
GACTCGTCGA TCGCGTTTCA CATCGCACGG 
CCAGCTGGTG CTCACCGGGT TCGACCGGCT 
CCGACCGGCT GCCGGCAAAG GCCCCGCTGC 
GAGGAGCACC TGGCCAGCTT GGCCGGCCGG 
GGGCAACAAG CTCGACGGGG TGGTGCATTC 
CCGGGATGGG CATCAACCCG TTCTTCGACG 
AAGGGCATCC ACATCTCGGC GTATTCGTAT 
GCTGCCGATC ATGAACCCCG GAGGTTCCAT 
CGAGCCGGGC GATGCCGGCC TACAACTGGA 
TTGGAGTCGG TCAACAGGTT CGTGGCGCGC 
GCGTTCGAAT CTCGTTGCCG CAGGCCCTAT 
CGATCGTCGG CGGTGCGCTC GGCGAGGAGG 
CTCGAGGAGG GCTGGGATCA GCGCGCTCCG 
TGCGACGCCG GTCGCCAAGA CGGTGTGCGC 
CGGCGACCAC GGGTGACATC ATCTACGCCG 
TTGCTCTAG 



TGCGCAATTC 


GTAGGGCGTC 


850 


AGAAAGGGAT 


CCGTCATGGT 


900 


CACGAGCGTA 


ACCCCAGTGC 


950 


TACGCTCGTG 


GACATACCGA 


1000 


TGTCGGGGTG 


ACTGCCACAG 


1050 


CCCGTTCAGT 


CCTGGTTACC 


1100 


GCACAGCGGC 


TGGCTGCCGA 


1150 


ATCCGGAGCG 


CCAAAGGGGC 


1200 


GCGACGCCGT 


CGATCGCGCC 


1250 


GTCGAGGTGC 


TGGTGTCCAA 


1300 


GCGGATGACC 


GAGGAAAAGT 


1350 


GGGCGTTCCG 


GGTGGCTCAA 


1400 


TTCGGTCGAA 


TGATATTCAT 


1450 


CAACCAGGCC 


AACTACGCAG 


1500 


GCTCGATCGC 


CCGCGAGCTG 


1550 


GCCCCGGGCT 


ACATCGACAC 


1600 


TCAGCAGGGG 


GCGCTGCAAT 


1650 


CCGAGGTCGC 


CGGGGTGGTC 


1700 


ATCTCCGGTG 


CGGTCATCCC 


1750 


ACACAACACA 


AGGACGCACA 


1800 


TGGTTAGCGG 


AATCATCACC 


1850 


GTAGCCCAGG 


AGCAGGGCGC 


1900 


GCGGCTGATT 


CAGCGCATCA 


1950 


TCGAACTCGA 


CGTGCAAAAC 


2000 


GTGACCGAGG 


CGATCGGGGC 


2050 


GATTGGGTTC 


ATGCCGCAGA 


2100 


CGCCCTACGC 


GGATGTGTCC 


2150 


GCTTCGATGG 


CCAAGGCGCT 


2200 


CGTCGGCATG 


GACTTCGACC 


2250 


TGACGGTCGC 


CAAGAGCGCG 


2300 


GAGGCCGGCA 


AGTACGGTGT 


2350 


CCGGACGCTG 


GCGATGAGTG 


2400 


CCGGCGCCCA 


GATCCAGCTG 


2450 


ATCGGCTGGA 


ACATGAAGGA 


2500 


GCTGCTGTCT 


GACTGGCTGC 


2550 


ACGGCGGCGC 


GCACACCCAA 


2600 






2609 



2) INFORMATION FOR SEQ ID NO: 2102 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2102 
CACCTTCACC CTGACCGACG 20 



2) INFORMATION FOR SEQ ID NO: 2103 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: _ SEQ _ID NO: 2103.. . ... 

CGAACCAGCG GAAATAGTTG GAC 23 



2) INFORMATION FOR SEQ ID NO: 2104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2104 
CTGGGCATGG CICGAGTC 18 



2) INFORMATION FOR SEQ ID NO: 2105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37rv 

(C) ACCESSION NUMBER: U68480 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2105 

ATGACACAGT GCGCGAGCAG ACGCAAAAGC ACCCCAAATC GGGCGATTTT 50 

GGGGGCTTTT GCGTCTGCTC GCGGGACGCG CTGGGTGGCC ACCATCGCCG 100 

GGCTGATTGG CTTTGTGTTG TCGGTGGCGA CGCCGCTGCT GCCCGTCGTG 150 

CAGACCACCG CGATGCTCGA CTGGCCACAG CGGGGGCAAC TGGGCAGCGT 200 

GACCGCCCCG CTGATCTCGC TGACGCCGGT CGACTTTACC GCCACCGTGC 250 

CGTGCGACGT GGTGCGCGCC ATGCCACCCG CGGGCGGGGT GGTGCTGGGC 300 

ACCGCACCCA AGCAAGGCAA GGACGCCAAT TTGCAGGCGT TGTTCGTCGT 350 

CGTCAGCGCC CAGCGCGTGG ACGTCACCGA CCGCAACGTG GTGATCTTGT 4 00 

CCGTGCCGCG CGAGCAGGTG ACGTCCCCGC AGTGTCAACG CATCGAGGTC 4 50 
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ACCTCTACCC ACGCCGGCAC CTTCGCCAAC 
GTCGGGCGCG CCGCTGCGCA GCGGCTTCCC 
AGATTGTCGG GGTGTTCACC GACCTGACCG 
GCGGTCTCGG CGACCATCGA CACCCGGTTC 
GAAACTGCTG GCGATCATCG GGGCGATCGT 
TCGCGTTGTG GCGCCTGGAC CAGTTGGACG 
CTCCTCCTCA GGCCGTTCCG GCCTGCATCG 
GCTGATTCCG GCAAGCTGGC GCACCTTCAC 
TATTCGGCTT CCTGCTCTGG CATGTCATCG 
- GGCTACAT.CC TGGGCAT.GGC .CCGAGTCGCC 
CAACTATTTC CGCTGGTTCG GCAGCCCGGA 
ACAACCTGCT GGCGCTGATG ACCCATGTCA 
CGCCTGCCAG ACCTGGCCGC CGGGCTAGTG 
TGAGGTGCTG CCCCGCCTCG GGCCGGCGGT, 
ACTGGGCGGC GGCCATGGTC TTGCTGACCG 
GGCCTGCGGC CGGAGGGCAT CATCGCGCTC 
GCTGATCGAG CGGTCCATGC GGTACAGCCG 
CCGTCGTTAC CGCCGCATTC ACACTGGGTG 
GCGGTGGCCG CGCTGGTGGC CGGCGGCCGC 
GCGCCGYCAT CGCCTGGTCG GCACGTTGCC 
CCGCCGGCAC CGTCATCCTG ACCGTGGTGT 
ACGGTGTTGG AAGCCACCAG GGTTCGCGCC 
GTGGTATACC GAGAACCTGC GTTACTACTA 
ACGGTTCGCT GTCGCGGCGC TTCGGCTTTT 
TTCACCGCGG TGTTCATCAT GTTGCGGCGC 
CCGCGGACCG GCGTGGCGGC TGATGGGCGT 
TCCTGATGTT CACGCCCACC AAGTGGGTGC 
GCCGTAGGGG CGGCGATGGC CGCGCTGACG 
GGTGCTGCGC TGGTCGCGCA ACCGGATGGC 
TCCTGCTGGC GTTGTGTTGG GCCACCACCA 
AGCTACGGTG TGCCGTTCAA CAGCGCGATG 
AGTCAGCACA ATCTTTTTCG CCCTGTTTGC 
CCTGGCTGCA CTTCGCGCCC CGCGGCGCCG 
GCGCTGACGA CAGCCCCGGT ACCGATCGTG 
GTTCGTCGCG TCCATGGTGG CCGGGATCGT 
CCAACGGCTG GTCCAACGTG CGGGCGTTTG 
GACGACGTAC TCGTCGAGCC TGATACCAAT 
GGACGGCGAT TCGGGTTCTT GGGGCCCCTT 
ACCCGGTCGG CTTCACGCCC AACGGCGTAC 
GCGATCGTGA TGAAACCCAA CCAGCCCGGC 
GCCGACCAAG CTGACGAGTC CTGGCATCAA 
CCTATGGGCT CGATCCCGCC CGGGTACCGT 
GGCGCACAGC AACAGAGCAC ACTCGTCTCG 
GCCGGACGAC GGGCATCCGC TGGTCGTGGT 
CCGGCAACAG CGTGCTGCAC GGGTACACCC 
GAATACGCCA TGCCGGGACC CGGAGCGCTG 
GCCGGACGAC CTATACGGAG AGCAGCCCAA 
TCGCCCGAGC AAAGATGCCC GCCGATGCCG 
GAGGATCTGT CGCTGACACC GGAGGACTGG 
GGTACCGGAC CTGCGCTCAC TGCAGGAATA 
TGCTGCTGGA CTGGGCGGTC GGTTTGGCCT 
CTGCACGCCA ATGGCATCGC CGAAATCCCG 
CTACTCGGCT AAGAAGCTGG ACACCGACAC 
GCGGCCTGCT CGGGATCACC GACCTGTTGC 
ACCTACCTGT CCCGCGACTG GGCCCGCGAT 
CGACACCCTG GTCGATGCCC CTCCCGCCCA 
CCCGCAGCGG CCTGTGGTCA CCGGGCAAGA 
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TTCGTCGGGC 


TCAAGGACCC 


500 


CGACCCCAAC 


CTGCGCCCGC 


550 


GGCCCGCGCC 


GCCCGGGCTG 


600 


TCCACCCGGC 


CGACCACGCT 


650 


GGCCACCGTC 


GTCGCACTGA 


700 


GGCGGGGCTC 


AATTGCCCAG 


750 


TCGCCGGGCG 


GCATGCGCCG 


800 


CCTGACCGAC 


GCCGTGGTGA 


850 


GCGCGAATTC 


GTCGGACGAC 


900 


GACCACGCCG. 


GCTACATGTC 


950 


GGATCCCTTC 


GGCTGGTATT 


1000 


GCGACGCCAG 


TCTGTGGATG 


1050 


TGCTGGCTGC 


TGCTGTCGCG 


1100 


GGAGGCCAGC 


AAACCCGCCT 


1150 


CGTGGATGCC 


GTTCAACAAC 


1200 


GGCTCGCTGG 


TCACCTATGT 


1250 


GCTCACACCG 


GCGGCGCTGG 


1300 


TGCAGCCCAC 


CGGCCTGATC 


1350 


CCGATGCTGC 


GGATCTTGGT 


1400 


GTTGGTGTCG 


CCGATGCTGG 


1450 


TCGCCGACCA 


GACCCTGTCA 


1500 


AAAATCGGGC 


CGAGCCAGGC 


1550 


CCTCATCCTG 


CCCACCGTCG 


1600 


TGATCACCGC 


GCTATGCCTG 


1650 


AAGCGAATTC 


CCAGCGTGGC 


1700 


CATCTTCGGC 


ACCATGTTCT 


1750 


ACCACTTCGG 


GCTGTTCGCC 


1800 


ACGGTGTTGG 


TATCCCCATC 


1850 


GTTCCTGGCG 


GCGTTATTCT 


1900 


ACGGCTGGTG 


GTATGTCTCC 


1950 


CCGAAGATCG 


ACGGGATCAC 


2000 


GATCGCCGCC 


GGCTATGCGG 


2050 


GCGAAGGGCG 


GCTGATCCGC 


2100 


GCCGGTTTCA 


TGGCGGCGGT 


2150 


GCGACAGTAC 


CCGACCTACT 


2200 


TCGGCGGCTG 


CGGACTGGCC 


2250 


GCGGGTTTCA 


TGAAGCCGCT 


2300 


GGGCCCGCTG 


GGTGGAGTCA 


2350 


CGGAACACAC 


GGTGGCCGAG 


2400 


ACCGACTACG 


ACTGGGATGC 


2450 


TGGTTCTACG 


GTGCCGCTGC 


2500 


TGGCAGGCAC 


CTACACCACC 


2550 


GCGTGGTATC 


TCCTGCCTAA 


2600 


GACCGCCGCG 


GGCAAGATCG 


2650 


CCGGGCAGAC 


TGTGGTGCTC 


2700 


GTACCCGCCG 


GGCGGATGGT 


2750 


GGCGTGGCGC 


AACCTGCGCT 


2800 


TCGCGGTCCG 


GGTGGTGGCC 


2850 


ATCGCGGTGA 


CCCCGCCGCG 


2900 


TGTGGGCTCG 


ACGCAGCCGG 


2950 


TCCCGTGCCA 


GCAGCCGATG 


3000 


AAGTTCCGCA 


TCACACCGGA 


3050 


GTGGGAAGAC 


GGCACTAACG 


3100 


TGCGGGCCCA 


CGTCATGGCC 


3150 


TGGGGTTCCC 


TGCGCAAGTT 


3200 


GCTCGAGTTG 


GGCACCGCGA 


3250 


TCCGAATTGG 


TCCATAG 


3297 



SUBSTITUTE SHEET (RULE 26) 



WO 01/2J604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 2106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

- (D) - TOPOLOGY.: .Linear . .. _ 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2106 

GCGAGCAGAG CACGCCCTCC TCGCCGCTCG C 31 
2) INFORMATION FOR SEQ ID NO: 2107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2107 
GCGAGCTCCC CATCTCTGGT TGGCACGCTC GC 32 



2) INFORMATION FOR SEQ ID NO: 2108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2108 
GCGGGCAACT TCRTCAAGAA GGTTGGTTAC AACCCGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2109 
GCGGGCCCAA TCTCTGGTTG GAAYGGTGAC AAGCCCGC 

2) INFORMATION FOR SEQ ID NO: 2110 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2110 
GCGGGCCCTT AACGATTTCA GCGAATCTGG ATTCAGCCCG C 



2) INFORMATION FOR SEQ ID NO: 2111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2111 
GCGGGCATGA TTGAAGCCAC CACCAACGCT TCCTGGCCCG C 



2) INFORMATION FOR SEQ ID NO: 2112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2112 
GCGGGCTTGA TGAAGTTTTG GGTTTCCTTG ACAATTTGCC CGC 



2) INFORMATION FOR SEQ ID NO: 2113 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2113 
GCGGGCACAA GGGTTGGACT AAGGAAACCA AGGCAGCCCG" C 



2) INFORMATION FOR SEQ ID NO: 2114 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2114 
GCGGGCATCG ATGCTATTGA ACCACCTGTC AGACCGCCCG C 
2) INFORMATION FOR SEQ ID NO: 2115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2115 
GCGGGCTTGA TGATTTCCTC GAATCTAGAT TGGGCCCGC 



2) INFORMATION FOR SEQ ID NO: 2116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2116 

GCGGGCGGTA AGTCCACCGG TAAGACCTTG TTGGCCCGC 
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2) INFORMATION FOR SEQ ID NO: 2117 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
.(D). TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2117 
GCGGGCGACG CCATTGAGCC ACCTTCGAGA GCCCGC 



2) INFORMATION FOR SEQ ID NO: 2118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2118 
GCGGGCTCCT TGACAATTTC TTCGTATCTG TTCTTGGCCC GC 



2) INFORMATION FOR SEQ ID NO: 2119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2119 
GCGGGCTTAC AACCCTAAGG CTGTTCCATT CGTTGCCCGC 



2) INFORMATION FOR SEQ ID NO: 2120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2120 
GCGGGCAGAA ACCAAGGCTG GTAAGGTTAC CGGTAGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2121 
CGACCGCIAG CCGCACACCA AGTTCCGGTC G 31 
2 ) INFORMATION FOR SEQ ID NO: 2122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2122 
CCGAGCGAAT GTAGGAGTCC AGGGTCTCTG CTCGG 35 



2) INFORMATION FOR SEQ ID NO: 2123 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2123 
TCTACCACTG AAGCATTAC 19 
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2) INFORMATION FOR SEQ ID NO: 2124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2124 
TAGGTACTGT AGGTTTATTG 20 



2) INFORMATION FOR SEQ ID NO: 2125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2125 
CACGCGGATT TTGAATCTCT TCCTCTAGTA GCGCGTG 37 



2) INFORMATION FOR SEQ ID NO: 2126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2126 
ATATCAGAGA CTGATGAG . 18 



2) INFORMATION FOR SEQ ID NO: 2127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: 'Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2127 
TAGCATATTC AGAGAATATT GT 



2) INFORMATION FOR SEQ ID NO: 2128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2128 
CAACGCTGGA GAATCTATAT TTGTAGAAAC TGCGTTG 
2) INFORMATION FOR SEQ ID NO: 2129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X51797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2129 

ATGTCTTTAA TATCTAAAGA AGAGTTAATA AAACTCGCAT ATAGCATTAG 50 

ACCAAGAGAA AATGAGTATA AAACTATACT AACTAATTTA GACGAATATA 100 

ATAAGTTAAC TACAAACAAT AATGAAAATA AATATTTGCA ATTAAAAAAA 150 

CTAAATGAAT CAATTGATGT TTTTATGAAT AAATATAAAA CTTCAAGCAG 200 

AAATAGAGCA CTCTCTAATC TAAAAAAAGA TATATTAAAA GAAGTAATTC 250 

TTATTAAAAA TTCCAATACA AGCCCTGTAG AAAAAAATTT ACATTTTGTA 300 

TGGATAGGTG GAGAAGTCAG TGATATTGCT CTTGAATACA TAAAACAATG 350 

GGCTGATATT AATGCAGAAT ATAATATTAA ACTGTGGTAT GATAGTGAAG 4 00 

CATTCTTAGT AAATACACTA AAAAAGGCTA TAGTTGAATC TTCTACCACT 4 50 

GAAGCATTAC AGCTACTAGA GGAAGAGATT CAAAATCCTC AATTTGATAA 500 

TATGAAATTT TACAAAAAAA GGATGGAATT TATATATGAT AGACAAAAAA 550 

GGTTTATAAA TTATTATAAA TCTCAAATCA ATAAACCTAC AGTACCTACA 600 

ATAGATGATA TTATAAAGTC TCATCTAGTA TCTGAATATA ATAGAGATGA 650 

AACTGTATTA GAATCATATA GAACAAATTC TTTGAGAAAA ATAAATAGTA 700 

ATCATGGGAT AGATATCAGG GCTAATAGTT TGTTTACAGA ACAAGAGTTA 750 

TTAAATATTT ATAGTCAGGA GTTGTTAAAT CGTGGAAATT TAGCTGCAGC 8 00 
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ATCTGACATA GTAAGATTAT TAGCCCTAAA AAATTTTGGC GGAGTATATT ^850 

TAGATGTTGA TATGCTTCCA GGTATTCACT CTGATTTATT TAAAACAATA 900 

TCTAGACCTA GCTCTATTGG ACTAGACCGT TGGGAAATGA TAAAATTAGA 950 

GGCTATTATG AAGTATAAAA AATATATAAA TAATTATACA TCAGAAAACT 1000 

TTGATAAACT TGATCAACAA TTAAAAGATA ATTTTAAACT CATTATAGAA 1050 

AGTAAAAGTG AAAAATCTGA GATATTTTCT AAATTAGAAA ATTTAAATGT 1100 

ATCTGATCTT GAAATTAAAA TAGCTTTCGC TTTAGGCAGT GTTATAAATC 1150 

AAGCCTTGAT ATCAAAACAA GGTTCATATC TTACTAACCT AGTAATAGAA 1200 

CAAGTAAAAA ATAGATATCA ATTTTTAAAC CAACACCTTA ACCCAGCCAT 1250 

AGAGTCTGAT AATAACTTCA CAGATACTAC TAAAATTTTT CATGATTCAT 13 00 

TATTTAATTC AGCTACCGCA GAAAACTCTA TGTTTTTAAC AAAAATAGCA " 1350 

CCATACTTAC AAGTAGGTTT TATGCCAGAA GCTCGCTCCA CAATAAGTTT 14 00 

AAGTGGTCCA GGAGCTTATG CGTCAGCTTA CTATGATTTC ATAAATTTAC 14 50 

AAGAAAATAC TATAGAAAAA ACTTTAAAAG CATCAGATTT AATAGAATTT 1500 

AAATTCCCAG AAAATAATCT ATCTCAATTG' ACAGAACAAG AAATAAATAG 1550 

TCTATGGAGC TTTGATCAAG CAAGTGCAAA ATATCAATTT GAGAAATATG 16 00 

TAAGAGATTA TACTGGTGGA TCTCTTTCTG AAGACAATGG GGTAGACTTT 1650 

AATAAAAATA CTGCCCTCGA CAAAAACTAT TTATTAAATA ATAAAATTCC 1700 

ATCAAACAAT GTAGAAGAAG CTGGAAGTAA AAATTATGTT CATTATATCA 1750 

TACAGTTACA AGGAGATGAT ATAAGTTATG AAGCAACATG CAATTTATTT 1800 

TCTAAAAATC CTAAAAATAG TATTATTATA CAACGAAATA TGAATGAAAG 1850 

TGCAAAAAGC TACTTTTTAA GTGATGATGG AGAATCTATT TTAGAATTAA 1900 

ATAAATATAG GATACCTGAA AGATTAAAAA ATAAGGAAAA AGTAAAAGTA 1950 

ACCTTTATTG GACATGGTAA AGATGAATTC AACACAAGCG AATTTGCTAG 2000 

ATTAAGTGTA GATTCACTTT CCAATGAGAT AAGTTCATTT TTAGATACCA 2050 

TAAAATTAGA TATATCACCT AAAAATGTAG AAGTAAACTT ACTTGGATGT 2100 

AATATGTTTA GTTATGATTT TAATGTTGAA GAAACTTATC CTGGGAAGTT 2150 

GCTATTAAGT ATTATGGACA AAATTACTTC CACTTTACCT GATGTAAATA 2200 

AAAATTCTAT TACTATAGGA GCAAATCAAT ATGAAGTAAG AATTAATAGT 2250 

GAGGGAAGAA AAGAACTTCT GGCTCACTCA GGTAAATGGA TAAATAAAGA 2300 

AGAAGCTATT ATGAGCGATT TATCTAGTAA AGAATACATT TTTTTTGATT 23 50 

CTATAGATAA TAAGCTAAAA GCAAAGTCCA AGAATATTCC AGGATTAGCA 24 00 

TCAATATCAG AAGATATAAA AACATTATTA CTTGATGCAA GTGTTAGTCC 24 50 

TGATACAAAA TTTATTTTAA ATAATCTTAA GCTTAATATT GAATCTTCTA 2500 

TTGGGGATTA CATTTATTAT GAAAAATTAG AGCCTGTTAA AAATATAATT 2550 

CACAATTCTA TAGATGATTT AATAGATGAG TTCAATCTAC TTGAAAATGT 26 00 

ATCTGATGAA TTATATGAAT TAAAAAAATT AAATAATCTA GATGAGAAGT 26 50 

ATTTAATATC TTTTGAAGAT ATCTCAAAAA ATAATTCAAC TTACTCTGTA 2700 

AGATTTATTA ACAAAAGTAA TGGTGAGTCA GTTTATGTAG AAACAGAAAA 2750 

AGAAATTTTT TCAAAATATA GCGAACATAT TACAAAAGAA ATAAGTACTA 28 0 0 

TAAAGAATAG TATAATTACA GATGTTAATG GTAATTTATT GGATAATATA 2850 

CAGTTAGATC ATACTTCTCA AGTTAATACA TTAAACGCAG CATTCTTTAT 2900 

TCAATCATTA ATAGATTATA GTAGCAATAA AGATGTACTG AATGATTTAA 2950 

GTACCTCAGT TAAGGTTCAA CTTTATGCTC AACTATTTAG TACAGGTTTA 3000 

AATACTATAT ATGACTCTAT CCAATTAGTA AATTTAATAT CAAATGCAGT 3050 

AAATGATACT ATAAATGTAC TACCTACAAT AACAGAGGGG ATACCTATTG 3100 

TATCTACTAT ATTAGACGGA ATAAACTTAG GTGCAGCAAT TAAGGAATTA 3150 

CTAGACGAAC ATGACCCATT ACTAAAAAAA GAATTAGAAG CTAAGGTGGG 3200 

TGTTTTAGCA ATAAATATGT CATTATCTAT AGCTGCAACT GTAGCTTCAA 3250 

TTGTTGGAAT AGGTGCTGAA GTTACTATTT TCTTATTACC TATAGCTGGT 33 00 

ATATCTGCAG GAATACCTTC ATTAGTTAAT AATGAATTAA TATTGCATGA 3350 

TAAGGCAACT TCAGTGGTAA ACTATTTTAA TCATTTGTCT GAATCTAAAA 3400 

AATATGGCCC TCTTAAAACA GAAGATGATA AAATTTTAGT TCCTATTGAT 3450 

GATTTAGTAA TATCAGAAAT AGATTTTAAT AATAATTCGA TAAAACTAGG 3500 

AACATGTAAT ATATTAGCAA TGGAGGGGGG ATCAGGACAC ACAGTGACTG 3550 

GTAATATAGA TCACTTTTTC TCATCTCCAT CTATAAGTTC TCATATTCCT 3600 

TCATTATCAA TTTATTCTGC AATAGGTATA GAAACAGAAA ATCTAGATTT 3650 
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TTCAAAAAAA ATAATGATGT TACCTAATGC TCCTTCAAGA GTGTTTTGGT 3 700 

GGGAAACTGG AGCAGTTCCA GGTTTAAGAT CATTGGAAAA TGACGGAACT 3750 

AGATTACTTG ATTCAATAAG AGATTTATAC CCAGGTAAAT TTTACTGGAG 3800 

ATTCTATGCT TTTTTCGATT ATGCAATAAC TACATTAAAA CCAGTTTATG 3850 

AAGACACTAA TATTAAAATT AAACTAGATA AAGATACTAG AAACTTCATA 3900 

ATGCCAACTA TAACTACTAA CGAAATTAGA AACAAATTAT CTTATTCATT 3950 

TGATGGAGCA GGAGGAACTT ACTCTTTATT ATTATCTTCA TATCCAATAT 4 000 

CAACGAATAT AAATTTATCT AAAGATGATT TATGGATATT TAATATTGAT 4 050 

AATGAAGTAA GAGAAATATC TATAGAAAAT GGTACTATTA AAAAAGGAAA 4100 

GTTAATAAAA GATGTTTTAA GTAAAATTGA TATAAATAAA AATAAACTTA 4150 

TTATAGGCAA TCAAACAATA GATTTTTCAG GCGATATAGA TAATAAAGAT "4200 

AGATATATAT TCTTGACTTG TGAGTTAGAT GATAAAATTA GTTTAATAAT 4250 

AGAAATAAAT CTTGTTGCAA AATCTTATAG TTTGTTATTG TCTGGGGATA 4300 

AAAATTATTT GATATCCAAT TTATCTAATA CTATTGAGAA AATCAATACT 4350 

TTAGGCCTAG ATAGTAAAAA TATAGCGTAC AATTACACTG ATGAATCTAA 44 00 

TAATAAATAT TTTGGAGCTA TATCTAAAAC AAGTCAAAAA AGCATAATAC 4450 

ATTATAAAAA AGACAGTAAA AATATATTAG AATTTTATAA TGACAGTACA 4 500 

TTAGAATTTA ACAGTAAAGA TTTTATTGCT GAAGATATAA ATGTATTTAT 4 550 

GAAAGATGAT ATTAATACTA TAACAGGAAA ATACTATGTT GATAATAATA 5000 

CTGATAAAAG TATAGATTTC TCTATTTCTT TAGTTAGTAA AAATCAAGTA 5050 

AAAGTAAATG GATTATATTT AAATGAATCC GTATACTCAT CTTACCTTGA 5100 

TTTTGTGAAA AATTCAGATG GACACCATAA TACTTCTAAT TTTATGAATT 5150 

TATTTTTGGA CAATATAAGT TTCTGGAAAT TGTTTGGGTT TGAAAATATA 5200 

AATTTTGTAA TCGATAAATA CTTTACCCTT GTTGGTAAAA CTAATCTTGG 5250 

ATATGTAGAA TTTATTTGTG ACAATAATAA AAATATAGAT ATATATTTTG 53 00 

GTGAATGGAA AACATCGTCA TCTAAAAGCA CTATATTTAG CGGAAATGGT 5350 

AGAAATGTTG TAGTAGAGCC TATATATAAT CCTGATACGG GTGAAGATAT 54 00 

ATCTACTTCA CTAGATTTTT CCTATGAACC TCTCTATGGA ATAGATAGAT 54 50 

ATATAAATAA AGTATTGATA GCACCTGATT TATATACAAG TTTAATAAAT 5500 

ATTAATACCA ATTATTATTC AAATGAGTAC TACCCTGAGA TTATAGTTCT 5550 

TAACCCAAAT ACATTCCACA AAAAAGTAAA TATAAATTTA GATAGTTCTT 5600 

CTTTTGAGTA TAAATGGTCT ACAGAAGGAA GTGACTTTAT TTTAGTTAGA 5650 

TACTTAGAAG AAAGTAATAA AAAAATATTA CAAAAAATAA GAATCAAAGG 5700 

TATCTTATCT AATACTCAAT CATTTAATAA AATGAGTATA GATTTTAAAG 5750 

ATATTAAAAA ACTATCATTA GGATATATAA TGAGTAATTT TAAATCATTT 5800 

AATTCTGAAA ATGAATTAGA TAGAGATCAT TTAGGATTTA AAATAATAGA 5850 

TAATAAAACT TATTACTATG ATGAAGATAG TAAATTAGTT AAAGGATTAA 5900 

GCAAGTAATT GACAATAATA AGTATTATTT CAATCCTGAC ACTGCTATCA 5950 

TCTCAAAAGG TTGGCAGACT GTTAATGGTA GTAGATACTA CTTTGATACT 6000 

GATACCGCTA TTGCCTTTAA TGGTTATAAA ACTATTGATG GTAAACACTT 6050 

TTATTTTGAT AGTGATTGTG TAGTGAAAAT AGGTGTGTTT AGTACCTCTA 6100 

ATGGATTTGA ATATTTTGCA CCTGCTAATA CTTATAATAA TAACATAGAA 6150 

GGTCAGGCTA TAGTTTATCA AAGTAAATTC TTAACTTTGA ATGGTAAAAA 62 00 

ATATTACTTT GATAATAACT CAAAAGCAGT TACCGGATTG CAAACTATTG 6250 

ATAGTAAAAA ATATTACTTT AATACTAACA CTGCTGAAGC AGCTACTGGA 6300 

TGGCAAACTA TTGATGGTAA AAAATATTAC TTTAATACTA ACACTGCTGA 6350 

AGCAGCTACT GGATGGCAAA CTATTGATGG TAAAAAATAT TACTTTAATA 6400 

CTAACACTGC TATAGCTTCA ACTGGTTATA CAATTATTAA TGGTAAACAT 6450 

TTTTATTTTA ATACTGATGG TATTATGCAG ATAGGAGTGT TTAAAGGACC 6500 

TAATGGATTT GAATATTTTG CACCTGCTAA TACGGATGCT AACAACATAG 6550 

AAGGTCAAGC TATACTTTAC CAAAATGAAT TCTTAACTTT GAATGGTAAA 6600 

AAATATTACT TTGGTAGTGA CTCAAAAGCA GTTACTGGAT GGAGAATTAT 6650 

TAACAATAAG AAATATTACT TTAATCCTAA TAATGCTATT GCTGCAATTC 6700 

ATCTATGCAC TATAAATAAT GACAAGTATT ACTTTAGTTA TGATGGAATT 6750 

CTTCAAAATG GATATATTAC TATTGAAAGA AATAATTTCT ATTTTGATGC 6800 

TAATAATGAA TCTAAAATGG TAACAGGAGT ATTTAAAGGA CCTAATGGAT 6850 

TTGAGTATTT TGCACCTGCT AATACTCACA ATAATAACAT AGAAGGTCAG 6 900 
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GCTATAGTTT ACCAGAACAA ATTCTTAACT 
TTTTGATAAT GACTCAAAAG CAGTTACTGG 
AAAAATATTA CTTTAATCTT AACACTGCTG 
ACTATTGATG GTAAAAAATA TTACTTTAAT 
TACTGGATGG CAAACTATTG ATGGTAAAAA 
CTTTCATAGC CTCAACTGGT TATACAAGTA 
TTTAATACTG ATGGTATTAT GCAGATAGGA 
ATTTGAATAC TTTGCACCTG CTAATACGGA 
AAGCTATACT TTACCAAAAT AAATTCTTAA 
TACTTTGGTA GTGACTCAAA AGCAGTTACC 
TAAAAAATAT TACTTTAATA CTAACACTGC 
AAACTATTAA TGGTAAAAAA TACTACTTTA 
TCAACTGGTT ATACAATTAT TAGTGGTAAA 
TGGTATTATG CAGATAGGAG TGTTTAAAGG 
TTGCACCTGC TAATACAGAT GCTAACAATA 
TATCAAAATA GATTCCTATA TTTACATGAC 
TAATTCAAAA GCGGCTACTG GTTGGGTAAC 
ACTTCGAGCC TAATACAGCT ATGGGTGCGA 
AATAAAAATT TTTACTTTAG AAATGGTTTA 
AGGGTCTAAT GGATTTGAAT ACTTTGCACC 
ATATAGAAGG TCAAGCTATA CGTTATCAAA 
GGAAAAATAT ATTACTTTGG TAATAATTCA 
AACTATTAAT GGTAAAGTAT ATTACTTTAT 
CAGCTGGTGG ACTTTTCGAG ATTGATGGTG 
GATGGAGTAA AAGCCCCTGG GATATATGGC 



TTGAATGGCA AAAAATATTA 6950 

ATGGCAAACC ATTGATGGTA 7000 

AAGCAGCTAC TGGATGGCAA 7050 

CTTAACACTG CTGAAGCAGC 7100 

ATATTACTTT AATACTAACA 7150 

TTAATGGTAA ACATTTTTAT 7200 

GTGTTTAAAG GACCTAATGG 7250 

TGCTAACAAC ATAGAAGGTC 7300 

CTTTGAATGG TAAAAAATAT 7350 

GGACTGCGAA CTATTGATGG 7400 

TGTTGCAGTT ACTGGATGGC 7450 

ATACTAACAC TTCTATAGCT 7500 

CATTTTTATT TTAATACTGA 7550 

ACCTGATGGA TTTGAATACT 7600 

TAGAAGGTCA AGCTATACGT 7650 

AATATATATT ATTTTGGTAA 7700 

TATTGATGGT AATAGATATT 7750 

ATGGTTATAA AACTATTGAT 7800 

CCTCAGATAG GAGTGTTTAA 7850 

TGCTAATACG GATGCTAACA 7 900 

ATAGATTCCT ACATTTACTT 7950 

AAAGCAGTTA CTGGATGGCA 8000 

GCCTGATACT GCTATGGCTG 8050 

TTATATATTT CTTTGGTGTT 8100 

TAA 8133 



2) INFORMATION FOR SEQ ID NO: 2130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X53138 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2130 

ATGAGTTTAG TTAATAGAAA ACAGTTAGAA AAAATGGCAA ATGTAAGATT 50 

TCGTACTCAA GAAGATGAAT ATGTTGCAAT ATTGGATGCT TTAGAAGAAT 100 

ATCATAATAT GTCAGAGAAT ACTGTAGTCG AAAAATATTT AAAATTAAAA 150 

GATATAAATA GTTTAACAGA TATTTATATA GATACATATA AAAAATCTGG 200 

TAGAAATAAA GCCTTAAAAA AATTTAAGGA ATATCTAGTT ACAGAAGTAT 250 

TAGAGCTAAA GAATAATAAT TTAACTCCAG TTGAGAAAAA TTTACATTTT 300 

GTTTGGATTG GAGGTCAAAT AAATGACACT GCTATTAATT ATATAAATCA 350 

ATGGAAAGAT GTAAATAGTG ATTATAATGT TAATGTTTTT TATGATAGTA 4 00 

ATGCATTTTT GATAAACACA TTGAAAAAAA CTGTAGTAGA ATCAGCAATA 450 

AATGATACAC TTGAATCATT TAGAGAAAAC TTAAATGACC CTAGATTTGA 500 

CTATAATAAA TTCTTCAGAA AACGTATGGA AATAATTTAT GATAAACAGA 550 

AAAATTTCAT AAACTACTAT AAAGCTCAAA GAGAAGAAAA TCCTGAACTT 600 
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ATAATTGATG ATATTGTAAA 
GATAGATGAA CTTAATACCT 
AGAATAGTGG AAATGATGTT 
TCATTCAACT TATATGAACA 
TGCTTCTGAC ATATTAAGAA 
ATTTAGATGT TGATATGTTA 
ATAGAGAAAC CTAGTTCAGT 
AGAAGCTATA ATGAAATACA 
ATTTTGACAT GTTAGACGAA 
GCTTCTAAGT CAGATAAATC 

GGCATCACCA CTAGAAGTTA 

ATCAAGGGCT AATTTCTGTG 
AAACAAATCG AGAATAGATA 
TATTAGCGAG GATAATGATT 
GTATAATGGC TGAAGCTAAT 
GGAAAGTATT TAAGAGTTGG 
CTTAAGTGGC CCTGAAGCAT 
TTAAAGAAGG CAGTATGAAT 
TTTGAAATCT CTAAAACTAA 
TAGCTTATGG TCATTTGACG 
ATAAAAGGAA TTATTTTGAA 
TTTTCTCAAA ATATAGTAGT 
TTCATTAGCA AGAAGTTCAG 
TACAAGGAGA TAAAATTAGT 
ACTCCTTATG ATAGTGTACT 
TGCATATTAT TATAATCCTG 
ATAAAATTCC AAGTATAATT 
ATTGGTCATG GTAAAGATGA 
TGTAGATTCA TTATCCACAG 
AGGATATTTC TCCTAAGTCA 
TTTAGCTACT CTATCAACGT 
TAAAGTTAAA GATAAAATAT 
CTATTATAGT AAGTGCAAAT 
AGAAGAGAAT TATTGGATCA 
TATTATAAAG GATATTTCAT 
AAAATAAAAT TACAGTAAAA 
TTACAAGAAA TTAGAAATAA 
AAAAGTAATG TTAACAGAAT 
CGCAAATTGT TGAGGAAAGG 
TCTATTAATT ATATAAAAGA 
TGCACTATGT GACTTAAAAC 
TATCTTTTGA GGACATATCA 
ATTAATAAAG AAACTGGAGA 
ATTCTCTGAA TATGCTAATC 
GTACTATATT TGATACTGTA 
GATACTACAC ACGAAGTAAA 
ATTAATAGAA TATAATAGTT 
CAATGAAAGT CCAAGTTTAC 
ATTACAGATG CAGCCAAAGT 
AACTATAGAC TTACTTCCTA 
CTATTATAGA TGGTGTAAGT 
ACGAGTGACC CATTATTAAG 
GGCAGTAAAT TTAACAACAG 
GGATAGCTAG TGGATTTAGT 
GCAGGTATAC CAAGCTTAGT 
AACAAAGGTT GTAGATTATT 
GAGTATTTAC TTTATTAGAT 



GACATATCTT TCAAATGAGT 
ATATTGAAGA ATCCTTAAAT 
AGAAACTTTG AAGAATTTAA 
AGAGTTGGTA GAAAGGTGGA 
TATCTGCATT AAAAGAAATT 
CCAGGAATAC AACCAGACTT 
AACAGTGGAT TTTTGGGAAA 
AAGAATATAT ACCAGAATAT 
GAAGTTCAAA GTAGTTTTGA 
AGAAATATTC TCATCACTTG 
AAATTGCATT TAATAGTAAG 
AAAGACTCAT ATTGTAGCAA 
TAAAATATTG AATAATAGTT 
TTAATACTAC, AACGAATACC 
gcagataatg' GTAGATTTAT 
TTTCTTCCCA GATGTTAAAA 
ATGCGGCAGC TTATCAAGAT 
ATCCATTTGA TAGAAGCTGA 
TATTTCTCAA TCAACTGAAC 
ATGCAAGAGC TAAAGCTCAA 
GGTTCTCTTG GTGAAGATGA 
TGACAAGGAG TATCTTTTAG 
AGAGAGGATA TATACACTAT 
TATGAAGCAG CATGTAACTT 
GTTTCAGAAA AATATAGAAG 
GAGATGGTGA AATACAAGAA 
TCTGATAGAC CTAAGATTAA 
ATTTAATACT GATATATTTG 
AAATAGAAGC AGCAATAGAT 
ATAGAAATAA ATTTATTAGG 
AGAGGAGACT TATCCTGGAA 
CAGAATTAAT GCCATCTATA 
CAATATGAAG TTAGAATAAA 
TTCTGGTGAA TGGATAAATA 
CAAAAGAATA TATATCATTT 
TCTAAAAATT TACCTGAGCT 
TTCTAATTCA AGTGATATTG 
GTGAGATAAA TGTTATTTCA 
ATTGAAGAAG CTAAGAATTT 
TGAATTTAAA CTAATAGAAT 
AACAGAATGA ATTAGAAGAT 
GAGACTGATG AGGGATTTAG 
ATCTATATTT GTAGAAACTG 
ATATAACTGA AGAGATTTCT 
AATGGTAAGT TAGTAAAAAA 
TACTTTAAAT GCTGCATTTT 
CTAAAGAATC TCTTAGTAAT 
GCTCAATTAT TTAGTACTGG 
TGTTGAATTA GTATCAACTG 
CATTATCTGA AGGATTACCT 
TTAGGTGCAG CAATCAAAGA 
ACAAGAAATA GAAGCTAAGA 
CTACAACTGC AATCATTACT 
ATACTTTTAG TTCCTTTAGC 
AAACAATGAA CTTGTACTTC 
TTAAACATGT TTCATTAGTT 
GATAAAATAA TGATGCCAGA 
1098 



ATTCAAAGGA 


650 


AAAATTACAC 


700 


AAATGGAGAG 


750 


ATTTAGCTGC 


800 


GGTGGTATGT 


850 


ATTTGAGTCT 


900 


TGACAAAGTT 


950 


ACCTCAGAAC 


1000 


ATCTGTTCTA 


1050 


GTGATATGGA 


1100 


GGTATTATAA 


1150 


TTTAATAGTA 


1200 


TAAATCCAGC 


1250 


TTTATTGATA 


1300 


GATGGAACTA 


1350 


CTACTATTAA 


1400 


TTATTAATGT 


1450 


TTTAAGAAAC 


1500 


AAGAAATGGC 


1550 


TTTGAAGAAT 


1600 


TAATCTTGAT 


1650 


AAAAAATATC 


1700 


ATTGTTCAGT 


1750 


ATTTGCAAAG 


1800 


ATTCAGAAAT 


1850 


ATAGACAAGT 


1900 


ATTAACATTT 


1950 


CAGGTTTTGA 


2000 


TTAGCTAAAG 


2050 


ATGTAATATG 


2100 


AATTATTACT 


2150 


AGTCAAGACT 


2200 


TAGTGAAGGA 


2250 


AAGAAGAAAG 


2300 


AATCCTAAAG 


2350 


ATCTACATTA 


2400 


AACTAGAAGA 


2450 


AATATAGATA 


2500 


AACTTCTGAC 


2550 


CTATTTCTGA 


2600 


TCTCATTTTA 


2650 


TATAAGATTT 


2700 


AAAAAACAAT 


2750 


AAGATAAAAG 


2800 


AGTAAATTTA 


2850 


TTATACAATC 


2900 


TTAAGTGTAG 


2950 


TTTAAATACT 


3000 


CATTAGATGA 


3050 


ATAATTGCAA 


3100 


GCTAAGTGAA 


3150 


TAGGTATAAT 


3200 


TCATCTTTGG 


3250 


AGGAATTTCA 


3300 


GAGATAAGGC 


3350 


GAAACTGAAG 


3400 


AGATGATTTA 


3450 
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GTGATATCAG AAATAGATTT TAATAATAAT TCAATAGTTT TAGGTAAATG 3500 

TGAAATCTGG AGAATGGAAG GTGGTTCAGG TCATACTGTA ACTGATGATA 3550 

TAGATCACTT CTTTTCAGCA CCATCAATAA CATATAGAGA GCCACACTTA 3600 

TCTATATATG ACGTATTGGA AGTACAAAAA GAAGAACTTG ATTTGTCAAA 3650 

AGATTTAATG GTATTACCTA ATGCTCCAAA TAGAGTATTT GCTTGGGAAA 3700 

CAGGATGGAC ACCAGGTTTA AGAAGCTTAG AAAATGATGG CACAAAACTG 3750 

TTAGACCGTA TAAGAGATAA CTATGAAGGT GAGTTTTATT GGAGATATTT 3800 

TGCTTTTATA GCTGATGCTT TAATAACAAC ATTAAAACCA AGATATGAAG 3850 

ATACTAATAT AAGAATAAAT TTAGATAGTA ATACTAGAAG TTTTATAGTT 3900 

CCAATAATAA.CTACAGAATA TATAAGAGAA AAATTATCAT ATTCTT.TCTA 3950 

TGGTTCAGGA GGAACTTATG CATTGTCTCT TTCTCAATAT AATATGGGTA 4000 

TAAATATAGA ATTAAGTGAA AGTGATGTTT GGATTATAGA TGTTGATAAT 4050 

GTTGTGAGAG ATGTAACTAT AGAATCTGAT AAAATTAAAA AAGGTGATTT 4100 

AATAGAAGGT ATTTTATCTA CACTAAGTAT TGAAGAGAAT AAAATTATCT 4150 

TAAATAGCCA TGAGATTAAT TTTTCTGGTG AGGTAAATGG AAGTAATGGA 4200 

TTTGTTTCTT TAACATTTTC AATTTTAGAA GGAATAAATG CAATTATAGA 4250 

AGTTGATTTA TTATCTAAAT CATATAAATT ACTTATTTCT GGCGAATTAA 4300 

AAATATTGAT GTTAAATTCA AATCATATTC AACAGAAAAT AGATTATATA 4350 

GGATTCAATA GCGAATTACA GAAAAATATA CCATATAGCT TTGTAGATAG 4400 

TGAAGGAAAA GAGAATGGTT TTATTAATGG TTCAACAAAA GAAGGTTTAT 4450 

TTGTATCTGA ATTACCTGAT GTAGTTCTTA TAAGTAAGGT TTATATGGAT 4500 

GATAGTAAGC CTTCATTTGG ATATTATAGT AATAATTTGA AAGATGTCAA 4550 

AGTTATAACT AAAGATAATG TTAATATATT AACAGGTTAT TATCTTAAGG 4600 

ATGATATAAA AATCTCTCTT TCTTTGACTC TACAAGATGA AAAAACTATA 4650 

AAGTTAAATA GTGTGCATTT AGATGAAAGT GGAGTAGCTG AGATTTTGAA 4700 

GTTCATGAAT AGAAAAGGTA ATACAAATAC TTCAGATTCT TTAATGAGCT 4750 

TTTTAGAAAG TATGAATATA AAAAGTATTT TCGTTAATTT CTTACAATCT 4800 

AATATTAAGT TTATATTAGA TGCTAATTTT ATAATAAGTG GTACTACTTC 4850 

TATTGGCCAA TTTGAGTTTA TTTGTGATGA AAATGATAAT ATACAACCAT 4900 

ATTTCATTAA GTTTAATACA CTAGAAACTA ATTATACTTT ATATGTAGGA 4950 

AATAGACAAA ATATGATAGT GGAACCAAAT TATGATTTAG ATGATTCTGG 5000 

AGATATATCT TCAACTGTTA TCAATTTCTC TCAAAAGTAT CTTTATGGAA 5050 

TAGACAGTTG TGTTAATAAA GTTGTAATTT CACCAAATAT TTATACAGAT 5100 

GAAATAAATA TAACGCCTGT ATATGAAACA AATAATACTT ATCCAGAAGT 5150 

TATTGTATTA GATGCAAATT ATATAAATGA AAAAATAAAT GTTAATATCA 5200 

ATGATCTATC TATACGATAT GTATGGAGTA ATGATGGTAA TGATTTTATT 5250 

CTTATGTCAA CTAGTGAAGA AAATAAGGTG TCACAAGTTA AAATAAGATT 53 00 

CGTTAATGTT TTTAAAGATA AGACTTTGGC AAATAAGCTA TCTTTTAACT 5350 

TTAGTGATAA ACAAGATGTA CCTGTAAGTG AAATAATCTT ATCATTTACA 54 00 

CCTTCATATT ATGAGGATGG ATTGATTGGC TATGATTTGG GTCTAGTTTC 54 50 

TTTATATAAT GAGAAATTTT ATATTAATAA CTTTGGAATG ATGGTATCTG 5500 

GATTAATATA TATTAATGAT TCATTATATT ATTTTAAACC ACCAGTAAAT 5550 

AATTTGATAA CTGGATTTGT GACTGTAGGC GATGATAAAT ACTACTTTAA 5600 

TCCAATTAAT GGTGGAGCTG CTTCAATTGG AGAGACAATA ATTGATGACA 5650 

AAAATTATTA TTTCAACCAA AGTGGAGTGT TACAAACAGG TGTATTTAGT 5700 

ACAGAAGATG GATTTAAATA TTTTGCCCCA GCTAATACAC TTGATGAAAA 5750 

CCTAGAAGGA GAAGCAATTG ATTTTACTGG AAAATTAATT ATTGACGAAA 5800 

ATATTTATTA TTTTGATGAT AATTATAGAG GAGCTGTAGA ATGGAAAGAA 5850 

TTAGATGGTG AAATGCACTA TTTTAGCCCA GAAACAGGTA AAGCTTTTAA 5900 

AGGTCTAAAT CAAATAGGTG ATTATAAATA CTATTTCAAT TCTGATGGAG 5950 

TTATGCAAAA AGGATTTGTT AGTATAAATG ATAATAAACA CTATTTTGAT 6000 

GATTCTGGTG TTATGAAAGT AGGTTACACT GAAATAGATG GCAAGCATTT 6050 

CTACTTTGCT GAAAACGGAG AAATGCAAAT AGGAGTATTT AATACAGAAG 6100 

ATGGATTTAA ATATTTTGCT CATCATAATG AAGATTTAGG AAATGAAGAA 6150 

GGTGAAGAAA TCTCATATTC TGGTATATTA AATTTCAATA ATAAAATTTA 6200 

CTATTTTGAT GATTCATTTA CAGCTGTAGT TGGATGGAAA GATTTAGAGG 6250 

ATGGTTCAAA GTATTATTTT GATGAAGATA CAGCAGAAGC ATATATAGGT 63 00 
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TTGTCATTAA TAAATGATGG TCAATATTAT 
GCAAGTTGGA TTTGTCACTA TAAATGATAA 
CTGGAATTAT AGAATCTGGA GTACAAAACA 
ATAGATGATA ATGGTATAGT TCAAATTGGT 
ATATAAATAT TTTGCACCTG CTAATACTGT 
AAGCAGTTGA ATATAGTGGT TTAGTTAGAG 
TTTGGAGAAA CATATACAAT TGAGACTGGA 
TGAAAGTGAT AAATATTATT TCAATCCAGA 
GTATTAATTT AATTGATGAT ATAAAATATT 
.ATGAGAACGG. GT.CTTATATC _ ATTTC AAAAT . 
GAATGGTGAA ATGCAATTTG GTTATATAAA 
ATTTTGGTGA AGATGGTGTC ATGCAGATTG 
GGATTTAAAT ACTTTGCACA TCAAAATACT 
AGAATCAATA AACTATACTG GTTGGTTAGA 
ATTTTACAGA TGAATATATT GCAGCAACTG 
GAGGAGTATT ATTTTGATCC TGATACAGCT 
G 
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TTTAATGATG ATGGAATTAT 6350 

AGTCTTCTAC TTCTCTGACT 64 00 

TAGATGACAA TTATTTCTAT 64 50 

GTATTTGATA CTTCAGATGG 6500 

AAATGATAAT ATTTACGGAC 6550 

TTGGGGAAGA TGTATATTAT 6600 

TGGATATATG ATATGGAAAA 6650 

AACTAAAAAA GCATGCAAAG 6700 

ATTTTGATGA GAAGGGCATA 6750 

AATAATTATT ACTTTAATGA 6800 

TATAGAAGAT AAGATGTTCT 6850 

GAGTATTTAA TACACCAGAT 6900 

TTGGATGAGA ATTTTGAGGG 6950 

TTTAGATGAA AAGAGATATT 7000 

GTTCAGTTAT TATTGATGGT 7050 

CAATTAGTGA TTAGTGAATA 7100 

7101 



2) INFORMATION FOR SEQ ID NO: 2131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2131 
AAGCMATTGT TGTAAATTTT GAAAG 25 



2) INFORMATION FOR SEQ ID NO: 2132 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2132 
TCATATCCAT AGCAATAGTT CTA 23 



2) INFORMATION FOR SEQ ID NO: 2133 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : • Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2133 
CCACGCACAW AAACTTGTTT TAGAAGTAGC AGCWCAGCGT GG 42 



2) INFORMATION FOR SEQ ID NO: 2134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2134 
CGCGTGAAGC TTCTGTG 17 



2) INFORMATION FOR SEQ ID NO: 2135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2135 
TCTCGCCGTT ATTCAGTTTC 20 



2) INFORMATION FOR SEQ ID NO: 2136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2136 
CCAACGCGTC CTCAATCATT TCTAACTTCT ATGGCCGGCG TTGG 44 
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2) INFORMATION FOR SEQ ID NO: 2137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2137 

ATGATCACCG GTGCTGCCCA GATGGACGGC GCGATCCTGG TTTGCTCGGC 50 

CGCCGATGGT CCGATGCCrC AAACCCGTGA GCACATCCTG CTGTCCCGTC 100 

AGGTAGGCGT TCCGTACATC GTGGTCTTCC TGAACAAGGC CGACCTGGTA 150 

GACGACGCTG AGCTGCTGGA ACTGGTCGAG ATGGAAGTTC GCGACCTGCT 200 

GTCCACCTAC GACTTCCCAG GCGACGACAC TCCGATCATC ATCGGTTCCG 250 

CTCGTATGGC CCTGGAAGGC AAAGACGACA ACGAAATGGG CACTACCGCT 300 

GTCAAGAAGC TGGTAGAAAC TCTGGATGCC TACATCCCTG AGCCAGTTCG 350 

TGCCATCGAC CAGCCGTTCC TGATGCCGAT CGAAGACGTG TTCTCGATCT 400 

CGGGTCGTGG TACCGTTGTT ACCGGTCGTA TCGAGCGTGG CATCGTTCGC 4 50 

GTTCAGGATC CGCTGGAAAT CGTTGGTCTG CGTGACACCA CCACCACCAC 500 

CTGCACCGGT GTTGAGATGT TCCGCAAGCT GCTGGACGAA GGTCGTGCTG 550 

GCGAGAACTG CGGCGTTCTG CTGCGTGGTA CCAAGCGTGA CGACGTTGAG 600 

CGTGGCCAGG TTCTGGTCAA GCCAGGTTCG GTCAAGCCGC ACACCAAGTT 650 

CACCGCAGAA GTCTACGTCC TGTCGAAGGA AGAAGGCGGC CGTCACACTC 700 

CGTTCTTCAA AGGCTACCGT CCACAGTTCT ACTTCCGTAC CACTGACGTG 750 

ACCGGTAACT GCGAACTGCC GGAAGGCGTT GAAATGGTAA TGCCAGGTGA 800 

CAACATTCAG ATGACTGTTA CCCTGATCAA GACCATCGCA ATGGAAGACG 850 

GTCTGCGCTT CGCTATCCGT GAAGGCGGTC GTACCGTCG 889 



2) INFORMATION FOR SEQ ID NO: 2138 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(C) ACCESSION NUMBER: Genome project 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2138 
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AACATGATCA CCGGTGCTGC CCAGATGGAC 
GGCTGCCGAC GGCCCCATGC CGCAGACCCG 
GCCAGGTAGG CGTTCCCTAC ATCGTCGTGT 
GTCGACGACG CCGAGCTGCT GGAACTGGTC 
GCTGAACACC TACGACTTCC CGGGCGACGA 
CCGCGCTGAT GGCGCTGGAA GGCAAGGATG 
GCCGTGCAGA AGCTGGTAGA GACCCTGGAC 
TCGTGCCATC GACCAGCCGT TCCTGATGCC 
TCTCCGGCCG CGGTACCGTG GTAACCGGTC 
AAGGTCCAGG AAGAAGTGGA AATCGTCGGC 
TACCTGCACC GGCGTTGAAA TGTTCCGCAA 
CTGGTGAGAA CGTTGGTATC CTGCTGCGTG 
GAGCGTGGCC AGGTTCTGGC CAAGCCGGGC 
GTTCGAGTGC GAAGTGTACG TGCTGTCCAA 
CCCCGTTCTT CAAGGGCTAC CGTCCGCAGT 
GTGACCGGTA ACTGCGAACT GCCGGAAGGC 
CGACAACATC AAGATGGTTG. TCACCCTGAT 
ATGGCCTGCG CTTCGCGATC CGCGAAGGCG 
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GGCGCGATCC 


TGGTTTGCTC 


50 


CGAGCACATC 


CTGCTGTCCC 


100 


TCCTGAACAA 


GGCCGACATG 


150 


GAGATGGAAG 


TTCGCGATCT 


200 


CACTCCGATC 


ATCATCGGTT 


250 


ACAACGGCAT 


CGGCGTAAGC 


300 


TCCTACATTC 


CGGAGCCGGT 


350 


GATCGAAGAC 


GTGTTCTCGA 


400 


GTGTAGAGCG 


CGGCATCATC 


450 


ATCAAGGCGA 


_CCACCAAGAC 


500 


GCTGCTCGAC 


GAAGGTCGTG 


550 


GCACCAAGCG 


TGAAGACGTA 


600 


ACCATCAAGC 


CGCACACCAA 


650 


GGAAGAAGGT 


GGTCGTCACA 


700 


TCTACTTCCG 


TACCACCGAC 


750 


GTAGAGATGG 


TAATGCCGGG 


800 


CGCTCCGATC 


GCCATGGAAG 


850 


GCCGTACCGT 


TGGCGCCGGC 


900 



2) INFORMATION FOR SEQ ID NO: 2139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1398 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni 

(B) STRAIN: NCTC 11168 

(C) ACCESSION NUMBER: AL139074 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2139 

ATGCAAGGAT TTATTTCACA GGTATTAGGT CCGGTTGTTG ATGTAGATTT 50 

TAACGACTAT TTGCCTCAAA TTAATGAAGC AATTGTTGTA AATTTTGAAA 100 

GCGAAGGAAA AAAACATAAA CTTGTTTTAG AAGTAGCAGC TCATTTAGGA 150 

GATAATAGAG TTAGAACTAT TGCTATGGAT ATGACAGATG GTTTGGTAAG 200 

GGGCTTAAAA GCTGAGGCTT TAGGTGCTCC CATTAGTGTT CCTGTTGGTG 250 

AGAAAGTTTT AGGAAGAATT TTCAATGTTA CTGGAGATTT GATCGATGAA 300 

GGTGAAGAAA TTTCTTTCGA TAAAAAATGG GCAATTCATA GAGATCCACC 350 

AGCTTTTGAA GATCAAAGCA CAAAAAGTGA GATTTTTGAA ACAGGGATTA 4 00 

AAGTTGTAGA TTTACTTGCT CCTTATGCAA AAGGTGGTAA AGTAGGTCTT 4 50 

TTTGGTGGTG CAGGTGTTGG TAAAACTGTT ATTATTATGG AGCTTATTCA 500 

CAATGTTGCA TTTAAGCATA GCGGCTATTC TGTATTTGCA GGTGTGGGTG 550 

AGAGAACTCG TGAAGGAAAT GACCTTTATA ATGAAATGAA AGAAAGTAAT 600 

GTTTTAGACA AAGTTGCTCT ATGTTATGGA CAAATGAATG AACCACCAGG 650 

GGCAAGAAAT CGTATTGCTT TAACAGGTTT AACAATGGCT GAGTATTTTA 700 

GAGATGAAAT GGGTCTTGAT GTGCTTATGT TTATTGATAA TATCTTTAGA 750 

TTTTCACAAT CAGGTTCTGA AATGTCAGCA CTTTTAGGAA GAATTCCATC 8 00 

AGCTGTGGGT TATCAACCAA CCTTAGCAAG TGAAATGGGT AAATTCCAAG 850 

AAAGAATTAC TTCAACTAAA AAAGGCTCAA TCACTTCAGT TCAAGCTGTT 900 

TACGTTCCAG CTGATGACTT AACAGACCCA GCTCCAGCAA CTGTTTTTGC 950 
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TCACTTAGAT GCTACAACGG 
TTTATCCTGC AGTTGACCCA 
AATATCATTG GAGAAGAACA 
GCTTCAAAAA TACAAAGATT 
ATGAGCTTAG CGAAGAGGAT 
GAAAAATTCT TATCACAACC 
CCCAGGAAAA TATATAAGCC 
TTTTAGAAGG TAAATATGAT 
GGAAATATAG ATGAAGCTAT 



TTTTAAATAG AGCTATTGCT 
CTTGACTCAA CTTCAAGAAT 
TTATAAAGTT GCTCGTGGTG 
TACAAGATAT CATCGCCATT 
AAACTTGTAG TTGAAAGAGC 
ATTTTTCGTT GCAGAAGTTT 
TTGAAGATAC AATAGCAGGA 
CATTTACCAG AAAATGCTTT 
TGCAAAAGCG GATAAATTAA 



GAAAAGGGTA 


1000 


GCTTGACCCA 


1050 


TTCAATCAGT 


1100 


TTAGGTATGG 


1150 


AAGAAAGATT 


1200 


TCACAGGTAG 


1250 


TTTAAGGGAA 


1300 


CTATATGGTT 


1350 


AAGGTTAA 


1398 



2) INFORMATION FOR SEQ ID NO: 2140 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 2157 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 670 

(C) ACCESSION NUMBER: X67867 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2140 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 20 0 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGCTCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCCTTATAAC 1200 

TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 1250 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 13 00 

TTGAGACTTT GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 1350 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 14 00 
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TAATACAACA GAATCTAATA 
CTGCTGCTTA TGCTGCCTTT 
TATATCAATA AGGTCGTCTT 
TGTAGGTACA CGAGCTATGA 
TGATGAAAAC TGTCTTGGCA 
TGGTTAGCGC AAGCTGGTAA 
AATTGAAAAA CACATCAAGA 
TTGTTGGTTA TACTCGTAAG 
AATCGTTTAA CTCCTATCGT 
TTATCGCTCA ATGATAACGT 
GGACGATGCC AGACGGACTT 
GGAGCTCGTT CTACGTGGAG 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCA 



AACAATACGG AGCAAGTAGT 
GCCAACGGTG GAATTTACCA 
CAGTGACGGT AGTAAAAAAG 
AAGAAACAAC TGCTTACATG 
TACGGAACTG GTCGTGGAGC 
GACAGGTACT TCTAACTACA 
ACACTGGCTA TGTAGCTCCA 
TATTCTATGG CTGTATGGAC 
TGGAGATGGT TTCCTAGTTG 
ATC_TATCAGA L AGATACTCAT 
TTCAGAAACG GGGAATTTGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCGATCCTAA 
TGATCAACAA AATCAGAATC 



GAAAAAATGG 


T4 50 


CAAACCCATG 


1500 


AATTTTCAGA 


1550 


ATGACCGAAA 


1600 


CTATCTCCCA 


1650 


CAGATGATGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTCAAAAAT 


1950 


CCCCATCAAC 


2000 


TCTAGCTCAA 


2050 


CAATAATACG 


2100 


CTCAACCAGC 


2150 




2157 



2) INFORMATION FOR SEQ ID NO: 2141 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2141 
CACCAGTCAC AGAAAAGC 18 



2) INFORMATION FOR SEQ ID NO: 2142 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2142 
GCCTTAATTT CGGATAGTGC 20 



2) INFORMATION FOR SEQ ID NO: 2143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2143 
GAGAAAGAGC CCAATTATCT AATGT 25 



2) INFORMATION FOR SEQ ID NO: 2144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3075 bases , 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: J2870 

(C) ACCESSION NUMBER: X75439 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2144 

TTGACAAAGA AATATTTAAA CACCCAGAAT GAAATATCAG CATTTTGGAA 
TACTCAAAAG ATATTTAAAA AATCAATTGA CAATAGAAAA GGACAGGAAA 
GTTTTGTTTT TTATGACGGC CCCCCAACTG CAAATGGCCT TCCTCATGCT 
GGCCATGTTC TTGGAAGAGT AATCAAGGAT TTAGTTGCAA GATTAAAAAC 
TATGCAAGGT TTTTATGTAG AAAGAAAAGC AGGATGGGAT ACCCATGGCT 
TACCAGTTGA ATTAGAGGTT GAAAAAAAAA TTGGAATTAA AGGAAAACAA 
GACATTGAAA AGTATGGAAT AGAAAATTTT ATAAATGAAT GTAAAAAAAG 
TGTATTTAAT TATGAAAAAG AATGGCGGGA TTTTTCTAAA GATTTAGGAT 
ACTGGGTTGA CATGGACTCC CCCTATATAA CTCTTGAGAA TAATTATATT 
GAAAGTGTAT GGAATATATT ATCTACATTC CATAAAAAAG GACTATTATA 
TAAGGGACAT AAGGTGACTC CTTATTGTAC ACATGATCAA ACCGCTTTAA 
GTTCTCATGA AGTAGCGCAA GGCTATAAAA ACGTTAAAGA TTTATCAGCT 
GTTGTTAAAT TTCAACTTAC AAATAGTAAA GATACTTATT TCTTAAGTTG 
GACTACCACT CCCTGGACTT TGCCTGCAAA TGTAGCATTA GCTATAAATA 
AAGATCTTAA TTATTCAAAA ATTCGGGTAG AAAATGAGTA TTATATCTTA 
GCTACAGATC TAATTAATTC TATAATAACT GAAAAATACG AAATTATTGA 
TACCTTTTCA GGAAGTAATT TAATTAATTT AAAATACATT CCTCCTTTTG 
AAAGCGACGG TTTAGTTAAT GCATATTACG TTGTTGATGG AGAATTTGTT 
ACTAACTCAG AAGGAACTGG TATTGTTCAT ATAGCACCAG CTCATGGGGA 
AGATGACTAC CAATTGGTTT TAGAGCGTGA TTTGGATTTC TTAAATGTTA 
TAACAAGAGA AGGAGTATAT AATGATAGGT TCCCTGAATT AGTTGGTAAT 
AAAGCTAAAA ATAGTGATAT AGAAATCATA AAATTATTAT CCAAAAAACA 
ACTTTTATAT AAAAAACAAA AATATGAGCA TAATTATCCT CATTGTTGGA 
GATGTGGTAA TCCTTTGATA TATTATGCGA TGGAAGGTTG GTTTATTAAA 
ACAACTAATT TTAAGAATGA AATTATTAAC AATAATAATA ATATAGAGTG 
GTTTCCTTCT CATATTAAGG AAGGGAGAAT GGGAAATTTC TTAGAAAATA 
TGGTTGATTG GAACATTGGT AGAAATAGAT ATTGGGGAAC ACCATTAAAT 
GTATGGATTT GCAATGATTG TAATCACGAA TACGCACCAA GTAGTATTAA 
GGATTTACAA AATAATTCCA TCAATAAAAT TGATGAAGAT ATTGAGTTGC 
ATAGACCTTA TGTTGATAAT ATCACTCTTA GTTGCCCTAA GTGTAATGGG 
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AAAATGTCTC GAGTAGAAGA 
TATGCCGTTT GCTCAGCATC 
ATCAACACTT TCCAGCTGAT 
GGCTGGTTTT ACAGTTTACT 
TTCTTATAAA CGTGCTTTAT 
AAAAAATGTC TAAAAGTAAA 
AATAAGTACG GAGCCGATTC 
TCCATGGAAT AACAAAAGAT 
CGAAATTTAT AGATACGCTT 
AATAAAATAG ATCACTATAA 
TACATTAGAT AATTGGGCTC 
GTAATATTTA TGTAAATAAT 
AACGAATATA CCAATACAAT 
ACGATTTTGG GAACAAGGAA 
CGCTTTATGA AATTTTAACA 
CCATTTATAT CTGAAAAAAT 
TTTACAAGAT TATCCACAAT 
AAGATGAAAT GCATACCGTT 
CGCAAAAATG CAGATTTAAA 
TAAACCTAAT AGTCAATTAA 
TAATAAAAGA CGAATTAAAT 
AATGACTATA TTACCTATGA 
AAAACTAGGG AACAAAACGA 
CAGAGTATGA TAAAAAAAGT 
TCTTCTGATG CTGAGTTAAC 
TAAGGATAGT TATCAACTCA 
ATAAAAATTT ATCTCCTGAA 
ATTAGATTAA TTCAACAATT 
ACGTATTGAT ATTTATATCG 
AAACCAATAA AAATATGTTT 
TTAAATGTTA TAGATGAATA 
AATAAAAATT TCCTTATTAT 



AGTAATCGAT GTTTGGTTTG 
ATTATCCTTT TGATAACCAG 
TTTATTGCAG AAGGAGTTGA 
AGTAATTTCT ACTATTCTAA 
CTTTAGGACA TATTCTAGAC 
GGAAACGTTA TTAATCCAAC 
TTTAAGATGG GCCTTAATTT 
TCTCAGAAAA TATAGTAGCT 
GATAATATTT ATAAATTTTA 
TCCTAATAAT GAAATTACAA 
TTTCTCGCTT AAACACCTTA 
TACGATTTCA CTTCCGCAGC 
AAGTAATTGG TATATCGGAG 
TTTCTAACGA TAAAAAAGAT 
ACTTTATCAA GACTAGTGGC 
CCATTATAAT TTGACTGGAA 
ATAAAGAAAG TTTTATTAAT 
ATAAAAATTG TAGAATTATC 
AATTAAGCAA CCTTTATCGA 
ACTTAAGTTT TTTACCTAAT 
ATAAAAAACA TTGAATTAAC 
GCTTAAATTG AATTTTTCTT 
AAAATATTCA AACATTGATA 
TTAATTGAGT CTAATAACTT 
TAAGGATGAT TTTATAATTA 
GTGAAGATAA TGACTGCGTT 
TTAATTCGCG AAGGACATGC 
AAGAAAAAAG AAAAATTTAC 
GTGTAACTGG GGAATTATTA 
AAAGAAAATT TCGTGATTAA 
TGAAAATACT ATTCATTTTA 
ATTAA 



ATAGCGGCTC 


1550 


AAAATTTTTA 


1600 


TCAAACGAGA 


1650 


AAGGAAAATC 


1700 


AGTAATGGTA 


1750 


TGAATTAATT 


1800 


CGGATAGTGC 


1850 


CAGACCAAAT 


1900 


TAATATGTAT 


1950 


AAAGTAGAAA 


2000 


ATAAAAGAAA 


2050 


CAGATTAATT 


2100 


ATTCGAGAGG 


2150 


GCGTACAATA 


2200 


TCCATTTGTT 


2250 


AAAGTGTGCA 


2300 


CAAGCATTGG 


2350 


TAGACAGGCT 


2400 


AAATGGTGAT 


2450 


TACTATTCAA 


2500 


TGATAATATT 


2550 


CTGTGGGACC 


2600 


GACTCCCTAT 


2650 


CAAAAGTTTA 


2700 


AAACCTTACC 


2750 


ATATTATTAG 


2800 


TAGAGAGCTC 


2850 


CAATAAATCA 


2900 


GAATCAATAA 


2950 


AAATATACAC 


3000 


ATAATAAAGA 


3050 




3075 



2) INFORMATION FOR SEQ ID NO: 2145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2145 
GCAAGATGTG GCGTGTTACG GT 22 



2) INFORMATION FOR SEQ ID NO: 2146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2146 
GGGGCGAAGA AGTTGTCCAT ATT 23 



2) INFORMATION FOR SEQ ID NO: 2147 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00622 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2147 

ATGGAGAAAA AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA 50 

TCGTAAAGAA CATTTTGAGG CATTTCAGTC AGTTGCTCAA TGTACCTATA 100 

ACCAGACCGT TCAGCTGGAT ATTACGGCCT TTTTAAAGAC CGTAAAGAAA 150 

AATAAGCACA AGTTTTATCC GGCCTTTATT CACATTCTTG CCCGCCTGAT 200 

GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT GAGCTGGTGA 250 

TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA 300 

ACGTTTTCAT CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT 350 

ACACATATAT TCGCAAGATG TGGCGTGTTA CGGTGAAAAC CTGGCCTATT 4 00 

TCCCTAAAGG GTTTATTGAG AATATGTTTT TCGTCTCAGC CAATCCCTGG 4 50 

GTGAGTTTCA CCAGTTTTGA TTTAAACGTG GCCAATATGG ACAACTTCTT 500 

CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC GACAAGGTGC 5 50 

TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT 6 00 

GTCGGCAGAA TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG 650 

CGGGGCGTAA 660 



2) INFORMATION FOR SEQ ID NO: 2148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2148 
CAGATTAAAT GCGGATTCAG CC 22 
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2) INFORMATION FOR SEQ ID NO: 2149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D). _ TOPOLOGY j. Linear . 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2149 
ATCAGGTAAA TCATCAGCGG ATA 



2) INFORMATION FOR SEQ ID NO: 2150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

(C) ACCESSION NUMBER: X537 96 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2150 

ATGAATTTTA CCCGGATTGA CCTGAATACC TGGAATCGCA GGGAACACTT 50 

TGCCCTTTAT CGTCAGCAGA TTAAATGCGG ATTCAGCCTG ACCACCAAAC 100 

TCGATATTAC CGCTTTGCGT ACCGCACTGG CGGAGACAGG TTATAAGTTT 150 

TATCCGCTGA TGATTTACCT GATCTCCCGG GCTGTTAATC AGTTTCCGGA 200 

GTTCCGGATG GCACTGAAAG ACAATGAACT TATTTACTGG GACCAGTCAG 250 

ACCCGGTCTT TACTGTCTTT CATAAAGAAA CCGAAACATT CTCTGCACTG 300 

TCCTGCCGTT ATTTTCCGGA TCTCAGTGAG TTTATGGCAG GTTATAATGC 350 

GGTAACGGCA GAATATCAGC ATGATACCAG ATTGTTTCCG CAGGGAAATT 400 

TACCGGAGAA TCACCTGAAT ATATCATCAT TACCGTGGGT GAGTTTTGAC 450 

GGATTTAACC TGAACATCAC CGGAAATGAT GATTATTTTG CCCCGGTTTT 500 

TACGATGGCA AAGTTTCAGC AGGAAGGTGA CCGCGTATTA TTACCTGTTT 550 

CTGTACAGGT TCATCATGCA GTATGTGATG GCTTTCATGC AGCACGGTTT 600 

ATTAATACAC TTCAGCTGAT GTGTGATAAC ATACTGAAAT AA 642 



2) INFORMATION FOR SEQ ID NO: 2151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2151 
ATATTTCAGC ATTACCTTGG GTT 23 



2) INFORMATION FOR SEQ ID NO: 2152 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2152 
TACACAACTC TTGTAGCCGA TTA 23 



2) INFORMATION FOR SEQ ID NO: 2153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexineri 
(C) ACCESSION NUMBER: X07848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2153 

ATGAACTATA CAAAATTTGA TGTAAAAAAT TGGGTTCGCC GTGAGCATTT 50 

TGAGTTTTAT CGGCATCGTT TACCATGTGG TTTTAGCTTA ACAAGCAAAA 100 

TTGATATCAC GACGTTAAAA AAGTCATTGG ATGATTCAGC GTATAAGTTT 150 

TATCCGGTAA TGATCTATCT GATTGCTCAG GCCGTGAATC AATTTGATGA 200 

GTTGAGAATG GCGATAAAAG ATGATGAATT GATCGTATGG GATTCAGTCG 250 

ACCCACAATT CACCGTATTC CATCAAGAAA CAGAGACATT TTCAGCACTG 300 

AGTTGCCCAT ACTCATCCGA TATTGATCAA TTTATGGTGA ATTATTTATC 350 

GGTAATGGAA. CGTTATAAAA GTGATACCAA GTTATTTCCT CAAGGGGTAA 4 00 

CACCAGAAAA TCATTTAAAT ATTTCAGCAT TACCTTGGGT TAATTTTGAT 450 

AGCTTTAATT TAAATGTTGC TAATTTTACC GATTATTTTG CACCCATTAT 500 

AACAATGGCA AAATATCAGC AAGAAGGGGA TAGACTGTTA TTGCCGCTCT 550 

CAGTACAGGT TCATCATGCA GTTTGTGATG GCTTCCATGT TGCACGCTTT 600 

ATTAATCGGC TACAAGAGTT GTGTAACAGT AAATTAAAAT AA 642 
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2) INFORMATION FOR SEQ ID NO: 2154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION:' SEQ ID NO: 2154 
CGCCATTCAG AGTTTAGGAC 20 

2) INFORMATION FOR SEQ ID NO: 2155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2155 
TTCCATACCG TTGCGTATCA CTT 23 



2) INFORMATION FOR SEQ ID NO: 2156 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: CP590 

(C) ACCESSION NUMBER: M74769 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2156 

ATGGTATTTG AAAAAATTGA TAAAAATAGT TGGAACAGAA AAGAGTATTT 50 

TGACCACTAC TTTGCAAGTG TACCTTGTAC ATACAGCATG ACCGTTAAAG 100 

TGGATATCAC ACAAATAAAG GAAAAGGGAA TGAAACTATA TCCTGCAATG 150 

CTTTATTATA TTGCAATGAT TGTAAACCGC CATTCAGAGT TTAGGACGGC 200 

AATCAATCAA GATGGTGAAT TGGGGATATA TGATGAGATG ATACCAAGCT 250 
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TGTAAGTCTG ACTTTAAATC ATTTTTAGCA 
ACGGTATGGA AACAATCATA GAATGGAAGG 
ACATTTTTAA TGTATCTATG ATACCGTGGT 
CTGAATTTGC AGAAAGGATA TGATTATTTG 
GAAATATTAT AAAGAAGATA ACAAAATTAT 
TTCATCACGC AGTATGTGAC GGATTTCACA 
TTGCAGGAAT TGATAAATAG TTAA 
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TTTCCAGCCT TTGGACTGAG 300 

GATTATGAAA GTGATACGCA 350 

AAAGCCAAAT GCTCCGGAAA 400 

CAACCTTCGA TGGCTTTAAT 450 

ATTCCTATTT TTACTATGGG 500 

ACTTCCTTTG GCAATTCAAG 550 

TTTGCCGTTT TGTAAACGAA 600 

624 



2) INFORMATION FOR SEQ ID NO: 2157 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2157 
CCACAGAAAT TGATATTAGT GTTTTAT 27 



2) INFORMATION FOR SEQ ID NO: 2158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2158 
TCGCTATTGT AACCAGTTCT A 21 



2) INFORMATION FOR SEQ ID NO: 2159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01277 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2159 

ATGAACTTTA ATAAAATTGA TTTAGACAAT TGGAAGAGAA AAGAGATATT 50 

TAATCATTAT TTGAACCAAC AAACGACTTT TAGTATAACC ACAGAAATTG 100 

ATATTAGTGT TTTATACCGA AACATAAAAC AAGAAGGATA TAAATTTTAC 150 

CCTGCATTTA TTTTCTTAGT GACAAGGGTG ATAAACTCAA ATACAGCTTT 200 

TAGAACTGGT TACAATAGCG ACGGAGAGTT AGGTTATTGG GATAAGTTAG 250 

AGCCACTTTA TACAATTTTT GATGGTGTAT CTAAAACATT CTCTGGTATT 300 

TGGACTCCTG TAAAGAATGA CTTCAAAGAG TTTTATGATT TATACCTTTC 350 

TGATGXAGAG_A^TATAATG GTTCGGGGAA ATTGTTTCCC AAAACACCTA 400 
TACCTGAAAA TGCTTTTTCT CTTTCTATTA TTCCATGGAC TTCATTTACT " 450 

GGGTTTAACT TAAATATCAA TAATAATAGT AATTACCTTC TACCCATTAT 500 

TACAGCAGGA AAATTCATTA ATAAAGGTAA TTCAATATAT TTACCGCTAT 550 

CTTTACAGGT ACATCATTCT GTTTGTGATG GTTATCATGC AGGATTGTTT 60 0 

ATGAACTCTA TTCAGGAATT GTCAGATAGG CCTAATGACT GGCTTTTATA 650 

A 651 



2) INFORMATION FOR SEQ ID NO: 2160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2160 
TTTTGAACAC TATTTTAACC AGC 23 



2) INFORMATION FOR SEQ ID NO: 2161 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2161 
GATTTAACTT ATCCCAATAA CCT 23 



2) INFORMATION FOR SEQ ID NO: 2162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: X02166 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2162 



ATGACTTTTA ATATTATCAA ATTAGAAAAT TGGGATAGAA AAGAATATTT 
TGAACACTAT TTTAACCAGC AAACTACGTA TAGCATTACT AAAGAAATTG 
ATATTACTTT GTTTAAAGAT ATGATAAAAA AGAAAGGATA TGAAATTTAT 
CCTTCTTTGA TTTATGCAAT TATGGAAGTT . GTAAATAAAA ATAAAGTGTT 
TAGAACAGGA ATTAATAGTG AGAATAAATT AGGTTATTGG GATAAGTTAA 
ATCCTTTGTA TACAGTTTTT AATAAGCAAA CTGAAAAATT TACTAACATT 
TGGACTGAAT CTGATAACAA CTTCACTTCT TTTTATAATA ATTATAAAAA 
TGACTTGCTT GAATATAAAG ATAAAGAAGA AATGTTTCCT AAAAAACCGA 
TACCTGAAAA CACCATACCG ATTTCAATGA TTCCTTGGAT TGATTTTAGT 
TCATTTAATT TAAACATTGG TAACAATAGC AACTTTTTAT TGCCTATTAT 
TACGATAGGT AAATTTTATA GTGAGAATAA TAAAATTTAT ATACCAGTTG 
CTTTGCAGCT TCATCATGCT GTATGTGATG GTTACCATGC TTCATTATTT 
ATGAATGAAT TTCAAGATAT AATTCATAAG GTAGATGATT GGATTTAG 

2) INFORMATION FOR SEQ ID NO: 2163 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2163 
ACCTTCATCC TACCGATGTG GGTT 24 
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2) INFORMATION FOR SEQ ID NO: 2164 . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2164 
CAACGACACC AGCACTGCCA TTG 23 



2) INFORMATION FOR SEQ ID NO: 2165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1215 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

. _(vi) .ORIGINAL SOURCE: _ _ _ _ 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: H3380 

(C) ACCESSION NUMBER: AF071555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2165 

ATGACCACCA CACGCCCCGC GTGGGCCTAT ACGCTGCCGG CAGCACTGCT 50 

GCTGATGGCT CCTTTCGACA TCCTCGCTTC ACTGGCGATG GATATTTATC 100 

TCCCTGTCGT TCCAGCGATG CCCGGCATCC TGAACACGAC GCCCGCTATG 150 

ATCCAACTCA CGTTGAGCCT CTATATGGTG ATGCTCGGCG TGGGCCAAGT 200 

GATTTTTGGT CCGCTCTCAG ACAGAATCGG GCGACGGCCA ATTCTACTTG 250 

CGGGCGCAAC GGCTTTCGTC ATTGCGTCTC TGGGAGCAGC TTGGTCTTCA 300 

ACTGCACCGG CCTTTGTCGC TTTCCGTCTA CTTCAAGCAG TGGGCGCGTC 350 

GGCCATGCTG GTGGCGACGT TCGCGACGGT TCGCGACGTT TATGCCAACC 4 00 

GTCCTGAGGG TGTCGTCATC TACGGCCTTT TCAGTTCGAT GCTGGCGTTC 450 

GTGCCTGCGC TCGGCCCTAT CGCCGGAGCA TTGATCGGCG AGTTCTTGGG 500 

ATGGCAGGCG ATATTCATTA CTTTGGCTAT ACTGGCGATG CTCGCACTCC 550 

TAAATGCGGG TTTCAGGTGG CACGAAACCC GCCCTCTGGA TCAAGTCAAG 600 

ACGCGCCGAT CTGTCTTGCC GATCTTCGCG AGTCCGGCTT TTTGGGTTTA 650 

CACTGTCGGC TTTAGCGCCG GTATGGGCAC CTTCTTCGTC TTCTTCTCGA 700 

CGGCTCCCCG TGTGCTCATA GGCCAAGCGG AATATTCCGA GATCGGATTC 750 

AGCTTTGCCT TCGCCACTGT CGCGCTTGTA ATGATCGTGA CAACCCGTTT 800 

CGCGAAGTCC TTTGTCGTCA GATGGGGCAT CGCAGGATGC GTGGCGCGTG 850 

GGATGGCGTT GCTTGTTTGC GGAGCGGTCC TGTTGGGGAT CGGCGAACTT 900 

TACGGCTCGC CGTCATTCCT CACCTTCATC CTACCGATGT GGGTTGTCGC 950 

GGTCGGTATT GTCTTCACGG TGTCCGTTAC CGCGAACGGC GCTTTGGCAG 1000 

AGTTCGACGA CATCGCGGGA TCAGCGGTCG CGTTCTACTT CTGCATCCAA 1050 

AGCCTGATAG TCAGTATCGT CGGGACATTG GCGGTGACGC TGTTAAACGG 1100 

CGATACAGCG TGGCCCGTGA TTTGTTACGC CACGGCAATG GCAGTGCTGG 1150 

TGTCGTTGGG GCTGGCGCTC CTTCGATCCC GTGATGCTGC CACCGAGAAG 12 00 

TCGCCAGTCG TCTAG 1215 



2) INFORMATION FOR SEQ ID NO: 2166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2166 
GACAAACCAT TCCTGCTG 
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2) INFORMATION FOR SEQ ID NO: 2167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY:. Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2167 
CAGCAGCTGG GCGGCGGT 



2) INFORMATION FOR SEQ ID NO: 2168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2168 
CATCAAAGTT GGTGAAGAAG TTG 



2) INFORMATION FOR SEQ ID NO: 2169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2169 
CCCGTTTGCG AAAGGTGG 



2) INFORMATION FOR SEQ ID NO: 2170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2170 
ACGTGACGTT GACAAACCA 19 

2) INFORMATION FOR SEQ ID NO: 2171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2171 
TCGTTGGATT AACTGAAGAA 20 
2) INFORMATION FOR SEQ ID NO: 2172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2172 
GTGTTGAAAT GTTCCGTAAA 20 

2) INFORMATION FOR SEQ ID NO: 2173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2173 
AAGAAAAAAT CTTCGAACTG GCTA 24 



2) INFORMATION FOR SEQ ID NO: 2174 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
_ ( x i J IS EQUENCE DES CR IPTION: S EQ_ I _D NO : 2 1 1 4 
TCTACACGGC CGGTG 



2) INFORMATION FOR SEQ ID NO: 2175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2175 
CCGCCATACC CCGTTT 



2) INFORMATION FOR SEQ ID NO: 2176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2176 
CGGCATTACC ATTTCCACAC CTTT 



2) INFORMATION FOR SEQ ID NO: 2177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2177 

GGCACGGACA AACCATTCCT GCTGCCTATC GAAGACGTGT TCCCGTGCC 49 



2) INFORMATION FOR SEQ ID NO: 2178 

(i) SEQUENCE CHARACTERISTICS: 
. _ . (A) LENGTH:, 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear. 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 2178 
GGCACGACAA ACCATTCCTG CTGCCTATCG AACGTGCC 38 



2) INFORMATION FOR SEQ ID NO: 2179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2179 
GGCAGCTCTA CTTCCGTACC ACTGACGTAA CCGGCTGCC 39 



2) INFORMATION FOR SEQ ID NO: 2180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2180 
TTCGCCGGCG TGGGC 15 



2) INFORMATION FOR SEQ ID NO: 2181 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2181 
-AGCGCCACGC GCAGG . . . ...... 15 



2) INFORMATION FOR SEQ ID NO: 2182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2182 
GCGCGCCAAC GACTTCTACC ACGAAATGGA AGAGTCGCGC GC 42 
2) INFORMATION FOR SEQ ID NO: 2183 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2183 

TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 
ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 
AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 
GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 
CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 
CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 
TCCTACGCCT GAGCGTGCCG TTGACGGTAC GTTCCTGATG CCTGTTGAAG 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG TTGTGACCGG TCGTATTGAG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC GAAATCGTGG GTATCAAAGA 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 
ACCAGGGCGA AGCTGGCGAT AACGTCGGTC TGCTGCTGCG TGGTACCAAG 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG GCCAAGCCAG GCTCGATCAA 
GCCACACACT GACTTCGACG CCGAGGTGTA CATTCTGTCC AAAGAAGAAG 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT ACCGTCCTCA GTTCTACTTC 
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CGTACAACTG ACGTGACCGG CACCATCGAG CTGCCAGAAG ACAAGGAAAT 750 
GGTTCTGCCA GGCGACAAGA TTTCGATGAA AGTGTCCCTG ATCGCTCCTA 800 
TCGCCATGGA AGAAGGT 817 



2) INFORMATION FOR SEQ ID NO: 2184 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH; _1652_ bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2184 

TACAATTGAA GTTGAAAGAT CTATGCGTGT TCTTGATGGT GCTGTTGCGG 50 

TATTTTGTTC AGTGGGTGGG GTGCAGCCTC AAAGTGAAAC AGTTTGGAGA 100 

CAAGCAAATA AATATGGTGT TCCAAGAATA GTATTTGTAA ATAAAATGGA 150 

TAGAATCGGT GCAAATTTCT ACAATGTAGA AGATCAAATT CGCAACCGTT 200 

TAAAAGCTAA TCCAGTTCCA CTTCAAATTC CAATCGGTGC TGAGGATAAT 250 

TTTAAAGGCG TAATCGATCT TGTAACTATG AAAGCTTTAG TTTGGGAAGA 300 

TGATACTAAG CCAACGGATT ATGTAGAAAA AGAAATTCCA GCTGAACTTA 350 

AAGAAAAGGC AGAAGAATAT CGCACAAAAA TGATAGAAGC AGTTTCTGAA 400 

ACTTCAGATG AGTTGATGGA AAAATATTTA GGTGGAGAAG AATTAAGCCT 450 

TGAAGAGATT AAAACAGGGA TTAAAGCAGG ATGTTTAAGT CTTTCTATCG 500 

TTCCTATGCT TTGCGGTACA GCGTTTAAAA ATAAAGGGGT TCAACCTTTG 550 

CTTGATGCTG TTGTGGCTTA TTTACCAGCT CCTGATGAAG TTGCTAATAT 600 

CAAAGGGGAA TATGAAGACG GCACAGAAGT TTCTGTAAAA TCAACTGATG 650 

ATGGCGAATT TGCAGGACTT GCATTTAAAA TTATGACAGA TCCATTTGTA 700 

GGACAACTTA CTTTCGTGCG TGTTTATCGT GGATGTTTAG AAAGCGGTTC 750 

TTATGCTTAT AACTCAACCA AAGATAAAAA AGAAAGAATT GGTCGTTTGT 800 

TAAAAATGCA CTCTAATAAA AGAGAAGAAA TTAAAGTTCT TTACGCAGGA 850 

GAAATCGGTG CAGTTGTAGG ACTTAAAGAT ACTTTAACAG GGGATACTCT 90 0 

TGCAAGTGAA AAAGATAAAG TAATTCTTGA AAGAATGGAT TTCCCAGACC 950 

CAGTTATTTC TGTTGCAGTA GAMCCAAAAA CTAAAGCAGA TCAAGAAAAA 1000 
ATGTCTATTG CACTAAATAA ATTAGCTCAA GAAGATCCAA GCTTTAGAGT . 1050 

TTCTACGGAT GAAGAAAGTG GTCAAACTAT CATTTCAGGT ATGGGTGAAT 1100 

TACACCTTGA AATTATTGTT GATCGTATGC TTCGTGAATT TAAAGTGGAA 1150 

GCTGAAGTGG GACAACCTCA AGTTGCTTAT CGTGAAACTA TCAGAAAAAC 1200 

TGTTGAGCAA GAATACAAAT ACGCTAAGCA ATCAGGCGGT CGTGGTCAGT 1250 

ATGGACATGT ATTCTTACGC CTTGAGCCAC TTGAGCCAGG AAGTGGATAC 13 00 

GAGTTTGTTA ACGACATCAA AGGTGGGGTA ATTCCAAAAG AATATATTCC 1350 

TGCAGTAGAT AAGGGTGTTC AAGAAGCATT GCAAAATGGT GTTTTAGCAG 14 00 

GTTATCCAGT AGAAGATGTT AAAGTAACTG TTTATGATGG AAGTTATCAC 1450 

GAGGTGGATT CATCTGAGAT GGCATTTAAA CTTGCTGCTT CTATGGGATT 1500 

TAAAGAGGGT GCTAGAAAAG CAGGTGCTGT GATCTTAGAG CCTATGATGA 1550 

AAGTTGAAGT AGAAACTCCT GAAGATTACA TGGGCGATGT TATTGGCGAT 1600 

CTTAATAAGC GTCGTGGTCA AGTAAATAGC ATGGATGAAA GAGGTGGTAA 1650 

TA 1652 
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2) INFORMATION FOR SEQ ID NO: 2185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
'(D) _ TOPOLOGY :_ Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2185 

GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGAAGATAT 350 

CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 400 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 

ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC 800 

TATGGCTGAA GGCGAGAGAT 820 



2) INFORMATION FOR SEQ ID NO: 2186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2186 
ACGCGCTCAA AGCAGAAGTA TACGTATTAT CAAAAGACGC GCGT 



2) INFORMATION FOR SEQ ID NO: 2187 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM^ Campylobacter jejuni subsp. jejuni 

(B) STRAIN:" ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2187 

CAGTGGGTGG GGTGCAGCCT CAAAGTGAAA CAGTTTGGAG ACAAGCAAAT * 50 

AAATATGGTG TTCCAAGAAT AGTATTTGTA AACAAAATGG ATAGAATCGG 100 

TGCAAATTTC TACAATGTAG AAGATCAAAT TCGCAACCGT TTAAAAGCTA 150 

ATCCAGTTCC ACTTCAAATT CCAATCGGTG CTGAGGATAA TTTTAAAGGC 200 

GTAATCGATC TTGTAACTAT GAAAGCTTTA GTTTGGGAAG ATGATACTAA 250 

GCCAACGGAT TATGTAGAAA AAGAAATTCC AGCTGAACTT AAAGAAAAGG 3 00 

CAGAAGAATA TCGCACAAAA ATGATAGAAG CAGTTTCTGA AACTTCAGAT 350 

GAGTTGATGG AAAAATATTT AGGCGGAGAA GAATTAAGCC TTGAAGAGAT 400 

TAAAACAGGG ATTAAAGCAG GATGTTTAAG TCTTTCTATC GTTCCTATGC 450 

TTTGCGGTAC AGCGTTTAAA AATAAAGGGG TTCAACCTTT ACTTGATGCT 500 

GTTGTGGCTT ATTTACCAGC TCCTGATGAA GTGGCAAATA TCAAGGGTGA 550 

ATATGAAGAT GGCACAGAAG TTTCTGTAAA ATCAACTGAT GATGGCGAGT 600 

TTGCAGGACT TGCATTTAAA ATTATGACAG ATCCATTTGT AGGACAACTT 650 

ACTTTCGTGC GTGTTTATCG CGGTTGTTTA GAAAGTGGTT CTTATGCTTA 700 

TAACTCAACT AAAGATAAAA AAGAAAGAAT TGGTCGTTTG TTAAAAATGC 750 

ACTCTAACCA AAGAGAAGAG ATTAAAGTGC TTTACGCAGG CGAAATTGGT 800 

GCTGTTGTAG GACTTAAAGA TACTTTAACA GGGGATACTC TTGTAAGTGA 850 

AAAAGATAAG GTAATCCTTG AAAGAATGGA TTTTCCAGAT CCAGTTATTT 900 

CTGTTGCAGT TGAGCCAAAA ACTAAAGCAG ATCAAGAAAA AATGTCTATT 950 

GCTTTAAATA AATTAGCACA AGAAGATCCA AGTTTTAGAG TTTCTACAGA 1000 

TGAAGAAAGT GGCCAAACTA TCATTTCAGG TATGGGTGAG TTACACCTTG 1050 

AAATTATCGT TGATAGAATG CTTCGTGAAT TTAAAGTTGA AGCTGAAGTA 1100 

GGTCAACCAC AAGTTGCTTA TCGCGAAACT ATTAGAAAAA CTGTTGAACA 1150 

AGAATACAAA TACGCTAAAC AATCAGGTGG TCGTGGTCAG TATGGACATG 1200 

TATTCTTACG CCTTGAACCA CTTGAGCCAG GTAGTGGATA TGAATTTGTT 1250 

AATGATATCA AAGGTGGAGT AATTCCAAAA GAATACATTC CTGCAGTTGA 1300 

TAAAGGTGTT CAAGAAGCAT TACAAAATGG TGTTTTAGCA GGTTATCCTG 1350 

TGGAAGATGT TAAAGTAACT GTTTATGATG GAAGTTATCA CGAGGTGGAT 1400 

TCATCTGAGA TGGCGTTTAA ACTTGCTGCT TCTATGGGCT TTAAAGAAGG 1450 

TGCTAGAAAA GCAGGTGCTG TGATCTTAGA GCCTATGATG AAAGTTGAAG 1500 

TAGAAACTCC TGAAGATTAC ATGGGTGATG TTATTGGAGA TCTTAACAAA 1550 

CGCCGTGGTC AAGTAAATAG CATGGATGAG CGTGGTGGAA ATAAAATCAT 1600 

CACAGCATTT TG 1612 



2) INFORMATION FOR SEQ ID NO: 2188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

1123 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2188 

ACAATTGAAG . TTGAAAGATC .TATGCGTGTT CTTGATGGTG CTGTTGCGGT _ .50 
ATTTTGTTCA GTGGGTGGGG TGCAGCCTCA AAGTGAAACA GTTTGGAGAC 100 
AAGCAAATAA ATATGGTGTT CCAAGAATAG TATTTGTAAA TAAAATGGAT 150 
AGAATCGGTG CAAATTTCTA CAATGTAGAA GATCAAATTC GCAACCGTTT 200 
AAAAGCTAAT CCAGTTCCAC TTCAAATTCC AATTGGTGCT GAGGATAATT 250 
TTAAAGGCGT AATCGATCTT GTAACTATGA AAGCTTTAGT TTGGGAAGAT 300 

GATACTAAGC CAACGGATTA TGTAGAAAAA GAAATTCCAG CTGAACTTAA 350 

AGAAAAGGCA GAAGAATATC GCACAAAAAT GATAGAAGCA GTTTCTGAAA 4 00 

CTTCAGATGA GTTGATGGAA AAATATTTAG GCGGAGAAGA ATTAAGCCTT 450 

GAAGAGATTA AAACAGGGAT TAAAGCAGGA TGTTTAAGTC TTTCTATCGT 500 
TCCTATGCTT TGCGGTACAG CGTTTAAAAA TAAAGGGGTT CAACCTTTGC 550 
TTGATGCTGT TGTGGCTTAT TTACCAGCTC CTGATGAAGT GGCAAATATC 600 

AAGGGTGAAT ATGAAGATGG CACAGAAGTT TCTGTAAAAT CAACTGATGA 650 

TGGCGAGTTT GCAGGACTTG CATTTAAAAT CATGACAGAT CCATTTGTAG 700 

GACAACTTAC TTTCGTGCGT GTTTATCGCG GTTGTTTAGA AAGCGGTTCT 750 

TATGCGTATA ACTCAACTAA AGATAAAAAA GAAAGAATTG GTCGTTTGTT 800 

AAAAATGCAC TCTAACAAAA GAGAAGAGAT TAAAGTGCTT TACGCAGGCG 850 

AAATTGGTGC TGTTGTAGGA CTTAAAGATA CTTTAACAGG GGATACTCTT 900 

GCAAGTGAAA AAGATAAGGT AATCCTTGAA AGAATGGATT TTCCAGATCC 950 

AGTTATTTCT GTTGCAGTTG AGCCAAAAAC TAAAGCTGAT CAAGAAAAAA 1000 

TGTCTATTGC TTTAAATAAA TTAGCACAAG AAGATCCAAG TTTTAGAGTT 1050 

TCTACAGATG AAGAAAGTGG TCAAACTATC ATTTCAGGTA TGGGTGAGTT 1100 

ACACCTTGAA ATTATCGTTG ATAGAATGCT TCGTGAATTT AAAGTTGAAG 1150 

CTGAAGTAGG TCAACCACAA GTTGCTTATC GCGAAACTAT TAGAAAAACT 1200 

GTTGAACAAG AATACAAATA CGCTAAACAA TCAGGTGGTC GTGGTCAGTA 1250 

TGGACATGTA TTCTTACGCC TTGAACCACT TGAGCCAGGT AGTGGATATG 1300 

AATTTGTTAA TGATATCAAA GGTGGGGTAA TTCCAAAAGA ATACATTCCT 1350 

GCAGTTGATA AAGGTGTTCA AGAAGCATTA CAAAATGGTG TTTTAGCAGG 14 00 

TTATCCTGTG GAAGATGTTA AAGTAACTGT TTATGATGGA AGTTATCACG 1450 

AGGTGGATTC ATCTGAGATG GCGTTTAAAC TTGCTGCTTC TATGGGCTTT 1500 

AAAGAAGGTG CTAGAAAAGC AGGCGCTGTG ATCTTAGAGC CTATGATGAA 1550 

AGTTGAAGTA GAAACTCCTG AAGATTATAT GGGTGATGTT ATTGGAGATC 1600 

TTAACAAACG CCGTGGTCAA GTAAATAGCA TGGATGAGCG TGGTGGAAAT 1650 

AAAATCATCA CAGCATT 1667 



2) INFORMATION FOR SEQ ID NO: 2189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Leishmania 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ 

TGACGGCGCT GGACGTGACG GAGGACCTCG 
CTGGAGATCG TGCAGCACTT GGATGCGAAC 
GCAGACGACG GACCTGCTGA AGCTGAAGTC 
GCAACATCTC CGTGCCGGTG GGCCGCGAGA 
GTGCTGGGCG ACGCGATTGA CCACCGCGGC 
CATGGCGATC CACGCCGAGG CGCCGAAGCT 
ACACGATCCT GACGACCGGC ATCAAGGTGA 
TGCAAGGGCG GCAAGATCGG CCTGTTCGGC 
TGTGATCATC ATGGAGCTGA TCAACAACGT 
TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA' 
TACCTGGAGA TGATGCAGTC AAAGGTGATT 
GTGCGTGCTT GTGTACGGGC AGATGAACGA 
GCGTTGCGCA GTCTGCGCTG ACGATGGCCG 
GGCCAGAACG TGCTGCTGTT CATCGACAAC 
GAACTCCGAG GTGTCTGCGC TGCTGGGCCG 
ACCAGCCGAC GCTTGCGGAG GATCTTGGCA 
TCGACGACGA AGGGATCGAT TACGTCTGTG 
GGATGATATC ACGGACCCCG CGCCCGCGAC 
CGACGACTGT GCTGGACCGC GCGGTGGCGG 
GTGAACCCGC TGGAGTGCGC GTCGCGCATC 
CGTGGACCAT TACAACGTTG CACAGGATAT 
ACAAGGAGCT GCAGGACATC ATTGCGGTGC 
GAGGAGGACA AGGTCGTGGT GGACCGCGCG 
GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT 
ACGTGCAGCT GAGCGACACG GTGGAGTCGT 
TCGTA 



guyanensis 



ID NO: 2189 



GCCGCGATGA 


GCCGCTGACG 


50 


ACCGGCCGCT 


GCATTGCGAT 


100 


GAAGGTTGTG 


TCGACCGGCG 


150 


CACTGGGCCG 


CATCTTCAAC 


200 


CCCGTGTGCG 


AGAAGATGCG 


250 


GGCGGACCAG 


GCTGCGGAGG 


300 


TCGACCTGAT 


TCTGCCCTAC 


350 


GGTGCCGGTG 


TGGGCAAGAC 


400 


CGCGAAGGGG 


CACGGCGGCT 


450 


CGCGCGAGGG 


CACGGACCTG 


500 


GACCTGAAGG 


GCGAGTCGAA 


550 


GCCCCCGGGT 


GCGCGCGCGC 


600 


AGTACTTCCG 


CGACGTGGAG 


650 


ATCTTCCGCT 


TCACGCAGGC 


700 


CATCCCGGCC 


GCCGTGGGTT 


750 


TGCTGCAGGA 


GCGCATTACG 


800 


CAGGCTGTGT 


ACGTGCCTGC 


850 


GACGTTCTCG 


CACCTGGACG 


900 


AGTCGGGCAT 


CTACCCTGCC 


950 


ATGGACCCCG 


ATGTGATCGA 


1000 


CGTCCAGATG 


CTGACCAAGT 


1050 


TTGGCATCGA 


CGAGCTGAGC 


1100 


CGCAAGGTGA 


CGCGGTTCCT 


1150 


CACCGGCATG 


ACGGGCCACT 


1200 


TCTCTGGCCT 


GCTGATGGGG 


1250 
1255 



2 ) INFORMATION FOR SEQ ID NO: 2190 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1248 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei subsp. Jbrucei 

(B) STRAIN: EATRO 795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2190 

GCCCTTGACG TTGTTGACAA ACTTGGCCGT GATGAGCCTC TTACTCTTGA 50 

GATCGTGCAG CATCTTGACG CCCACACGGG CCGCTGTATC GCGATGCAAA 100 

CGACGGATCT CCTCAAACTG AAGGCAAAGG TCGTTTCGAC AGGTGGCAAC 150 

ATTTCCGTTC CTGTCGGCCG GGAAACACTA GGTCGTATCT TCAACGTGCT 2 00 

TGGAGACGCT ATTGACCAGC GCGGCCCCGT TGGTGAGAAA CTGCGCATGC 250 

CCATCCATGC CGTGGCTCCC AAGCTTGCGG ACCAGGCCGC TGAGGATGCG 3 00 

GTGCTCACAA CTGGTATTAA GGTGATTGAT CTCATTCTCC CTTACTGCAA 3 50 
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AGGTGGAAAA ATTGGCCTCT TTGGGGGTGC 
TTATTATGGA GCTCATTAAC AACGTTGCCA 
GTCTTCGCTG GTGTTGGTGA GCGTACCCGT 
TGAGATGATG CAGTCTAAGG TTATTGACCT 
TGTTGGTGTA CGGTCAGATG AACGAGCCCC 
GCGCAGTCGG CTCTGACGAT GGCTGAGTAC 
AGATGTGCTT CTTTTTATCG ACAATATTTT 
CCGAGGTGTC GGCGCTTCTG GGTCGTATTC 
CCTACCCTCG CTGAGGATCT AGGGCAGTTG 
AACGAAAGGC TCCATTACTT CTGTGCAGGC 
ACATTACCGA TCCAGCTCCA GCAACAACCT 
ACTGTGTTGG ACCGTGCTGT TGCCGAGTCT 
CCCACTGGAA TGCGCCTCGC GTATCATGGA 
ATCACTACAA TGTTGCACAA GATGTGGTAC 
GAATTACAGG ATATCATTGC TGTCCTTGGT 
GGACAAACTT ATCGTGGACC GTGCGCGTAA 
AGCCATTCCA AGTTGCTGAG, GTCTTCACAG 
CAGTTGGATG ACACCATCGA TTCCTTTTCT 



PCT/CA00/01150 



GGGTGTGGGC 


AAAACCGTCA 


400 


AGGGTCACGG 


TGGTTTCTCT 


450 


GAGGGAACGG 


ATTTGTATCT 


500 


TAAGGGTGAG 


TCCAAATGTG 


550 


CAGGTGCCCG 


TGCGCGTGTT 


600 


TTCCGTGATG 


TGGAGGGCCA 


650 


TCGTTTCACT 


CAGGCTAACT 


700 


CCGCCGCTGT 


TGGCTACCAG 


750 


CAGGAGCGTA 


TTACCTCAAC 


800 


CGTATACGTG 


CCGGCCGATG 


850 


TCTCACATCT 


GGACGCCACA 


900 


GGTATCTACC 


CCGCTGTTAA 


950 


CCCCGACGTT 


ATCAGTGTGG 


1000 


AGATGCTCAC 


CAAGTACAGG 


1050 


ATCGACGAGC 


TAAGCGAGGA 


1100 


GTTGGTGAAG 


TTCCTCTCCC 


1150 


GAATGACTGG 


CCATTACGTG 


1200 


GGTCTCCTCA 


TGGGTACG 


1248 



2) INFORMATION FOR SEQ ID NO: 2191 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN : WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2191 

GAGCTGATTG TAAGTCATCG ACCACTTGAT ATATGAATAC ATCTAACAGT 50 

AGTAGAACAA CATCGCCAAG GCTCACGGTG GTTACTCCGT CTTCACTGGT 100 

GTCGGTGAGC GTACTCGTGA GGGTAACGAT CTGTACCACG AAATGCAGGA 150 

GACTGGTGTC ATTCAGCTCG ACGGCGAATC CAAGGTGTCT CTTGTGTTCG 200 

GTCAGATGAA CGAGCCCCCA GGTGCTCGTG CCCGTGTCGC CCTTACTGGT 250 

CTGACCATCG CCGAATACTT CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 300 

CTTCATTGAC AACATTTTCC GTTTCACCCA GGCCGGTTCC GAGGTGTCTG 350 

CCCTTCTTGG TCGTATCCCC TCTGCCGTCG GTTACCAGCC CACTCTGGCC 400 

GTCGACATGG GTGGTATGCA GGAACGTATT ACCACCACCA 440 



2) INFORMATION FOR SEQ ID NO: 2192 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1262 bases 
•(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania panamensis 

(B) STRAIN: ATCC 50158 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2192 



GCTGACGGCG CTGGACGTGA 
CGCTGGAGAT CGTGCAGCAC 
ATGCAGACGA CGGACCTGCT 
CGGCAACATC TCCGTGCCGG 
ACGTGCTGGG CGACGCGATT 
CGCATGGCGA TCCACGCCGA 
GGACACGATC CTGACGACCG 
ACTGCAAGGG CGGCAAGATC 
ACTGTGATCA TCATGGAGCT 
CTTCTCCGTG TTCGCCGGCG 
TGTACCTGGA GATGATGCAG 
AAGTGCGTGC TTGTGTACGG 
GCGCGTTGCG CAGTCTGCGC 
AGGGCCAGAA CGTGCTGCTG 
GCGAACTCCG AGGTGTCTGC 
TTACCAGCCG ACGCTTGCGG 
CGTCGACGAC GAAGGGATCG 
GCGGATGATA TCACGGACCC 
CGCGACGACT GTGCTGGACC 
CCGTGAACCC GCTGGAGTGC 
GACGTGGACC ATTACAACGT 
GTACAAGGAG CTGCAGGACA 
GCGAGGAGGA CAAGGTCGTG 
CTGTCGCAGC CGTTCCAGGT 
CTACGTGCAG CTGAGCGACA 
GGTCGTACGA CC 



CGGAGGACCT CGGCCGCGAT 
TTGGATGCGA ACACCGGCCG 
GAAGCTGAAG TCGAAGGTTG 
TGGGCCGCGA GACGCTGGGC 
GACCACCGCG GCCCCGTGTG 
GGCGCCGAAG , CTGGCGGACC 
GCATCAAGGT GATCGACCTG 
GGCCTGTTCG GCGGTGCCGG 
GATCAACAAC GTCGCGAAGG 
TTGGCGAGCG CACGCGCGAG 
TCAAAGGTGA TTGACCTGAA 
GCAGATGAAC GAGCCCCCGG 
TGACGATGGC CGAGTACTTC 
TTCATCGACA ACATCTTCCG 
GCTGCTGGGC CGCATCCCGG 
AGGATCTTGG CATGCTGCAG 
ATTACGTCTG TGCAGGCTGT 
CGCGCCCGCG ACGACGTTCT 
GCGCGGTGGC GGAGTCGGGC 
GCGTCGCGCA TCATGGACCC 
TGCACAGGAT ATCGTCCAGA 
TCATTGCGGT GCTTGGCATC 
GTGGACCGCG CGCGCAAGGT 
TGCGGAGGTG TTCACCGGCA 
CGGTGGAGTC GTTCTCTGGC 



GAGCCGCTGA 50 

CTGCATTGCG 100 

TGTCGACCGG 150 

CGCATCTTCA 200 

CGAGAAGATG 250 

AGGCTGCGGA 300 

ATTCTGCCCT 350 

TGTGGGCAAG 400 

GGCACGGCGG 4 50 

GGCACGGACC 500 

GGGCGAGTCG 550 

GTGCGCGCGC 600 

CGCGACGTGG 650 

CTTCACGCAG 700 

CCGCCGTGGG 750 

GAGCGCATTA 800 

GTACGTGCCT 850 

CGCACCTGGA 900 

ATCTACCCTG 950 

CGATGTGATC 1000 

TGCTGACCAA 1050 

GACGAGCTGA 1100 

GACGCGGTTC 1150 

TGACGGGCCA 1200 

CTGCTGATGG 1250 
1262 



2) INFORMATION FOR SEQ ID NO: 2193 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2193 

TTCCGATGGT CAAATGTACG ATTGATATTC CTTCCAGCCA GTCAGGATAA 50 

CAGCTGATAC CAGTTGCAAA TAGGCCCCAG ACTCGTGAGC ACTTGTTGCT 100 

TGCCCGTCAG GTTGGTGTCC AGAAGATCGT TGTCTTCGTC AACAAGGTTG 150 

ACGCTGTCGA TGACCCTGAG ATGTTGGAGC TTGTTGAGCT CGAGATGCGT 200 
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GAGCTCCTCA ACACTTACGG TTTCGAGGGA 
CGGTTCCGCC CTGTGCGCTC TCGAAGGCCG 
AGCGTATTGA CTCCCTCCTC GAGGCCGTTG 
CAGCGTGACT TGGACAAGCC CTTCCTGATG 
CATTGGTGGT CGTGGTACCG TCGCCTCTGG 
TCAAGAAGGA TACCGAAGTT GAAATTCACG 
ACCAAGGTCA CCGACATTGA GACCTTCAAG 
TGCTGGTGAC AACTCCGGTC TTCTCCTCCG 
TTCGTCGTGG TATGGTCATC GCTGCCCCTG 
AAGTTCATGG TCTCCATGTA CGTCTTGACT 
GAACGGCTTC GGTGCCAACT ACCGCCCCCA 
GTAAGTTTCG AACTATTTGA TTCATTGATC 
AGACGAGGCT TGCGACCTTC ATTTCCCTGA 
ACGTCATGCC CGGTGACAAC GTCGAAATGG 
GTTGCTGCTG AG 



PCT/CAOO/01150 

GAGGAGACCC CTATCATCTT 250 

CCGCGAGGAC ATTGGTACTC 300 

ACACTTGGAT CCCTACCCCC 350 

TCCATTGAGG AAGTTTTCTC 400 

TCGTGTCGAG CGTGGTCTCC 450 

GTGCTGATGG TATTCTGAAG 500 

AAGAGCTGCG ATGAGTCTCG 550 

TGGTATCCGT CGTGAGGATG 600 

GCTCCATCAA GGCCTCCAAG 650 

GAGGCTGAAG GTGGCCGCAA 700 

GGCTTTCATC CGCACTGCTG" 750 

ACGTCCCTAA CTGTTACTTT 800 

TGAGGCCGAC AAGGACCGCC 850 

TCCTCAACCT CAACAACCCC 900 

912 



2) INFORMATION FOR SEQ ID NO: 2194 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2194 



TCATCGTCGT CGCCGCTTCT GACGGTCAGA 
TTGCTTCTCG CCCGTCAGGT CGGTATCCAG 
CAAGGTCGAC GCCATCGAGG ACAAGGAGAT 
AGATGCGTGA GCTCCTCAGC AGCTACGGCT 
ATCGTCATGG GTTCCGCTCT TTGTGCCCTT 
TGGAACCACC CAGATCGACA ACCTGATGAA 
CCACTCCCCA GAGAGATCTT GAGAAGCCCT 
GTCTTCTCTA TCCCCGGTCG TGGTACTGTC 
TGGTACCCTG AAGAAGGATT CCGAAATCGA 
TCCCCATCAA GACCAAGGTC ACCGACATCG 
GACGAGTCCC GCGCTGGTGA CAACTCCGGT 
GCGTGAGGAT GTCAACCGTG GTATGGTTGT 
CCTCGCACAA GAAGTTCCTC GTCTCCATGT 
GGTGGTCGTC ACACTGGTTT CCACGGAAAC 
CCGTACCGCT GGTAAGTCCT GACTTTGAAC 
TCTAACATGT TTTACAGACG AGGCTGCCGC 
CCGAGGACGC TGACTCCAAG ATGGTCATGC 
GTCTGCTCTC TTCACAGACC TCTTGCCGTT 



TGCCCCAGAC CAGAGAGCAC 50 

AAGATTGTCG TCTTCGTCAA 100 

GCTTGAGCTC GTCGAGATGG 150 

TCGAGGGTGA CGAGACCCCC 200 

GAGAACCGCC AGCCCGAGAT 250 

CGCTGTCGAC GAGTGGATCC 300 

TCCTCATGTC CGTTGAGGAT 350 

GTTTCTGGCC GTGTTGAGCG 400 

GCTTGTCGGC AAGAACAAGG 450 

AGACCTTCAA GAAGTCTTGC 500 

CTTCTGCTCC GTGGTATCAA 550 

CGTCAAGCCC GGTACCGTCA 600 

ACGTCCTGAC CAAGGAGGAG 650 

TACCGTCCCC AGATCTTCAT 700 

TGCTGACCAA TTTTCGCATC 750 

TATTGACTGG CCCGAAGGCA 800 

CCGGTGACAA CGTCGAGATG 850 

GAACAGG 887 



2) INFORMATION FOR SEQ ID NO: 2195 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmons ia parva 
"(B) STRAIN:" ATCC~i0784~ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2195 

TGACGGCCAA ATGTAGGACT CTTGCGCGGG ACTGACTGAT TGGGGGGAAT 50 

CCATTTTTTC TTTTTTCTTT TTCTTTTTGA GTACATGATT ATACTAATAT 100 

TTGGATAACG TACCAGGCCT CAAACACGTG AACATTTACT CCTTGCCCGA 150 

CAAGTCGGTG TCCAGAAGAT CGTCGTTTTC GTAAACAAGG TCGATGTTCT 200 

CGAAGATAAG GAGATGTTGG AGCTTGTCGA GTTGGAAATG AGAGAGCTCT 250 

TGAACACCTA CGGATTCGAG GGTGAGGAAA CGCCCATCAT CTTCGGTTCT 300 

GCCCTTTGTG CCATGGAGGG CCGCGAGCCT GAGCTGGGCG AGAAGAGAAT 350 

TGATGAATTG CTCGACGCTG TTGATAGCTG GATCCCTACG CCGCAACGTG 400 

ATACGGAAAA ACCCTTCCTA ATGTCCATTG AGGAAGTGTT CTCCATCTCC 450 

GGTCGTGGAA CCGTTGCCTC CGGCCGTGTT GAGCGTGGTG TCCTCAAGAA 500 

GGATTCCGAA GTCGAGCTTG TTGGCGGCGG CGTCGCCCCA ATCAGGACCA 550 

AGGTAACCGA TATCGAAACC TTCAAGAAGT CCTGCGACGA GTCCAGGGCT 600 

GGAGACAACT CTGGCCTTTT GTTGCGTGGT GTCAAGCGTG AGGATATCCG 650 

CCGTGGTATG GTCGTTGTCG TTCCTGGCAG CGTCAAGGCC CACGACAAGT 700 

TCTTGGTGTC CATGTATGTT CTGACCGAAG CCGAGGGTGG TCGCCGAACT 750 

GGATTCGGCC AAAACTATCG TCCTCAAATG TTTATCCGCA CAGCTGGTAC 800 

GTAACGTATA ATGCCTCTCC TCTTCATATA TACCACCCCC CCCACCACTG 850 

ACTCCCTGAC TCTTCGATTA CAGACGAGGC CGCCGATCTC AGCTTCCCTG 900 

ACGCAGCAGA CGAAACCAAA CTGGTTATGC CCGGTGACAA CGTCGAGATG 950 

ATCCTCAAGA CACACCGCCC CATAGCTGCC GAAG 984 



2) INFORMATION FOR SEQ ID NO: 2196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 

(B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2196 

GCCGCCTCTG ACGGCCAAAT GCCCCAGACC CGTGAGCACT TGCTGCTTGC 50 

TCGCCAGGTC GGTGTTCAGA AGATTGTTGT TTTTGTCAAC AAGGTCGATG 100 

CCGTCGAGGA CAAGGAGATG TTGGAGCTCG TTGAGATGGA GATGCGTGAG 150 

TTGCTCAGCA GCTATGGCTT CGAGGGCGAT GAGACTCCCA TTGTCATGGG 2 00 
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CTCCGCTCTC TGCGCCATCG AGGGCCGTGA 
GTATTGATGA GCTGCTCGAG GCTGTCGACA 
CGTGATACCG ACAAGCCCTT CCTCATGGCT 
TGCTGGCCGT GGTACCGTCG TTTCTGGCCG 
AGCGCGATGC CGAAGTCGAG CTGGTTGGCA 
ACCAAGGTTA CCGACATTGA AACCTTCAAG 
CGCCGGTGAC AACTCGGGTC TCCTCCTCCG 
TCCGCCGTGG CATGGTCGTT TCTGTTCCTG 
AAGTTCCTCG TCTCCATGTA TGTGTTGAGC 
CACGGGCTTC GGCGAGAACT ACAGGCCACA 
ACGAGTCATG CGCGCTGCAC TGGCCAGAAG 
AAGCTTGTTA TGCCTGGTGA TAACGTTGAG 
GCCACA 



PCT/CA00/01150 

GCCCGAAATT GGTGTCAACC 250 

CCTGGATCCC CACCCCTCAG 300 

GTTGAGGACG TCTTCTCCAT 350 

TGTCGAGCGA GGTGTCTTGA 400 

AGGGCACTGC GCCCATCAAG 450 

AAGTCGTGTG AAGAGTCTCG 500 

TGGTGTCAAG CGTGACGACG 550 

GACAAGTCAA GGCCCACAAG 600 

AAAGAGGAAG GTGGCCGCCA 650 

AATGTTCATC CGTACTGCTG 700 

GTACCCCAGA TGCTCACGAC 750 

ATGGTTTGTG AGCTTCACGT 800 

806 



2) INFORMATION FOR SEQ ID NO: 2197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2197 

AGCGAATTGT 50 

ATGCTTGAGC 100 

CTTCGAAGGC 150 

TCCAGAACCA 200 

GCTGCCGTCG 250 

CTTCCTTATG 300 

TCGTGTCTGG 350 

GAGCTTGTTG 4 00 

CGAGACCTTC 450 

GTCTCCTCAT 500 

GTCTGCGCTC 550 

CTATGTCCTC 600 

ACTACCGACC 650 

ACTTTCCCTG 700 

CGACAACACC 750 

TTGGTCAGC 799 



ACCCGTGAAC ACTTGCTCCT 
CGTCTTTGTC AACAAGGTCG 
TCGTCGAGAT GGAGATGCGC 
GACGACACTC CCGTCATCAT 
GCGCCCTGAG ATTGGCACCG 
ACGAGTGGAT CCCAACCCCC 
TCCGTCGAGG ATGTCTTCTC 
CCGTGTGGAG CGTGGTGTTC 
GAAAGGGTCA GGAGGTTATC 
AAGAAGTCTT GTGAGCAGTC 
CCGAGGTGTT CGCCGTGAGG 
CTGGCACCGT GAAGTCTCAC 
TCCAAGGAGG AGGGTGGCCG 
CCAGCTCTAC CTCCGAACCG 
AGGGTACTGA GGATGCCTCC 
GAGATGGTTG TCACCATGGG 



CGCTCGTCAG GTTGGTGTTC 
ATGCCATTGA TGACCCCGAG 
GAGCTTCTTA ACACCTATGG 
GGGCTCGGCT CTCATGTCTC 
AGAAGATTGA TGAGCTTCTT 
GAGCGTGACC TTGACAAGCC 
CATTGCTGGC CGTGGTACCG 
TGAAGCGTGA CCAGGAGATC 
AAGACCAAGG TTACCGACAT 
CCAGGCTGGT GACAACTCTG 
ATGTCCGCCG TGGTATGGTC 
ACCCAGTTCC TCGCTTCCCT 
ACACACCGGT TTCCAGGAGC 
CAGATGAGTC CATTGACCTG 
AGCAAGATGG TCATGCCTGG 
TCACCCCAAT GCCATCGAGG 



2) INFORMATION FOR SEQ ID NO: 2198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 



.( xi ) SEQUENCE . DESCRI PTION :_ SEQ 

CTCTGACGGT CAGATGCCCC AGACCCGTGA 
AGGTCGGTGT CCAGAAGATT GTCGTCTTCG 
GACGACCCTG AGATGCTTGA GCTCGTCGAG, 
CAACACCTAC GGCTTCGAGG GTGACGAGAC 
CTCTCATGTC CCTCCAGAAC CAGCGCCCCG 
GACGAGCTCC TTGCCGCCGT TGACGAGTGG 
CCTTGACAAG CCCTTCCTCA TGTCCGTTGA 
GCCGTGGTAC CGTCGTCTCT GGCCGTGTCG 
GACCAGGAGA TTGAGCTCGT CGGCAAGGGT 
GGTCACCGAC ATTGAGACCT TCAAGAAGTC 
GTGACAACTC TGGTCTCCTC ATCCGAGGTG 
CGTGGTATGG TCGTCTGCGC CCCCGGCACT 
CCTTTCTTCC CTCTACGTCC TCACCAAGGA 
GCTTCCAGGA GCACTACCGA CCCCAGCTCT 
TCCATCGACC TGACCTTCCC CGAGGGTACC 
GGTCATGCCC GGTGACAACA CCGAGATGGT 
ACGCCATTGA GGTCGGTCA 



ID NO: 2198 

GCACTTGCTG CTTGCCCGTC 50 

TCAACAAGGT CGATGCCATT 100 

ATGGAGATGC GTGAGCTCCT 150 

CCCTGTCATC ATGGGCTCTG 200 

AGATCGGTAG CCAGAAGATT 250 

ATCCCTACCC CCGAGCGTGA 300 

GGATGTCTTC TCCATTGCCG 350 

AGCGTGGTGT CCTGAAGCGC 400 

AACGAGGTCA TCAAGACCAA 450 

TTGCGAGCAG TCCCAGGCTG 500 

TCCGCCGTGA GGATGTCCGC 550 

GTCAAGTCCC ACACTCAGTT 600 

GGAGGGTGGC CGACACACTG 650 

ACCTCCGAAC TGCTGATGAG 700 

GAGGACGCCA GCAGCAAGAT 750 

CATCACCATG GGCCACCCCA 800 

819 



2) INFORMATION FOR SEQ ID NO: 2199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2199 

TGCTGACGGC CAAATGTAAG ACGCCGCGAG GGAGTGCTGA AGGTTTTATG 50 

CTTTTTAGGC CCTTTGTTTC TGAGAGCATG ATGATACTAA TACTCGGAAA 100 

CGTATCTATT AGGCCTCAAA CACGTGAGCA TTTGCTCCTT GCCCGACAGG 150 

TCGGTGTCCA AAAGATCGTC GTTTTCGTGA ACAAAGTCGA CGCCCTTGAG 200 

GACAAGGAGA TGTTGGAGCT TGTCGAGTTA GAAATGAGAG AGCTCTTAAA 250 

CACCTACGGA TTTGAGGGTG AAGAGACACC CATCATCTTT GGTTCTGCCC 300 

TTTGCGCCAT GGAAGGCCGT GAGCCTGAGT TGGGAGAAAA GAAAATTGAT 350 

GAATTGCTGG AGGCTGTTGA TACTTGGATC CCAACACCAC AACGTGATAC 400 

CGAAAAACCT TTCTTGATGT CCGTTGAGGA AGTATTCTCT ATCTCCGGTC 450 

GTGGAACCGT TGCCTCCGGT CGTGTTGAGC GCGGTGTCCT CAAGAAGGAT 500 
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TCAGAAGTCG AGCTAATTGG GGGCGGCTCC 
AACTGATATC GAAACTTTCA AGAAATCCTG 
ACAACTCCGG TCTTTTATTG CGTGGTATCA 
GGTATGGTAG TTGCCGTTCC TGGCAGCGTC 
GGTGTCGATG TATGTCCTGA CCGAAGCTGA 
TCGGCCAGAA CTATCGTCCT CAAATGTTCA 
AAATGGGGCC CCTTTTCATA ATCCTTTCTT 
ATCTCTCTCT CTGTTTCTTT TCAACTCGCC 
CCCCTTTGAT TATAGACGAA GCCGCCCATC 
GATGAAAGCA AACTCGTTAT GCCTGGTGAC 
GACACACCGC CCCGTGGCTG CTGAG" 



PCT/CA00/01150 

ACCCCCATCA GGACGAAGGT 550 

TGACGAGTCT AGAGCTGGGG 600 

AGCGTGAAGA TATCCGCCGT 650 

AAGGCCCACG ACAAGTTCTT 700 

GGGTGGTCGC CGAACCGGAT 750 

TCCGCACAGC TGGTATGTCA 800 

TTTTTCCTTT TCCTCTCTCT 850 

TGATTCACGA AATTAACTAA 900 

TCAGCTTCCC TAGTGGAGCA 950 

AACGTCGAGA TGATCCTCCA 1000 

1025 



2) INFORMATION FOR SEQ ID NO: 2200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kocuria kristinae 

(B) STRAIN: ATCC 27570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2200 

AGCACGTGCT GCTCGCCCGC CAGGTCGGCG TGCCGACCCT GCTGGTCGCC 50 

CTGAACAAGG CCGACATGGT CGAGGACGAG GAGCTGCTGG ACCTCGTCGA 100 

GATGGAGGTC CGCGAGCTGC TGTCCTCCCA GGAGTTCGAC GGCGACAACG 150 

CCCCCGTCAT CCGCGTCTCC GCGCTGAAGG CGCTGGAGGG CGACGAGAAG 200 

TGGGTCAAGT CCATCGAGGA GCTCATGGAG GCCGTGGACG AGTACATCCC 250 

GGACCCCGTG CGCGACAAGG ACAAGCCGTT CCTGATGCCC ATCGAGGACG 300 

TCTTCACCAT CACCGGGCGC GGCACCGTGG TGACCGGTCG CGCCGAGCGC 350 

GGGACCCTGG CCCTGAACTC CGAGGTCGAG ATCGTCGGCA TCCGCCCGAT 400 

CCAGAAGACC ACGGTCACCG GGATCGAGAT GTTCCACAAG CAGCTCGACG 450 

AGGCCTGGGC CGGCGAGAAC TGCGGTCTGC TGCTGCGCGG CCTGAAGCGC 500 

GACGACGTCG AGCGCGGCCA GGTCGTGGTG AAGCCGGGTT CCATCACCCC 550 

GCACACCAAC TTCGAGGCGA ACGTCTACAT CCTGTCCAAG GACGAGGGTG 600 

GGCGTCACAA CCCGTTCTAC TCGAACTACC GTCCGCAGTT CTACTTCCGG 650 

ACCACCGACG TCACCGG 1 667 



2) INFORMATION FOR SEQ ID NO: 2201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio mimicus 

(B) STRAIN: ATCC 33653 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2201 



TGCTGCAACA GATGGTCCAA 
GTCGCCAAGT AGGTATTCCT 
ATGGTTGACG ATGAAGAGCT 
GCTTCTGTCT GAGTACGATT 
GTTCAGCACT AGGCGCGCTA 
GTTGAACTAG CAGAAGCACT 
AGTAGACATG GCATTCCTGA 
GTCGTGGTAC AGTAGTAACT 
GGTGACGAAG TTGCGATCGT 
TACAGGTGTA GAAATGTTCC 
AGAACGTTGG TGCACTGCTA 
GGTCAAGTAC TGGCGAAGCC 
ATCAGAAGTA TACGTACTGT 
TCTTCAAAGG TTACCGTCCA 
GGCAGCATCG AGCTTCCAGA 
CATCAAGATG GTTGTAGACC 



TGCCACAAAC TCGTGAGCAC 
TACATCATCG TATTCATGAA 
TCTAGAGCTG GTTGAGATGG 
TCCCAGGTGA TGACCTGCCA 
AACGGCGAAG CACAGTGGGA 
AGATTCATAC ATTCCAGAGC 
TGCCAATCGA AGACGTATTC 
GGCCGTATCG, AGCGCGGCAT 
TGGTATCAAA GACACAGTAA 
GTAAGCTGCT TGACGAAGGT 
CGTGGTACTA AGCGTGAAGA 
AGGTTCAATC ACCCCACACA 
CAAAAGACGA AGGTGGCCGT 
CAGTTCTACT TCCGTACAAC 
AGGCGTAGAA ATGGTAATGC 
TGATTGCA 



ATCCTGCTGG 50 

CAAATGTGAC 100 

AAGTTCGTGA 150 

GTAATCCAAG 200 

AGCGAAGATT 250 

CAGAGCGTGC 300 

TCAATCCAAG 350 

CCTGAAAGTG 400 

AAACTACCTG 450 

CGTGCAGGTG 500 

AGTAGAGCGT 550 

CTAAGTTCGA 600 

CATACTCCAT 650 

TGACGTAACA 700 

CAGGCGACAA 750 
778 



2) INFORMATION FOR SEQ ID NO: 2202 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2202 

AACGCTGACC CTGCAGGTTA TTGCCGCTGC GCAGCGTGAA GGTAAAACCT 50 

GTGCATTTAT CGATGCAGAA CACGCACTGG ACCCGGTCTA TGCCCGTAAG 100 

CTTGGCGTTG ATATCGATAA CCTGCTGTGT TCTCAGCCGG ATACCGGTGA 150 

ACAAGCGCTG GAAATCTGTG ATGCACTGGC GCGCTCCGGT GCGGTTGACG 200 

TTATCGTTGT CGACTCCGTT GCCGCATTGA CGCCGAAGGC AGAAATCGAA 250 

GGCGAGATTG GCGACTCTCA CATGGGCCTT GCGGCGCGTA TGATGAGCCA 300 

GGCGATGCGT AAGCTGGCCG GTAACCTGAA GCAGTCCAAC ACGCTGCTGA 350 

TTTTCATCAA CCAGATCCGT ATGAAGATTG GCGTTATGTT CGGTAACCCG 400 

GAAACCACCA CC 412 



2) INFORMATION FOR SEQ ID NO: 2203 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 337 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:1.2 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 2203 



TTTAGATCCA TCTTATGCTA GAAATTTAGG 
TAGTTTCTCA ACCAGATACA GGAGAACAGG 
TTAGTAAGAT CAGGAGCAGT AGATGTTATA 
TTTAGTTCCT AGGGCAGAAA TAGAAGGAGA 
GTCTTCAAGC AAGACTTATG TCTCAAGCCC 
ATAAATAAAT CTAAGTGTGT AGCTATATTT 
GGTTGGTATA ATGTTTGGAA ATCCAGAAAC 



TGTTGATATA GATAACCTAA 50 

CTTTAGAGAT AACAGAAGCT 100 

GTTGTAGACT CTGTAGCAGC 150 

AATGGGAGAC TCACATGTAG 200 

TAAGAAAATT AGCAGGATCT 250 

ATAAACCAAT TAAGAGAAAA 300 

AACTCCT 337 



2) INFORMATION FOR SEQ ID NO: 2204 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN: ATCC 29684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2204 

AAAGCAAGGC GGTACTGCAG CATTTGTTGA TGCTGAGCAT GCACTAGATC 50 

CAAAATATGC AAAGCTTTTA GGTGTTGATG TTGATAATCT GATCGTGTCA 100 

CAGCCGGATA CGGGTGAGCA AGCTTTAGAG ATTGCTGATA TGTTGGTACG 150 

TTCTGGAGGA GTTGATATTG TAGTAATTGA CTCTGTTGCT GCACTTACGC 200 

CAAAGGCAGA GATTGAGGGT GACATGGGCG ACTCGCACAT GGGCTTACAA 250 

GCAAGATTAA TGTCACAAGC ACTAAGAAAA CTAACGGCAA ATATCAAGCG 300 

CTCAAATACT CTAGTGATAT TCATTAACCA AATTCGTATG AAGATCGGGG 350 

TTATGTTTGG TAACCCTGAA ACTACAACT 379 



2) INFORMATION FOR SEQ ID NO: 2205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2205 



ACTTGACCCA GTATATGCAA GGGCTCTTGG AGTGGATATA GACAACCTAG- 50 

- - TCATATCTCA GCCAGATACA GGAGAACAGG CCCTAGATAT AGCAGAGTCC- .100 

CTTATAAGAT CAGGAGCTGT AGATATACTA GTAATAGACT CAGTAGCTGC- 150 

CCTAGTACCT AAGGCAGAAA TAGAAGGTGA CATGGGAGAT TCTCACGTAG- 200 

GTCTACAGGC TAGACTTATG TCACAGGCAC TTAGAAAATT GACTGGATCT- 250 

ATAAAGAAGT CAAACTGTGT TGTTATATTT, ATCAACCAGT TGAGAGAAAA- 300 

AGTAGGGGTT ATGTTCGGTA ATCCAGAGAC AACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus asaccharblyticus 

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2206 



TCTTGATGCT GGATATGCAA AAAACCTTGG AGTAGATGTA GAAAATTTAA- 50 

TTATTTCTCA ACCTGATACA GGTGAGCAAG CCTTAGAAAT AACTGAAGCT- 100 

CTTGTAAGAT CTAACGCTGT TGATTTAATT ATTATAGACT CAGTTGCCGC- 150 

ACTTGTACCA AAAGCAGAAA TCGATGGTGA CATGGGAGCT GCACAAATAG- 200 

GTCTTCAAGC AAGACTTATG TCTCAAGCTC TTAGAAAATT AACTGGGGCA- 250 

ATCAACAAGT CAAAATGTAC CGTTGTATTT ATTAACCAAC TTAGAGAAAA- 300 

AGTTGGTATC ATGTTTGGTA ACCCAGAAAC TACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 
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(xi) SEQUENCE DESCRIPTION: SEQ 

CTCACGTTGC AAGTTATTGC AGCAGCACAA 
ATTTATCGAC GCTGAACATG CGCTAGATCC 
GTGTTGATAT CGATAACCTT CTATGTTCTC 
GCATTAGAGA TTTGTGATGC ACTGACGCGT 
TATCGTTGAC TCCGTGGCCG CATTAACACC 
AAATCGGTGA CTCACACATG GGCTTAGCGG 
ATGCGTAAAT TAGCGGGTAA CTTAAAGAAC 
CATTAACCAA ATCCGTATGA AGATTGGCGT 
CCACTACA 



ID NO: 2207 

CGTAGCGGAA AAACCTGTGC 50 

AATCTATGCG AAAAAACTGG 100 

AACCTGATAC TGGTGAGCAA 150 

TCAGGCGCTG TTGATGTCAT 200 

AAAAGCTGAA ATTGAAGGTG 250 

CTCGTATGAT GAGCCAAGCG 300 

TCGAATACAC TTTTAATCTT 350 

TATGTTTGGT AACCCAGAAA 400 

408 



2) INFORMATION FOR SEQ ID NO: 2208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 03 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2208 

GCTGCAGGTG ATTGCCGCTG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA 
TCGATGCGGA ACACGCGCTT GACCCTGTTT ACGCACGCAA GCTGGGCGTC 
GATATCGATA ACCTGCTTTG TTCTCAGCCG GATACCGGCG AGCAGGCGCT 
GGAAATCTGT GACGCGCTGG CGCGTTCAGG CGCGGTGGAC GTCATTGTGG 
TCGACTCCGT AGCGGCGCTA ACGCCGAAAG CGGAAATCGA AGGCGAAATT 
GGCGACTCTC ACATGGGCCT CGCGGCGCGT ATGATGAGCC AGGCGATGCG 
TAAGCTGGCG GGGAACCTAA AACAGTCCAA CACGCTGTTG ATTTTCATCA 
ACCAGATCCG TATGAAGATT GGCGTGATGT TCGGTAACCC GGAAACCACC 
ACC 



50 
100 
150 
200 
250 
300 
350 
400 
403 



2) INFORMATION FOR SEQ ID NO: 2209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2209 



GACCCTGACG CTGCAGGTGA TTGCCGCTGC GCAGCGTGAA GGTAAAACCT 50 

GTGCGTTTAT CGATGCGGAA CACGCGCTTG ACCCTGTTTA CGCACGCAAG 100 

CTGGGCGTCG ATATCGATAA CCTGCTCTGC TCTCAGCCGG ATACCGGCGA 150 

GCAGGCGCTG GAAATCTGTG ACGCGCTGGC GCGTTCAGGC GCGGTGGACG 200 

TCATTGTGGT CGACTCCGTA GCGGCGCTAA CGCCGAAAGC GGAAATCGAA 250 

GGCGAAATCG GCGACTCTCA CATGGGCCTC GCGGCGCGTA TGATGAGCCA 300 

GGCGATGCGT AAGCTGGCGG GGAACCTGAA ACAGTCCAAC ACGCTGTTGA 350 

TTTTCATCAA CCAGATCCGT ATGAAGATTG , GCGTGATGTT CGGTAACCCG 400 

GAAACCACCA CC 412 



2) INFORMATION FOR SEQ ID NO: 2210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2210 



ATTAGATCCT GTTTACGCAG AAGCTTTAGG CGTAGATATA CAAAATTTAT- 50 

ATTTATCTCA ACCTGATCAT GGGGAACAAG GTTTAGAAAT TGCCGAAGCA- 100 

TTTGTTAGAA GTGGCGCTGT TGATATCGTT GTGGTCGATT CAGTTGCTGC- 150 

GCTTACACCT AAAGCTGAAA TTGAAGGTGA AATGGGAGAT ACGCACGTTG - 200 

GTTTGCAAGC ACGTCTTATG TCCCAAGCCT TGAGAAAGCT TTCCGGTGCA- 250 

ATTTCAAAAT CAAATACAAC AGCAGTATTT ATCAACCAAA TCCGTGAAAA- 300 

AGTTGGTGTG ATGTTCGGTA ATCCTGAAGT TACACCA 337 



2 ) INFORMATION FOR SEQ ID NO: 2211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 
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(xi) SEQUENCE DESCRIPTION: SEQ 

GACACTGACA TTACAGGTTA TCGCCGCCGC 
GTGCATTTAT CGATGCCGAA CATGCCCTTG 
TTGGGTGTAG ATATTGATAA CCTACTGTGT 
GCAGGCACTG GAAATTTGTG ATGCGCTGAC 
TTATCATCGT TGACTCCGTA GCGGCATTGA 
GGTGAAATTG GCGATTCTCA TATGGGCCTT 
GGCTATGCGT AAGCTGGCGG GTAACCTGAA 
TTTTTATCAA CCAAATCCGC ATGAAAATTG 
GAAACCACTA CC 



ID NO: 2211 

ACAGCGTGAA GGCAAAACGT 50 

ACCCAATCTA TGCCAAGAAA 100 

TCTCAGCCAG ATACTGGCGA 150 

TCGCTCTGGT GCGGTTGACG 200 

CACCAAAAGC TGAAATTGAA 250 

GCCGCGCGTA TGATGAGCCA 3 00 

GAATGCGAAT ACCTTACTGA 3 50 

GCGTGATGTT TGGTAACCCA 4 00 

412 



2) INFORMATION FOR SEQ ID NO: 2212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2212 

CACGCTCACC TTGCAAACCA TCGCGGAAAT GCAAAAACTG GGCGGCACCT 50 

GCGCGTTTAT CGACGCCGAG CACGCACTGG ACGTCACGTA CGCGCAAAAG 100 

CTGGGCGTCA ACCTGAGCGA TCTGCTGATC TCGCAACCGG ACACCGGCGA 150 

ACAAGCGCTG GAAATCTGCG ACGCCCTGGT GCGTTCCGGT TCGGTGGACA 2 00 

TGGTCGTGAT CGACTCGGTC GCCGCGCTGA CCCCGCGCGC CGAGATCGAA 250 

GGCGACATGG GCGATTCGCT GCCAGGTTTG CAGGCACGTT TGATGTCGCA 300 

AGCACTGCGC AAGCTTACCG GTTCGATCAA CCGCACCAAC ACCCTGGTCA 350 

TCTTCATCAA CCAGATCCGC ATGAAAATCG GCGTCATGTT CGGCAGCCCG 4 00 

GAAA 4 04 



2) INFORMATION FOR SEQ ID NO: 2213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2213 
CGTGCCATTG ACATGATTTC CGAAGAAGAC GCTGAAGGCA CG 42 
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2) INFORMATION FOR SEQ ID NO: 2214 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 125 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) _ TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2214 

CAACTACATT ACGTTCTGCA ACACAAGGTC GTGGTACTTT CAGTATGACA 
TTTGACCACT ATGAAGATGT TCCTAAGAGC ATTGCAGAAG AAATCATCAA 
GAAAAATGGC GGTAACGGAG AATAA 



2) INFORMATION FOR SEQ ID NO: 2215 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2215 

CGACTCAAAT GCGTTCTATG TCTCAAGGTC GTGCGACATA CTCAATGGAA 50 
TTTGCTAAAT ATGCTGAAAC TCCACGTAAC GTGGCTGAAG GCATCATCGC 100 
TAAATTCCAA GCTGGCGGTA AAAAAGGTGA CGACGAGTAA 140 



2) INFORMATION FOR SEQ ID NO: 2216 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2216 



CCGGTGACCT GCGTTCTAAG ACGCAGGGTC GCGCTGTCTA CTCCATGGAG 
TTCGACAGCT ACGCCGAGGT TCCGCGCGCG GTCGCGGATG AGATCGTCGG 
CAAGTCTCGG GGCAACTGA 



50 
100 
119 



2) INFORMATION FOR SEQ ID NO: 2217 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2217 

CGACCGACGT TCGCTCCATG TCCCAGGGTC GCGCAAGCTA CTCTATGGAA 50 
TTCAAAAAAT ACAACACAGC TCCGGCGCAC ATCGCTGAAA CTGTATCCAA 100 
AAAACAAGGC TGA 113 



2) INFORMATION FOR SEQ ID NO: 2218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2218 

GCGACCTGCG TTCCCGTACC CAGGGCCGTG CAAACTACAC CATGATCTTC 50 
GACTCCTACG CTGAGGTTCC TACCAACGTG GCAGCTGAGA TCGTGGCAGA 100 
GCGCAACGGC ACTGCCTAA 119 



2) INFORMATION FOR SEQ ID NO: 2219 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enierobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2219 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCTAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AGTAA 115 



2) INFORMATION FOR SEQ ID NO: 2220 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2220 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCGAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AATAA 115 



2) INFORMATION FOR SEQ ID NO: 2221 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2221 



CAACTCACCT TCGTTCAGGT ACGCAAGGTC GTGGTGTATA CACTATGCAA 
TTTGACCACT ATGAAGAAGT TCCTAAATCT ATTGCTGAAG AAATCATTAA 
AGCTAATGGT GGA 



50 
100 
113 



2) INFORMATION FOR SEQ ID NO: 2222 

(if SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:, Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2222 

CGGCGACCTG CGGTCCAAGA CCCAAGGCCG GGCGAACTAC TCCATGGTCT 50 
TCGACTCCTA CGCCGAAGTG CCGGCCAACG TGTCGAAGGA GATCATCGCG 100 
AAGGCGACGG GTCAGTGA 118 

2) INFORMATION FOR SEQ ID NO: 2223 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2223 

CCGGCGACCT GCGGTCCAAG ACGCAAGGCC GGGCGAACTA CTCCATGGTG 50 
TTCGACTCGT ACGCCGAAGT TCCGGCGAAC GTGTCGAAGG AGATCATCGC 100 
GAAGGCGACG GGCGAATAG 119 



2) INFORMATION FOR SEQ ID NO: 2224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2224 

CGGCGACCTG CGGTCCAAGA CTCAAGGCCG GGCGAACTAC TCGATGGTGT 50 
TCGATTCCTA CGCCGAAGTG CCGGCTCAGG TGTCGAAGGA GATCATCGCG 100 
AAGGCGACTG GCGAGTGA 118 



2) INFORMATION FOR SEQ ID NO: 2225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2225 

CGGAGACTTG CGGTCGAAGA CCCAGGGCCG GGCGAACTAC TCCATGGTGT 50 
TCGACTCCTA CGCCGAAGTG CCGGCGCAGG TGGCGAAGGA GATTATCGCG 100 
AAGGCAACGG GCGAGTAA 118 



2) INFORMATION FOR SEQ ID NO: 2226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2226 

GACCGACCTG CGTTCTGCAA CCCAAGGCCG CGCTACTTAC TCTATGGAGT 50 

TCAAGAAATA TTCTGAAGCT CCTGCCCACA TAGCTGCTGC TGTAACTGAA 100 

GCCCGTAAAG GCTAA 115 
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2) INFORMATION FOR SEQ ID NO: 2227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

. . (D) . TOPOLOGY: Linear ... . . _ 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2227 

AACTTCATTA CGTTCTAACA CGCAAGGTCG CGGTACTTAC ACAATGTACT 50 
TTGACCACTA TGCAGAAGTT CCTAAATCAA TTGCTGAAGA AATCATCAAG 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 2228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2228 

AACTTCATTA CGTTCTAACA CTCAAGGTCG CGGTACTTAC ACTATGTACT 50 
TCGATCACTA TGCAGAAGTT CCAAAATCAA TTGCTGATGA TATCATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 2229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2229 

CGTTCAATGC GTGTTCTAGA CGGTGCAGTT ATGGTTTACT GTGCTGTGGG 50 

TGGTGTTCAG CCTCAGTCTG AAACCGTTTG GAGACAGGCT CAGAAGTACA 100 

AGGTTCCTCG TATTGCTTTC GTTAATAAGA TGGACCGTAC TGGTGCTAAT 150 

.TTCCTACGTG TTGTAGAGCA GATTAAGACC CGTCTAAAGG GTAACCCTGT 200 

TCCTCTAATG TTACCTATCG GTAAAGAGGA CAGCTTTGTT GGTGTAGTTG 250 

ACCTAATCAA GCGTAAGGCT ATCGACTGGG ATGAGGCATC TCAGGGTATG 300 

AAGTTTGAGT ACGTTGACAT TCCAGCAGAT ATGGTTGAGG AAGTTGAAGA 350 

GTGGCGTGCA AAGCTTGTAG AAGCAGCTGC, AGAAGCTAAC GACGAGCTGA 400 

TGGATAAATT CTTCGGTGGT GAAGAGCTGA CCGAGGAAGA GATCAAGGCT 450 

GCTCTACGTG AGCGTACTCT TCGCAACGAA ATTATTCCTA TGTGCTGCGG 500 

TTCAGCATTT AAGAACAAGG GTGTTCAGGC AATGCTTGAC GCTGTTGTTG 550 

AGTATCTTCC ATCTCCAGCA GATGTTCCTG CTGTTGAGGG TAAGACCCTA 600 

ACCGGTGAAG CTGATACTCG TAAGGCTGAC GATAAAGAGC CATTCTCTGC 650 

TTTAGCATTT AAGCTAGCAA ATGACCCATT CGTAGGTAAC TTAACATTCT 700 

TACGTTGCTA CTCAGGCTTT ATTAAGTCTG GTGACACTGT AATGAACTCA 750 

GATAAGCAGA AGCGTGAGCG TTTCGGCCGT CTAGTTCAGA TGCACGCTAA 800 

TGCTCGTAAT GAGGTTAGCG AGGTTTATGC AGGTGACATC GTTGCTGCTA 850 

TTGGTCTGAA GGAAACCGTT ACCGGTGATA CCTTATGTGA CCCAGAGCAT 900 

CCAATCATTC TTGAGTCAAT CGACTTTGCA GAGCCAGTTA TCTCTGTAGC 950 

AGTTGAGCCT AAGACCAAGG ACGATCAGGA GAAGATGGCT CTTGCTTTAC 1000 

AGCGTTTAGC AAAAGAAGAT CCTTCATTCC GCGTTCGTAC AGACGAAGAG 1050 

TCTGGCCAGA CCATTATTTC TGGTATGGGT GAGCTTCACC TAGACATCAT 1100 

TGTTGACCGT CTACGCCGTG AGTTCAAGGT TGAGTGTAAT CAGGGTAAGC 1150 

CACAGGTTGC ATACCGTGAG ACCATTAAGA GCAAGGTTGA ACAGCAAGGT 12 00 

AAGTTTGCTC GTCAGTCTGG TGGTCGTGGT CAGTACGGTG ACTGCTGGTT 1250 

ACGTATGGAA CCTCTTGAGC CAGGTAAGGG CTACGAATTC GTGAATGAGA 1300 

TTGTTGGTGG TGTAATTCCT AAGGAATATA TCCCTGCAAT TGATAAGGGC 1350 

TGTCAGGAGC AGATCGCTAA CGGTGTTCTA GCTGGTTTCC CAGTTGTTGA 1400 

CATCAAGATC ACTGTATTCG ATGGTTCTTA CCACGAAGTT GACTCTTCAG 14 50 

AAATGGCATT CAAGATTGCT GCTTCTATGG CATTCAAAGA GGGCTTCAAG 1500 

AAGGCAAATC CTGTTCTTCT AGAGCCTTTA ATGAAGGTAG AAGTTGATAC 1550 

TCCTGAAGAC TACATGGGTG ACGTTATTGG TGACTTAAAC CGTCGTCGTG 1600 

CTATCGTTGA AGGCATGGAA GATGGTCCTA 163 0 



2) INFORMATION FOR SEQ ID NO: 2230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halcphilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2230 
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TTGAAGTGCA ACGTTCATTG CGTGTGCTAG 
GACTCACAAT CAGGTGTTGA ACCACAAACT 
AACAGAATAT CAAGTACCTC GTATTGTATT 
TTGGCGCAGA TTTCTTATAT TCTGTCCGAT 
GCTAATGCAC AACCTATCCA ATTGCCAATT 
AGGTATCATC GACCTTGTGA AGATGAAAGC 
TAGGGACTAC TTTCGAAGAA ACTGAAATCC 
GCTCAAGAAT GGCATAATAA TTTGGTAGAA 
AGATATCATG ATGAAATACT TGGAAGGTGA 
TACAAGCAGG TATTCGTAAA GCAACATTAT 
TTATGTGGTT CTGCATTTAA AAACAAAGGT 
AGTAATTGAT TACTTGCCTT CTCCAACCGA 
TCGATCCGAA AACAGATGAA GAAACTGAAC 
CCTTTTTCAT CACTTGCTTT TAAAGTTATG 
CTTAACTTTC TTCCGTGTTT ATTCAGGTGT 
TATTGAATGC TACTAAGGGT TCACGTGAAC 
ATGCATGCCA ATTCTCGTTC TGAGATCGAT 
TGCAGCTGCT GTAGGCTTGA AGAACACTAC 
ATGAGAAAAA TCCAGTTATT TTGGAAACTA 
ATTCAAGTTG CTGTTGAACC TAAGTCAAAA 
CGTAGCACTA CAAAAACTTG CAGAAGAAGA 
CCAACGCTGA AACTGGCGAA ACTGTAATTG 
TTAGACGTTC TTATTGACCG TATGAAGACT 
TATTGGTGCA CCACAAGTTT CTTATCGTGA 
AAGCTGAAGG GAAATTTATC CGCCAATCTG 
CACGTATGGG TTGAATTTAC TCCAAACGAA 
TAAAAACTCC ATTGTTGGTG GGGTTGTCCC 
TACAAAAAGG ACTGGAAGAC GCCATGGAAA 
CCATTAGTTG ACGTAAAGGC AGAACTGTTT 
CGACTCTAAT GAAACAGCCT TCCGTATTGC 
AAGCTGCGAA AAAGGCAGAT CCAGTTATTC 
ACAATTAGTA TCCCTGAAGA ATATCTAGGT 
AGCTCGTCGT GGTCGTGTTG AAGGAATGGA 
CTGTAAATGC GT 



ACGGTTCAGT TACTGTCTTG 50 

GAAACAGTTT GGCGTCAAGC 100 

CTGTAATAAA ATGGATAAGG 150 

CATTGCATGA TCGTTTAGAA 200 

GGTGCTGAAG ATAACTTTGA 250 

TGAATTTTAT AAAGATGATT 300 

CAGATGAATA TAAAGAAACA 350 

TCTGTAGCTG ATTTTGATGA 4 00 

AGAAATTACA CCTGAAGAGT 450 

CTGTTGAATT TTACCCAGTA 500 

GTTCAAATGA TGTTGGATGC 550 

CGTTCCCCCA ATTAAAGGGA 600 

, ATCCTGCTGA" TGATAGTGAG 650 

TCAGACCCTT ATGTTGGCCG 700 

GTTGGATACA GGTTCTTATG 750 

GAATTGGTCG TATTTTGCAA 800 

AAGGTTTATT CAGGTGACAT 850 

AACAGGGGAT ACCCTTTGTG 900 

TCAACTTCCC TGAACCAGTA 950 

GCTGACCAAG ATAAAATGAG 1000 

CCCATCTTTT AAAGTGGAAA 1050 

CTGGTATGGG TGAACTTCAA 1100 

GAATTTAAAG TGGATGCCAA 1150 

AACTTTCCGT TCATCAACTA 1200 

GTGGTAGAGG TCAATACGGT 1250 

GAAGGAGCAG GATTCGAATT 1300 

TCGTGACTAT ATACCTGCAG 135 0 

ATGGTGTGTT AGCTGGTTAT 1400 

GATGGTTCTT ACCATGACGT 1450 

GGCTTCAATG TCTCTACGTG 1500 

TTGAACCGAT GATGAAAGTA 1550 

GATATTATGG GACATGTTAC 1600 

TGCTCACGGT AATGCACAAA 1650 

1662 



2) INFORMATION FOR SEQ ID NO: 2231 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2231 

AAGTAGAACG TTCTCTACGT GTACTTGACG GTTCTGTTGC GGTGTTCAGT 50 
GCTAAAGGTG GCGTTGAACC TCAATCCGAA ACAGTATGGC GTCAGGCTTC 100 
TAACTACGGC GTACCTCGTA TCGCTTATGT AAATAAGATG GATACTGTAG 150 
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GTGCTGACTT CTTCAACGTA GTTGACATGA 
AATTCCGTAG CTATCCAAGT ACCAATCGGT 
CATCATTGAC TTGATGACTA TGAAAGCGGA 
GTAAAGAATA TGAAATCACT GATATCCCTG 
GAAGCTCGTC GCGAAATGAT GATCGATGCT 
TATCATGATG AAATATTTGG AAGGCGAAGA 
AAGCGGCATT GCGTAAAGCT GTTATTGCTA 
TGTGGTTCTT CCTATAAAAA TAAAGGTGTT 
TATCGATTAC ATGCCAGCTC CAATCGACAT 
TTCCTGGTAC TGAAGAAGAA ACAACTCGTC 
TTCTCTGCAT TGGCATTCAA AATCATGGCT 
AGCGTTCTTC CGTGTGTACT CCGGTACATT 
TCAACTCCAC TAAAGGTAAA AAAGAACGTA 
CACGCTAACT CCCGTAAAGA AATCGAACGC 
TGCGGCGGTT GGCTTAAAGG ATACTACTAC 
AAAAATCTCC TGTAATCCTT GAGTCCATGG 
TCCGTTGCTG TTGAACCTAA AACAAAAGCT 
AGCTCTTGCT CGTTTGGCAG AAGAAGATCC 
ATGAAGAAAC AGGTCAAACT ATTATCTCTG 
GATATCATCG TTGACCGTAT GAACCGTGAA 
AGGTAAACCT CAAGTAGCAT ACCGCGAAAC 
CTGAAGGTAA ATTCGTACGT CAATCTGGTG 
TGCTGGTTGG AATTGATTCC TCAAGAACCA 
AAACAAGGTT GTAGGTGGTG CGATTCCTCG 
AAAGCGGTGT TAAAGAAGCT ATGGAATCCG 
ATGGTTGATG TTAAAGTTAT CGTATTTGAT 
CTCCAACGAA ATGGCCTTCA AAATTGCTGG 
GTGCTCGCAA AGCAGACCCT GCATTGCTTG 
GTAGACGTTC CTGAAGAATA CATGGGCGAC 
TCGTCGTGGT CGCATGGACG GCATGGAAGC 
TC 
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TGAAAGCTCG TTTGGGTGCA 2 00 

GCTGAAGATA CTTTCGAAGG 250 

AATTTATAAA TCCGATGACG 300 

CTGAATATCA AGAAGTAGCA 350 

ATCGCTGAAA CAGATGATGA 400 

AATTTCTGTG GAAGAATTGA 450 

ACCAATTATT CCCAGTTCTT 500 

CAAATGTTAT TGGATGCTGT 550 

CCCACCTATT AAAGGTGTTG 600 

CTTCTTCCGA TGAAGAGCCA 650 

GACCCTTATG TTGGTAAATT 700 

GGAATCTGGC TCCTACGTTT 750 

TCGGTCGTAT TCTTCAAATG 800 

GTATATTCTG GTGACATCGC 850 

AGGCGACACA TTGTGTGATG 900 

AATTCCCTGA ACCAGTTATC 950 

GACCAAGAAA AAATGGGTAC 1000 

TACTTTCAAA GTTCGTACTG 1050 

GTATGGGCGA ACTTCACTTG 1100 

TTCAAAGTAG ATTGTAACGT 1150 

TATCCGTAAA GCTGTTAAGG 1200 

GTCGTGGTCA ATATGGTCAC 1250 

GGTGCTGGCT TCGAGTTTGA 1300 

TGAATACATC GGACCTGTTG 1350 

GTGTTATCGC TGGGTACCCT 1400 

GGTTCTTACC ATGACGTTGA 1450 

TTCTATGGGC TTCAAAGAAG 1500 

AACCATATAT GGCTGTAGAA 1550 

GTTATCGGTG ACTTGAACTC 1600 

TCGTAATGGT TCCCAACATA 1650 

1652 



2) INFORMATION FOR SEQ ID NO: 2232 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2232 

AGTAGAGCGT TCCATGCGTG TTCTTGACGG CGCGGTAATG GTTTACTGTG 50 

CAGTTGGTGG TGTTCAGCCA CAGTCTGAAA CCGTATGGCG TCAGGCTAAT 100 

AAATATAAAG TTCCACGTAT TGCGTTCGTT AACAAAATGG ACCGTATGGG 150 

TGCGAACTTC CTGCGCGTAG TTGGTCAACT GAAATCTCGC CTTGGTGCGA 200 

ACCCAGTTCC ACTGCAGTTG GCAATTGGCG CAGAAGAAAA ATTCACCGGT 250 

ATTATCGATC TGGTGAAAAT GAAAGCGATC AACTGGAACG AAGCTGATCA 3 00 

GGGCGTGACC TTCGAATATG AAGAAATCCC TGCTGATATG GCTGAACTGG 350 
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CTGCTGAATG GCACCAGAAT CTGGTTGAAT CTGCGGCAGA AGCGTCTGAC 4 00 

GAGCTGATGG ACAAATACTT GGGTGGCGAA GAGCTGACCG AAGAAGAAAT 4 50 

CAAGAAAGCT TTACGTCAAC GTGTTCTGAA AAGCGAAATT ATTCTTGTTA 500 

CCTGTGGTTC TGCGTTTAAA AACAAAGGCG TACAGGCAAT GCTGGATGCG 550 

GTTATTGAGT ACCTGCCTGC ACCAACTGAC GTTGAATCAA TCAACGGCAT 600 

CTTGGATGAT GGCAAAGATA CTCCGGCTGT TCGTCATTCT GACGACAAAG 650 

AGCCGTTCTC TGCTCTGGCG TTCAAAATCG CTACCGACCC ATTCGTGGGT 700 

AACCTGACGT TCTTCCGCGT GTACTCTGGT ATTGTTAATT CCGGTGATAC 750 

CGTTCTGAAC TCAGTGAAAT CGCAACGTGA ACGCTTAGGT CGTATCGTAC 800 

.AGATGCACGC .TAACAAGCGT GAAGAGATCA AAGAAGTTCA CGCCGGTGAT 850 

ATCGCAGCCG CTATCGGTCT GAAAGATGTG ACTACGGGTG ACACTTTGTG 900 

TGACCCGAAT AATCCGATCA TCTTGGAACG TATGGAGTTC CCAGAGCCGG 950 

TAATCTCTGT TGCTGTTGAA CCAAAAACCA AAGCTGACCA AGAAAAAATG 1000 

GGTATGGCTC TGGGGCGTTT GGCGAAAGAA, GATCCATCAT TCCGCGTTTG 1050 

GACTGACGAA GAATCTGGTC AGACTATCAT CGCTGGTATG GGTGAGTTGC 1100 

ATTTGGATAT CCTGGTTGAC CGTATGCGCC GCGAATTTAA CGTGGAAGCA 1150 

AACGTCGGTA AACCTCAGGT TGCGTACCGT GAAACTATCC GCGAAACCGT 1200 

TAAGGATGTG GAAGGTAAGC ACGCTAAGCA GTCAGGCGGT CGTGGTCAGT 1250 

ATGGTCATGT TGTTATCGAC ATGTCTCCAT TGCCACCGGG TGGTGTTGGG 1300 

TATGAATTCG TCAACGAAAT CGTTGGTGGT TCTATTCCTA AAGAATTCAT 1350 

TCCGGCCGTT GATAAAGGTA TTCAAGAACA GCTGAAATCT GGCCCTCTGG 1400 

CAGGTTACCC AGTTGTTGAC GTTAAAGTGC GTCTGCACTA CGGTTCTTAC 1450 

CATGACGTTG ACTCCTCAGA ATTGGCATTT AAATTAGCTG GTTCTATCGC 1500 

CTTTAAAGAA GGTTTCAAAC GAGCTAAACC AGTTCTGCTT GAGCCAATCA 1550 

TGAAGGTTGA AGTCGAAACC CCTGAAGATT ACATGGGTGA CGTAATGGGC 1600 

GACCTGAACC GTCGTCGCGG TATC 1624 



2) INFORMATION FOR SEQ ID NO: 2233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2233 

GAACGCTCGA TGCGCGTGCT TGACGGTGCT TGCATGGTTT ACTGCGCAGT 50 

GGGCGGTGTT CAGCCACAGT CGGAAACCGT GTGGCGTCAG GCTAACAAGT 100 

ACAAAGTGCC ACGTCTGGCC TTCGTCAACA AGATGGACCG TACCGGCGCC 150 

AACTTCTTCA AGGTGTACGA GCAGATGCGT GCTCGCCTGA AGGCCAACCC 200 

GGTCCTGATC CAGATCCCTA TCGGCGCTGA AGACAACTTC AAAGGCGTGA 250 

TCGATCTGGT CAAGATGAAG GCTATCCTGT GGGACGAAGC GTCGCAAGGC 3 00 

ATGAAATTCG ACTACGTCGA TATTCCTGCA GAGCTGGCTG ATTCGGCCGC 3 50 

CGAGTGGCGC GAAAAGATGG TTGAAGCTGC TGCTGAAGCC ACCGAAGAGC 4 00 

TGATGAACAA GTACCTGGAA GAAGGCGACC TGACCGAAGC CGAGATCAAG 450 

CAGGCGCTGC GTACCCGTAC CATCGCTTCG GAAATCGTTC CGATGATGTG 500 

CGGTACCGCC TTCAAGAACA AGGGCGTACA GGCCATGCTG GACGCGGTCA 550 

TCGAATACCT GCCATCGCCA CTGGACATCG ACGATGTCGG CGGTACGGAC 600 
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GAAGACGACC AGCCAACCAC 
GGCGCTGGCC TTCAAGATCA 
TCTTCCGCGT GTACTCGGGC 
TCGGTCAAAG GTCGTAAAGA 
GAATCAGCGC GAAGAGATCA 
CGGTTGGCCT GAAAGACGTG 
GCCATCATCA CGCTGGAAAA 
GGCAGTCGAG CCAAAAACCA 
TGAACCGCCT GGCACAGGAA 
. _ -GAATCGGGC.C AGACCATCAT 
TATCGTTGAC CGCATGAAGC 
AGCCACAAGT GGCTTACCGC 
GAAGGCAAGT TCGTCAAGCA 
TGTGCTGAAG ATCGAGCCGC 
ACGCCATCAA GGGCGGTACC 
AAAGGCGTGC GCGGCACCCT 
CGTGGACGTC AAGGTCACGG 
CGAACGAAAA CGCGTTCCAG 
TGCCGCAAAG CATCGCCAGT 
GGAAACGCCG GAAGACTACG 
GCCGCGGTAT GGTGCAGGGC 



CCGTCGCGCA GCTGACGACG 
TGACCGACCC GTTCGTCGGT 
GCCGTCAATT CGGGCGACAC 
GCGTCTGGGC CGTATTCTGC 
AAGAAGTGCG CGCCGGCGAC 
ACCACGGGCG AAACCCTGTG 
AATGATCTTC CCTGAGCCTG 
AGGCCGACCA GGAAAAAATG 
GATCCTTCGT TCCGCGTGAA 
CGGTGGWATG GGCGAGCTGC 
GCGAATTCGG CGTGGAAGCA 
GAAACGATCC GTAAAACCTG 
ATCCGGTGGT CGTGGTCAAT 
AAGAACCAGG CAAGGGCTTC 
GTTCCTCGCG AGTACATCCC 
GAACACCGGC GTGCTGGCTG 
TGTTCTTCGG TTCGTACCAC 
ATGGCCGCTT CGATGGCATT 
CATTCTGGAG CCAATGATGG 
CCGGTACCGT GATGGGCGAC 
ATGGACGAAA TCCCAG 



AGAAATTCTC 


650 


CAATTGGCCT 


700 


CGTGTACAAC 


750 


AGATGCACGC 


800 


ATCGCCGCTG 


850 


CGATCCGACC 


900 


TGATTCAACA 

x oil x x ^nQVrf/i 


950 

*S J \J 


GGCCTGGCAC 


1000 

-L U U U 


GACCGATGAA 


1050 

X \J J \J 




lino 

X X \J \J 




1150 

11 Ju 


PGAAGAATPG 


19 00 

x, A \J U 


AGGGTGAPGT 


19 50 

X A -J 


GAGTTGGTTG 

v_7/iVJ x X vvj X X VJ 


1500 


TGCGGTGGAA 


1350 


GTTACCCGGT 

vj X X rt\ — V_, V — \J\J X 


1400 


n a TGTGG Z\ PT 
vjA 1 vj l KjyjtW^. 1 


l" JU 


CAAAGAAGGC 


1500 


CTGTGGAAGT 


1550 


CTGTCGTCCC 


1600 




1S36 



2) INFORMATION FOR SEQ ID NO: 2234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2234 

GGCACGAACG ATCTTGCCGG ACGGGCCATC TTCCATACCT TCGATCAGGC 50 

CACGACGACG GTTCAGGTCA CCGATCACGT CGCCCATATA GTCTTCCGGA 100 

GTCTCGACTT CTACTTTCAT GATCGGTTCG AGCAGAACCG GGTTGGCCTT 150 

CATGAAGCCA GCCTTAAAGG CCATGGAAGC AGCGATCTTG AACGCCAGTT 200 

CGGAAGAGTC GACATCGTGG TAGGAACCGA AGTGCAGACG CACGCCCAGA 250 

TCCATAACCG GATAACCTGC CAGCGGGCCG GACTTCAGTT GCTCGCGGAT 300 

ACCCTTGTCA ACACCCGGGA TGAACTCACC AGGAATGACG CCGCCCTTGA 350 

TGTCGTTGAC GAATTCGTAG GCTTTGCCTT CTTCCAGCGG GTACATGTCG 4 00 

ATCACAACGT GACCGTACTG ACCACGACCA CCGGACTGCT TGGCGTGCTT 450 

ACCTTCGATA TCCTTGACGG TGTTACGAAT GGTTTCACGG TAGGCAACCT 500 

GCGGCTTACC TACGTTCGCT TCTACCTTGA ACTCGCGACG CATACGGTCA 550 

ACGATGATGT CCAGGTGCAG CTCACCCATA CCGGCGATGA TGGTTTGGCC 600 

AGACTCTTCG TCAGTCCATA CGCGGAAGGA CGGGTCTTCC TGAGCCAGAC 650 

GGCCCAGAGC CAGGCCCATC TTCTCTTGGT CAGCCTTGGT TTTCGGCTCA 700 

ACTGCGATGG AGATTACCGG TTCCGGGAAT TCCATACGCT CGAGGATGAT 750 

CGGCGCTTTT TCGTCACACA GGGTGTCACC GGTGGTCACG TCTTTCAGAC 800 

CAATGGCGGC AGCGATGTCG CCTGCGCGAA CTTCTTTGAT CTCTTCACGC 850 
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TTGTTGGCGT GCATCTGAAC 
AACGGAGTTC AGCACGGAGT 
GGAAGAAGGT CAGGTTGCCT 
AGAGCAGCAA ACGGCTCGTC 
GCCATCCAGC TTCAGACCGT 
GATAGTCAAC CACGGCATCC 
GCGGAGCCAC AGGTAACCAG 
AAGAGCTTTC TTGATCTCTT 
TTTCCATCAG GTCTTCAGAC 
CGCATTTCTT GCGCTTTTTC 
GAAGGATACG CCCTGGTCAG 
CCAGGTCGAT AACGCCCTTG 
AGCGGAACCG GGTTACCTTT 
CAGGAAGTTG GCACCGGTAC 
GAACCTTGTA CTTGTTAGCC 
ACGCCACHTA CGGCACAGTA 
GGAACG 



GATACGGCCA AAACGCTCGC 
CACCGGAGTT AACCACACCG 
ACGAACGGGT CGGTAGCAAT 
ATCAGAAGCA TGACGCTCGT 
CGATGGCTGC TACGTCGGTC 
AGCATGGCCT GTACGCCCTT 
GATGATTTCG TTGTTCAGAA 
CCTCGGTCAG TTCTTCACCA 
GCTTCAGCAG CGGCTTCAAC 
CAGCAGCTCA GCCGGGACGT 
CTTCGCTCCA "GTTGATGGCT 
AAGTTCTCTT CTGAACCGAT 
CAGACGGGTC TTGATGTGCT 
GGTCCATCTT, GTTGACGAAC 
TGACGCCATA CGGTTTCAGA 
CACCATCACG GCGCCGTCCA 



GCTTCTCTTT 900 

GAGTAAACGC 950 

CTTGAATGCC 1000 

CTTTGGTCTC 1050 

GGCGCCGGCA 1100 

GTTCTTGAAC 1150 

CACGCTGACG 1200 

CCCAGGTATT 1250 

CAGGGTCATG 1300 

CTTCGTAATC 1350 

TTCATCTTGA 1400 

GTTCAGTTGC 1450 

CAACGCAGCG 1500 

GCGATACGGG 1550 

CTGTGGCTGT 1600 

GAACACGCAT 1650 
1656 



2) INFORMATION FOR SEQ ID NO: 2235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2235 

TCTCCTCCCC ATTTGATAAC TACCAAATGA ACGCTATCGA CTGGTTATGT 50 

CAGTCATAAC CAGTTGATTT TTAAGAGAGT TCTTTGGTAT AATTACAATC 100 

GGTAGATACT GTTATAGAAT CTAACAAAAC TCAATTAATA GGAGGAATCA 150 

TTTAA 155 



2) INFORMATION FOR SEQ ID NO: 2236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2236 

TCTTTCGATT ACTATAAGCC CTAACTAATT CTTAGTTAAA AACCAAGTGC 50 
TCATGGAGCG ACCCTCCATG AGTAGTTAAT AAAGGAAGAT CATC 94 



2) INFORMATION FOR SEQ ID NO: 2237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2237 

GTCCCACCGG GGGTGAGTGG GGGCCGGGCA ATCGGCCCCT GCGAGCCGCC 50 
TCACACACCA GTAATCCCAG TAGGTCTCAT GCCCCATGGG TGTTAAAGTG 100 
ACACCTAGCC GTAGGCTGAG AATTTCTACC CGAGTCCAGG AGGACGAAAA 150 



2) INFORMATION FOR SEQ ID NO: 2238 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

• (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2238 
TTCAGTCCTT TAGGCAAGGA GTTAATTGTC 30 



2) INFORMATION FOR SEQ ID NO: 2239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2239 

CAACTAGCCT CTCGCTTATC TCCCTATGCA AAATTTCTAG TCAGTTACAG 50 

GGCATAGGGG GAGCGTAGGC GGGGGTAGCG GCTTGCTGAG CACTTCCTCT 100 

ACATCAAAGG GAATGTTGAG "CCGGCCGTTA" CCCTGTACGA TCCCATCTGG " 150" 

TTTCTTCGGT GGTTTGATAA ATACCCCGTT GTGACCCTAG GATCATGTAA 200 

CTGGCACAAT GTAAATAGCT GTACTGCCAG GCTGCCGAAT TAGCAGTCAG 250 

AAATGTACAG CACTGTCAAC TCGTGGCTGC , GAAATCGTAG CCACCACGAA 300 

GTCCAGGAGG ACACACA 317 



2) INFORMATION FOR SEQ ID NO: 2240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2240 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCTGAG GGGAGAGCAC 50 
TATAGTAAGG AATATAGCC 69 



2) INFORMATION FOR SEQ ID NO: 2241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2241 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCGAAG GGGAGAGCAC 50 
TATAGTAAGG AATATAGCC 69 
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2) INFORMATION FOR SEQ ID NO: 2242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2242 

AACAACAAAG AAGATTAATT GATTTTTTTC GCAACATCAA GTATAACTTT 50 
AGTTAGAAGT ATTACTTAGT TTAAATTTAA GCTAAGTAAA AAATAATTAT 100 
CGAATTATCG AGGAGGATAT TTTAAA 126 



2) INFORMATION FOR SEQ ID NO: 2243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2243 

GTGTCAACTC ACTGGCTCGG AGCCGAGCAA TCGGCTCAGC GAAGGCGACG 50 

GGTCAGTAGC TACTGGCAGC GGAGTAATCT TGCCGGGTCA TTGGAATGCC 100 

TTGGGCGCGG CACAACTGAA AACACCAACA CTGCTTTAAC AAGCACCAAC 150 

TAGTCCAGGA GGACACAGAA 170 



2) INFORMATION FOR SEQ ID NO: 2244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2244 



GGCCGGCAAG CCTGCGAGTA AGCTGACGCG GTTAGCACCG CGGCAAAACC 
AAGAAAAATC AACACTGCTT TTTTAAGCAC CAACAGTCCA GGAGGACAAC 



50 
100 
103 



AAA 



2) INFORMATION FOR SEQ ID NO: 2245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2245 

GCCGAGCGTT GCGCGTAAGC TAGCTCGGTT ACCACGGCGG CAAAACTAGA 50 
AAAACATCAA CACTGCTTTT ATAAGCACCA ACAGTCCAGG AGGACACAGA 100 
A 101 



2) INFORMATION FOR SEQ ID NO: 2246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2246 

CCAGCTTCCC GCCCGTGCCG CTACGGTGGC ACAAACCCAA AAAGATCAAT 50 
CCTGCTGTAA CCCAGCACCA ACAAGTCCAG GAGGACAAGA A 91 



2) INFORMATION FOR SEQ ID NO: 2247 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

..(vi). ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysacch'area 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2247 

TCAGGCAAAT AGGCCGTCTG AAAGGCTGAA ATGATTTTTC AGACGGCATT 50 
GTTCTTTAAT CGATCTTTAA TGTAAAGGAA TTAGCTC 87 



2) INFORMATION FOR SEQ ID NO: 2248 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2248 

TATAACTTGT TAAGACTAGC TATGCTAGGT TAAAATACAG GTTGAGCTTA 50 

TTTATAAGCT GACATTTTTA TGATTTGATT TTTAGGGGTA AATGCATTAT 100 

AAAAGAATTA TAAATTCTTT TATGCTACAC TCAATCAATT TTCTTCTCAT 150 

GATGGTGAGA AACTATCATG AGAGATAAAT TTGAAATAAC TTTTATTAAG 200 

AATAGGAGAG ATTTAATA 218 



2) INFORMATION FOR SEQ ID NO: 2249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2249 



TTAAACTTGT TTTAGCTAGA ATTTCTAGGT AAAATACAGC GTAAGCTTAT 
TAATTAAGCT AACATCTTTA TGAATTGATT TTTTACTGAA AATGCATTAT 
AAATGAATTA TGAATTCTAA CAATCATTAT GTCTCATGAT GGTGAGAAAC 
TATCATGAGA GATAATATTG AAATAACTTT TACTAGAATA GGAGAGATTT 
AATA 



50 
100 
150 
200 
204 



2) INFORMATION FOR SEQ ID NO: 2250 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2250 
CCGTCGGATG GTGTCGTATA CCGCGGAGTC GCCGACGG 38 



2 ) INFORMATION FOR SEQ ID NO: 2251 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2251 
CGGAGCCGTT CTCGCTGCGT TACATGCTGG TGGCTCCG 38 



2) INFORMATION FOR SEQ ID NO: 2252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2252 
CCACATACAG TGTCTCTC 18 
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2) INFORMATION FOR SEQ ID NO: 2253 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2253 
CATTACCCAA CCGAAAGTA 19 



2) INFORMATION FOR SEQ ID NO: 2254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2254 
ACCTGAACAG AGAGAAATG 19 



2) INFORMATION FOR SEQ ID NO: 2255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2255 

ATGGCAAAAG AAAAATTTGA CCGTTCAAAA CCACACGTTA ACATTGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC ATTAACTGCT GCTATCACAA 100 

CTGTTTTAGC TAAGAAAGGT TTCGCGCAAG CTCAAGATTA CGGTTCAATC 150 

GATAAAGCTC CAGAAGAACG CGAACGTGGT ATCACAATCA ACACTTCTCA 200 

CGTTGAGTAC GAAACAGACA CTCGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCGGA CTACGTTAAA AAC 273 
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2) INFORMATION FOR SEQ ID NO: 2256 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY; Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2256 

ATGGCTAAAG CCAAGTTTGA ACGTAATAAA CCACACGTAA ACGTGGGTAC 50 

AATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCGATTGCAA 100 

CAATTTGTGC AAAAACTTAC GGCGGTGAAG CGAAAGATTA CTCACAAATC 150 

GACTCAGCAC CTGAAGAAAA AGCACGTGGT ATTACAATTA ATACATCACA 200 

CGTAGAATAC GATTCTCCAA CTCGTCACTA CGCACACGTT GACTGCCCAG 250 

GCCACGCCGA CTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2257 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2257 

GTGGCGAAGG CCAAGTTTGA GCGCACCAAG CCGCACGTCA ACATCGGCAC 50 

GATTGGTCAC GTTGACCACG GCAAGACGAC GCTGACGGCA GCTATCACCA 100 

AGGTGCTGCA TGACAAGTAC CCCGAACTGA ACGAGTTCAC CCCCTTCGAT 150 

CAGGTCGACA ACGCTCCCGA GGAGCGCGAT CGTGGCATCA CGATCAACGT 200 

CTCTCACGTT GAGTACCAGA CCGAGGCGCG TCACTACGCG CACGTTGACG 250 

CTCCCGGCCA CGCCGACTAC GTCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2258 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 273 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ'lD NO: 2258 



GTGGCTAAAG AAAAATTTGA TCGTTCCCTA 
TATCGGTCAC GTTGACCATG GTAAAACCAC 
GCGTTTGCTC CGAAGTATTC GGTTCCGCAA 
GACAGCGCAC CAGAAGAAAA AGCTCGTGGT 
CGTTGAATAC AACTCGCTGA TCCGTCACTA 
GTCACGCTGA CTATGTGAAG AAC 



CCGCACGTCA ACGTTGGCAC 50 
TCTGACTGCT GCTCTGACTC 100 
TCGTTGATTT CGATAAAATC 150 
ATCACCATCA ACACCGCGCA 200 
CGCTCACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2259 



GTGGCAAAGG CTAAGTTCGA GCGTACCAAG 
CATCGGTCAC GTTGACCACG GTAAGACCAC 
AGGTTTTGGC AGACGCTTAC CCAGAGCTGA 
GCCATCGATA AGGCACCGGA AGAGAAAGAG 
CTCCCACGTG GAGTACCAGA CCGAGAAGCG 
CTCCAGGTCA CGCTGACTAC ATCAAGAAC 



CCGCACGTCA ACATCGGCAC 50 
CACCACCGCT GCTATCACCA 100 
ACGAAGCTTT CGCTTTCGAT 150 
CGTGGTATTA CCATCAACAT 200 
CCACTACGCA CACGTTGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN : ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2260 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GTCACGCCGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2261 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCCATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCCG CTCGCGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 250 

GCCACGCCGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2262 

ATGGCAAAAG AAAAATTTGA CCGCTCTAAA CCCCATGTTA ACATTGGTAC 50 
TATTGGACAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCAATTACAA 100 
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CACAAGCTTA TGACCAAATT 150 
ATCACAATCT CTACTGCTCA 200 
TGCACACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2263 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases " 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2263 

GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCCG CGCGTTCGAC 150 

CAGATCGACA ACGCGCCCGA GGAGCGTCAG CGCGGTATCA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGACAAGCG GCACTACGCT CACGTCGACG 250 

CCCCGGGTCA CGCCGACTAC ATCAAGAAC 279 

2) INFORMATION FOR SEQ ID NO: 2264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2264 



GTGGCGAAGG CGAAGTTCCA GCGGACCAAG 
CATCGGTCAC GTTGACCACG GCAAGACCAC 
AGGTCCTGCA CGACAAGTAC CCGGACCTGA 
CAGATCGACA ACGCGCCTGA GGAGCGTCAG 
CGCGCACGTG GAATACCAGA CCGAGAAGCG 
CCCCCGGCCA CGCCGACTAC ATCAAGAAC 



CCGCACGTCA ACATCGGGAC 50 
TCTGACCGCG GCTATCACCA 100 
ACGAGTCCAA GGCGTTCGAC 150 
CGCGGTATCA CGATCAACAT 200 
TCACTACGCG CACGTCGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2265 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 
'"(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2265 

GTGGCGAAGG CGAAGTTCCA GCGGACCAAG CCCCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC CCTGACCGCG GCTATCACCA 100 

AGGTCCTGCA TGACAAGTTC CCGGACCTGA ACGAGTCGAA GGCGTTCGAC 150 

CAGATCGACA ACGCTCCTGA GGAGCGCCAG CGCGGTATCA CGATCAACAT 200 

CGCGCACGTG GAGTACCAGA CCGAGAAGCG GCACTATGCA CACGTCGACG 250 

CGCCGGGCCA CGCCGACTAC ATCAAGAAC .279 



2) INFORMATION FOR SEQ ID NO: 2266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2266 

GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTCA ACGAGTCGCG TGCGTTCGAC 150 

CAGATCGACA ACGCTCCCGA AGAGCGTCAG CGCGGTATCA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGAGAAGCG GCACTACGCC CACGTCGACG 250 

CTCCTGGTCA CGCTGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2267 

ATGGCTAAGG AAAAATTCGA ACGTAGCAAA CCGCACGTAA ACGTTGGCAC 50 

CATCGGTCAC GTTGACCATG GTAAAACCAC TCTGACTGCT GCTTTGACTA 100 

CTATTTTGGC TAAAAAATTC GGCGGTGCTG CAAAAGCTTA CGACCAAATC 150 

GACAACGCAC CCGAAGAAAA AGCACGCGGT ATTACCATTA ACACCTCGCA 200 

CGTAGAATAC GAAACCGAAA CCCGCCACTA CGCACACGTA GACTGCCCGG 250 

GTCACGCCGA CTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2268 

ATGGCAAAAG AAAAATTTGA TCGCTCAAAA GAACATGCCA ATATTGGTAC 50 

TATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCTATCGCAA 100 

CTGTATTAGC TAAAAATGGT GACACTGTTG CACAATCATA CGATATGATT 150 

GACAACGCTC CAGAAGAAAA AGAACGTGGT ATTACAATCA ATACTGCACA 200 

TATCGAATAC CAAACTGACA AACGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCTGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2269 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2269 
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ATGGCAAAAG AAAAATTTGA 
TATCGGTCAC GTTGACCATG 
CTGTATTAGC TAAAAATGGT 
GACAACGCTC CAGAAGAAAA 
CATCGAGTAT CAAACTGACA 
GACACGCTGA CTATGTTAAA 



TCGCTCAAAA GAACATGCCA ATATTGGTAC 
GTAAAACTAC TTTAACAGCT GCTATCGCAA 
GACACTGTAG CACAATCATA TGACATGATT 
AGAACGTGGT ATCACAATCA ATACTGCACA 
AACGTCACTA TGCTCACGTT GACTGCCCAG 
AAC 



50 
100 
150 
200 
250 
273 



2) INFORMATION FOR SEQ ID NO: 2270 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2270 

CGATCCTGGT AGTAGCAGCG ACTGACGGCC CGATGCCGCA GACTCGTGAG 50 

CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCAT 100 

GAACAAGTGT GACATGGTAG ATGACGAAGA GCTGCTGGAA CTGGTCGAGA 150 

TGGAAGTTCG CGAACTGCTG TCCGAGTACG ACTTCCCGGG TGATGACCTG 200 

CCGGTAGTCC GTGGTTCYGC ACTGAAAGCG CTGGAAGGCG AAGCTCAGTG 250 

GGAAGAGAAG ATCCTGGAAC TGGCTGGCCA CCTGGACACC TACATTCCGG 300 

AGCCGGAGCG TGCCATCGAC CTGCCGTTCC TGATGCCTAT CGAAGACGTA 350 

TTCTCCATCG CTGGCCGYGG TACCGTAGTG ACCGGTCGTG TAGAGCGCGG 400 

TATCGTCAAA GTTGGTGAAG AAGTGGAAAT CGTKGGTATC AAAGATACCA 450 

CCAAGACCAC CTGTACCGGC GTTGAAATGT TCCGCAAACT GCTGGACGAA 500 

GGTCGTGCAG GCGAGAACAT CGGTGCACTG CTGCGTGGCG TGAAGCGTGA 550 

AGACGTAGAG CGTGGTCAGG TACTGGCCAA GCCGGGCACC ATCAAGCCGC 600 

ACACCAAGTT YGAATCTGAA GTGTACGTGC TGTCCAAAGA AGAAGGTGGT 650 

CGTCATACCC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 700 

TACCGACGTG ACCGGTACCA TCGAACTGCC GGAAGGCGTA GAGATGGTAA 750 

TGCCGGGCGA CAACATCAAG ATGGTTGTTA CCCTGATTGC GCCGATCGCG 800 

ATGGACGACG GC 812 



2) INFORMATION FOR SEQ ID NO: 2271 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bilophila wadsworthia 
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(B) STRAIN: ATCC 4 9260 

(xi) SEQUENCE DESCRIPTION: SEQ 

CGACGGTCCC ATGCCCCAGA CCCGTGAGCA 
TCGGCGTGCC TCACCTCGTC GTGTTCATGA 
GACCCCGAAC TGCTCGAACT CGTCGAAATG 
CTCCTACGGC TACCCCGGCG ATGAAATCCC 
TGAAGGCTCT GGAATCCGAT AGCGCTGATT 
CTCGAACTGC TCGCCGCTTG CGACAGCTAC 
AACCGACAAG CCCTTCCTGA TGCCCATCGA 
GCCGCGGTAC CGTGGTCACC GGTCGTGTGG 
GGCGAAGAAG TCGAAATCGT GGGTATCCGT 
CACCGGCGTC GAAATGTTCC GCAAGCTGCT 
ACAACATCGG CGCTCTGCTC CGCGGCACGA 
GGCCAGGTTC TCGCCGCTCC CAAGAGCATC 
GGCTGAAGTG TACGTTCTGT CCAAGGAAGA 
TCTTCACCGG CTATCGTCCT CAGTTCTACT 
GGTATCATCG CTCTTGAAGA AGGCGTTGAA 
CGCTACCTTT AATGTCGAGC TCATTCACCC 



ID NO: 2271 



CATCCTGCTC 


GCCCGTCAGG 


50 


ACAAGTGTGA 


CCTCGTCGAC 


100 


GAAGTCCGCG 


AACTGCTGAG 


150 


GGTTGTCCGC 


GGTTCCGCTC ' 


200 


CCCCTGACGC 


CCAGTGCGTG 


250 


TTCCCGGATC 


CGGTCCGCGA 


300 


AGACGTGTTC 


TCCATCTCCG" 


"350 " 


AACGTGGCAT 


CATCAAGGTC 


400 


CCCACCGTGA 


AGACGACCTG 


450 


CGATCAGGGC 


CAGGCCGGCG 


500 


AGCGTGACGA 


AGTGGAACGC 


550 


ACGCCCCACA 


AGAAGTTCAA 


600 


AGGCGGCCGC 


CATACCCCGT 


650 


TCCGTACCAC 


CGACATCACC 


700 


ATGGTTATGC 


CCGGCGATAA 


750 


CATCGCCATG 


GAAAAGGGC 


799 



2) INFORMATION FOR SEQ ID NO: 2272 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevundimonas diminuta 

(B) STRAIN: ATCC 11568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2272 

GATCCTGGTG TGCTCGGCCG CTGACGGCCC GATGCCGCAG ACCCGCGAGC 50 

ACATCCTGCT GTCGCGTCAG GTCGGCGTTC CGGCGCTGGT GGTGTTCCTG 100 

AACAAGGTCG ACATGGTCGA CGACGAGGAR CTGCTGGAGC TGGTCGAGAT 150 

GGAAGTGCGC GAGCTGCTGA GCTCGTACCA GTTCCCGGGC GACGACATCC 2 00 

CGGTGGTCAA GGGCTCGGCC CTGGCCGCGG TGGAGGATCG CGACCCGCAC 250 

ATCGGCGCCG AGCGCGTTCT GGAGCTGATG GCGGCGGTCG ACAGCTACAT 3 00 

CCCGCAGCCG GAACGTCCGA TCGACATGCC GTTCCTGATG CCGGTGGAAG 350 

ACGTGTTCTC GATCTCGGGC CGCGGCACCG TGGTGACGGG TCGCGTCGAG 4 00 

CGCGGCGTCG TCAAGGTCGG TGAAGAAGTC GAAATCGTCG GCATCCGTCC 450 

GGTTCAGAAG ACGACCTGCA CGGGCGTCGA AATGTTCCGC AAGCTGCTGG 500 

AYCAGGGTCA RGCCGGCGAC AACGTGGGCG TGCTGCTGCG CGGCACCAAG 550 

CGTGAAGACG TCGAGCGCGG CCAGGTGCTG TGCAAGCCGG GTTCGATCAC 6 00 

CCCGCACACC AAGTTCGTGG CTGAAGCCTA CATCCTGAAC AAGGAAGAAG 650 

GCGGCCGTCA CACGCCGTTC TTCACGAACT ACCGTCCGCA GTTCTACTTC 7 00 

CGCACGACGG ACGTGACCGG CATCGTGCGC CTGAAGGAAG GCGTCGAGAT 750 

GATCATGCCG GGCGACAACG CCGAGCTGGA CGTCGA 786 
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2) INFORMATION FOR SEQ ID NO: 2273 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2273 



GCTATTATGG CTGGATTACT ATCCAATACG CTATCCAAGA ATCCCGTAAC 50 

GTACCTGCCG TCAAATCGCT GGAAGCAGTC GGATTAGATA ATTCATTGAA 100 

ATTCCTCAAT GGCCTTGGTA TCAATTATCC TGAGATGCAT TATTCTAATG 150 

CGATTTCAAG TAATACAAGC GAATCTGGTA ACCAATACGG AGCAAGTAGC 200 

GAAAAAATGG CTGCCGCTTA CGCTGCCTTT GCTAATGGCG GTACATATTA 250 

CAAACCGCAA TACGTCAACC GAGTTGTCTT TAGCGACGGT ACAGAAAAAG 300 

TCTTTTCAAA TGGCGGATCA AAAGCCATGA AAGAGACGAC AGCCTACATG 350 

ATGACAGACA TGATGAAGAC CGTTCTTCAG TCTGGAACTG GTACCAATGC 400 

TGCAATCCCA GGAGTTTATC AAGCAGGTAA AACTGGTACT TCCAACTATG 450 

CAGATGATGA GCTAGAGAAG TTGACAAAAC CTTATTACAG TTCTAGCATT 500 

GTCACACCAG ACGAACTATT TGTTGGCTAT ACTCCACAAT ACTCTATGGC 550 

TGTTTGGACA 560 



2) INFORMATION FOR SEQ ID NO: 2274 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2274 



GCTACTTTGG AAACATTACT GTCCAATATG CTCTTCAACA ATCACGTAAT 50 

GTCACAGCCG TTGAAACTTT GAATAAGGTC GGTCTAGATA AGGCTAAAGC 100 

CTTCCTTAAT GGGCTTGGTA TTGATTATCC AAGCATGCAT TATGCAAACG 150 

CCATTTCAAG TAATACAACT GAATCCAACA AAAAATACGG TGCAAGTAGT 200 

GAAAAAATGG CTGCTGCCTA CGCTGCTTTT GCTAATGGTG GTATTTACCA 250 

CAAGCCAATG TACATCAATA AAATCGTCTT TAGCGACGGT AGTGAGAAAG 300 

AATTTTCTGA TGCCGGCACA CGAGCTATGA AAGAAACTAC TGCCTATATG 350 

ATGACTGAAA TGATGAAAAC AGTCCTAGTA TACGGTACCG GACGTGGAGC 400 

CTACCTACCA TGGCTTCCAC AAGCAGGTAA GACAGGTACT TCTAACTATA 450 
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ACACTGGCTA CGTAGCCCCA 500 
TATGCAATGG CTGTTTGGAC 550 

551 



2) INFORMATION FOR SEQ ID NO: 2275 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) " TYPE: Nucleic acid" - - - 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear. 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2275 

GCTATTATGG CTGGATTACT ATCCAATACG CTATCCAAGA ATCCCGTAAC 50 

GTACCAGCCG TCAAATCGCT GGAAGCAGTC GGATTAGATA ATTCATTGAA 100 

GTTCCTCAAT GGCCTTGGTA TTAATTACCC TGAAATGCAT TATTCTAATG 150 

CGATTTCAAG TAATACAAGC GAATCTGGTA ACCAATACGG AGCAAGTAGC 200 

GAAAAAATGG CTGCCGCTTA CGCTGCCTTT GCTAATGGCG GTACATATTA 250 

CAAACCGCAA TACGTCAACC GAGTTGTCTT TAGCGACGGT ACAGAAAAAG 300 

TCTTTTCAAA TGGCGGATCA AAAGCCATGA AAGAAACGAC AGCCTACATG 350 

ATGACAGACA TGATGAAGAC CGTTCTTCAA TCTGGAACTG GTACCAATGC 400 

TGCAATTCCA GGAGTCTATC AAGCAGGTAA AACCGGCACT TCCAACTATG 450 

CAGATGATGA ACTAGAGAAG TTGACAAAAC CTTATTACAG TTCTAGCATT 500 

GTCACACCAG ACGAGCTGTT TGTTGGCTAC ACTCCACAGT ACTCTATGGC 550 

TGTTTGGACA 560 



2) INFORMATION FOR SEQ ID NO: 2276 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2276 

TTATTTTGGA AATATCACCA TCCAATATGC GCTCCAACAA TCACGGAACG 50 

TTACAGCCGT AGAAACCTTG AACAAAGTCG GTTTGGATAG AGCCAAGACC 100 

TTCCTGAATG GAATCGGTAT TGACTATCCA GATATGCACT ATGCCAACGC 150 

GATTTCAAGT AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG 200 
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AGAAAATGGC TGCTGCTTAC GCTGCTTTTG CTAACGGTGG TATCTACCAT 250 

AAACCAATGT ATATCAACAA AATCGTCTTT AGCGATGGTA GCTCAAAAGA 300 

ATACGCTGAT CCTGGTACTC GTGCCATGAA AGAGACGACC GCCTATATGA 350 

TGACAGAAAT GATGAAGACT GTCTTGGCAT ACGGAACGGG TCGTGGTGCT 400 

TATCTCCCTT GGCTACCTCA AGCTGGTAAG ACTGGTACAT CAAACTATAC 450 

AGATGATGAA ATTGAAAACT ACATCAAAAA TACTGGTTAT GTAGCCCCAG 500 

ACGAAATGTT TGTTGGTTAT ACTCGCAAAT ATTCAATGGC TGTWTGGACA 550 



2) INFORMATION FOR SEQ ID NO: 2277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2277 

GCTCCTATCT GGATTATGCG ATGTCGGTCA TTGTTGGCCG TGCGCTGCCA 50 

GATGTCCGAG ATGGCCTGAA GCCGGTACAC CGTCGCGTAC TTTACGCCAT 100 

GAACGTACTA GGCAATGACT GGAACAAAGC CTATAAAAAA TCTGCCCGTG 150 

TCGTTGGTGA CGTAATCGGT AAATACCATC CCCATGGTGA CTCGGCGGTC 200 

TATGACACGA TCGTCCGCAT GGCGCAGCCA TTCTCGCTGC GTTATATGCT 250 

GGTAGACGGT CAGGGTAACT TCGGTTCTAT CGACGGCGAC TCTGCGGCGG 300 

CAATGCGTTA TACGGAAATC CGTCTGGCGA AAATTGCCCA TGAACTGATG 350 

GCCGAT 356 



2) INFORMATION FOR SEQ ID NO: 2278 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2278 

CTCTTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGC GCGCTGCCGG . 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 100 

AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT CAGCCCGTGT 150 

CGTTGGTGAC GTGATCGGTA AATACCACCC GCACGGCGAC TCCGCGGTAT 200 
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ATGACACCAT CGTTCGTATG GCCCAGCCGT TCTCGCTGCG CTACATGCTG 250 
GTGGATGGCC AGGGGAACTT CGGTTCAATC GACGGCGACT CCGCCGCGGC 300 
AATGCGTTAT ACGGAAATCC GTCTGGCGAA AATTGCTCAC GAACTGA 347 



2) INFORMATION FOR SEQ ID NO: 2279 

(i) SEQUENCE CHARACTERISTICS: 
. . (A) LENGTH: 362 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear, 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia 

(B) STRAIN: ATCC 43886 

(xi) SEQUENCE DESCRIPTION: SEQ 

AGAGCTCCTA TCTGGATTAT GCGATGTCGG 
CCAGATGTCC GAGATGGCCT GAAGCCGGTA 
CATGAACGTA CTAGGCAATG ACTGGAACAA 
GTGTCGTTGG TGACGTAATC GGTAAATACC 
GTCTATGACA CGATCGTCCG CATGGCGCAG 
GCTGGTAGAC GGTCAGGGTA ACTTCGGTTC 
CGGCAATGCG TTATACGGAA ATCCGTCTGG 
ATGGCCGATC TC 



coli 



ID NO: 2279 

TCATTGTTGG CCGTGCGCTG 50 

CACCGTCGCG TACTTTACGC 100 

AGCCTATAAA AAATCTGCCC 150 

ATCCCCATGG TGACTCGGCG 200 

CCATTCTCGC TGCGTTATAT 250 

TATCGACGGC GACTCTGCGG 3 00 

CGAAAATTGC CCATGAACTG 350 

362 



2) INFORMATION FOR SEQ ID NO: 2280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2280 

CTCCTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGT GCGCTGCCAG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 100 

AACGTACTAG GCAATGACTG GAACAAAGCC TATAAAAAAT CTGCCCGTGT 150 

CGTTGGTGAC GTAATCGGTA AATACCATCC CCATGGTGAC TCGGCGGTTT 200 

ATGACACGAT CGTCCGTATG GCGCAGCCAT TCTCGCTGCG TTACATGCTG 250 

GTAGACGGTC AGGGTAACTT CGGTTCCATC GACGGCGACT CTGCGGCGGC 300 

AATGCGTTAT ACGGAAATCC GTCTGGCGAA AATTGCCCAT GAACTGATGG 350 

CCGATCTC 358 
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2) INFORMATION FOR SEQ ID NO: 2281 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2281 
CCCCCAGCTG GGCGGCGGTA TCGATGGGGG 30 



2) INFORMATION FOR SEQ ID NO: 2282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2282 
AGRRGCIMAR ATGTATGA 18 
2) INFORMATION FOR SEQ ID NO: 2283 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2283 
ATITATGAYG GKITTCAGAG GC 22 



2) INFORMATION FOR SEQ ID NO: 2284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

1170 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2284 
TCTGWGTRAC IGGYTCKGAG A 



2) INFORMATION FOR SEQ ID NO: 2285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2285 
CMCCICCWGG TGGWGAWAC 



2) INFORMATION FOR SEQ ID NO: 2286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2286 
AGTTGCTGTA TTAGGAAATG 



2) INFORMATION FOR SEQ ID NO: 2287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2287 
TCGAAGTTGC TGTATTAGGA 



2) INFORMATION FOR SEQ ID NO: 2288 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecivm 

(B) STRAIN: BM4339 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2288 

TAGAAGCTGG CTCGTTTTTT TATAAATAAG TTATTCGTTT ATTTTTGTTT 50 

TGTGCTAAAA TATGAGAGTA AATCACTGAA CGATTTAGAA TACAGGAGGA 100 

CAATCTTTTG AAGATTACTT TACTATATGG CGGACGCAGC GCAGAGCAGA 150 

GCATGAAGTG TCCATTCTTT CCGCATTTTC AGTTTTAAAT GCCATTTATT 200 

ATAATTATTA CCAAGTTCAA CTCGTATTTA TTACAAAAGA AGGACAATGG 250 

GTCAAAGGTC CATTACTAAC AGAAAAACCT GCCAGCAAAG ATGTCTTGCA 300 

TCTTTCATGG GACCCAAGTG GACAGACAGA GGAAGGCTTT ACAGGAAAAG 350 

TGATCAATCC GGGCGAAATC AAAGAAGAAG GAGCCATCGT TTTTCCAGTT 4 00 

TTACATGGGC CAAACGGGGA AGATGGAACG ATCCAAGGCT TCTTAGAGAC 450 

ATTGAATATG CCTTATGTCG GCGCAGGCGT ATTGACCAGT GCATGTGCCA 500 

TGGATAAAAT CATGACCAAG TATATTTTAC AAGCTGCTGG TGTGCCGCAA 550 

GTTCCTTATG TACCAGTACT TAAGAATCAA TGGAAAGAAA ATCCTAAAAA 600 

AGTATTTGAT CAATGTGAAG GTTCTTTGCT TTATCCGATG TTTGTCAAAC 650 

CGGCGAATAT GGGTTCTAGT GTCGGCATTA CAAAAGCAGA AAACCGAGAA 700 

GAGCTGCAAA ATGCTTTAGC AACAGCCTAT CAGTATGATT CTCGAGCAAT 750 

CGTTGAACAA GGAATTGAAG CGCGCGAAAT CGAAGTTGCT GTATTAGGAA 800 

ATGAAGACGT TCGGACGACT TTGCCTGGTG AAGTCGTAAA AGACGTAGCA 850 

TTCTATGATT ATGAAGCAAA ATATATCAAT AATAAAATCG AAATGCAGAT 900 

TCCAGCCGAA GTGCCAGAAG AAGTTTATCA AAAAGCGCAA GAGTACGCGA 950 

AGTTAGCTTA CACGATGTTA GGTGGAAGCG GATTGAGCCG GTGCGATTTC 1000 

TTTTTGACAA ATAAAAATGA ATTATTCCTG AATGAATTAA ACTCTATGCC 1050 

AGGATTTACG GAGTTCAGTA TGTACCCACT CTTATGGGAA AATATGGGCT 1100 

TGAAATACGG TGATTTGATT GAAGAACTGA TCCAGTTAGG AATGAATCGA 1150 

TACCATCAGC GTCAATCTTT TTTTGAAAAA AATGAATAAA GAGAAATAAA 1200 

GAAGAGGCTG GAGTGATTGC GTAACCGCGT TCATTCTAGC 1240 



2) INFORMATION FOR SEQ ID NO: 2289 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2289 
CACCGAAGAA GATGAAAAAA 
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2) INFORMATION FOR SEQ ID NO: 2290 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ , ID NO: 2290 
TGGCACCGAA GAAGATGA 



2) INFORMATION FOR SEQ ID NO: 2291 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2291 
ATTTTGGCAC CGAAGAAGA 
2) INFORMATION FOR SEQ ID NO: 2292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2292 
GAATCGGCAA GACAATATG 



2) INFORMATION FOR SEQ ID NO: 2293 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2293 

ATGAATAGAA TAAAAGTTGC AATACTGTTT GGGGGTTGCT CAGAGGAGCA 5 0 

TGACGTATCG GTAAAATCTG CAATAGAGAT AGCCGCTAAC ATTAATAAAG 100 

AAAAATACGA GCCGTTATAC ATTGGAATTA CGAAATCTGG TGTATGGAAA 150 

ATGTGCGAAA AACCTTGCGC GGAATGGGAA AACGACAATT GCTATTCAGC 200 

TGTACTCTCG CCGGATAAAA AAATGCACGG ATTACTTGTT AAAAAGAACC 250 

ATGAATATGA AATCAACCAT GTTGATGTAG CATTTTCAGC TTTGCATGGC 300 

AAGTCAGGTG AAGATGGATC CATACAAGGT CTGTTTGAAT TGTCCGGTAT 350 

CCCTTTTGTA GGCTGCGATA TTCAAAGCTC AGCAATTTGT ATGGACAAAT 400 

CGTTGACATA CATCGTTGCG AAAAATGCTG GGATAGCTAC TCCCGCCTTT 450 

TGGGTTATTA ATAAAGATGA TAGGCCGGTG GCAGCTACGT TTACCTATCC 500 

TGTTTTTGTT AAGCCGGCGC GTTCAGGCTC ATCCTTCGGT GTGAAAAAAG 550 

TCAATAGCGC GGACGAATTG GACTACGCAA TTGAATCGGC AAGACAATAT 600 

GACAGCAAAA TCTTAATTGA GCAGGCTGTT TCGGGCTGTG AGGTCGGTTG 650 

TGCGGTATTG GGAAACAGTG CCGCGTTAGT TGTTGGCGAG GTGGACCAAA 700 

TCAGGCTGCA GTACGGAATC TTTCGTATTC ATCAGGAAGT CGAGCCGGAA 750 

AAAGGCTCTG AAAACGCAGT TATAACCGTT CCCGCAGACC TTTCAGCAGA 800 

GGAGCGAGGA CGGATACAGG AAACGGCAAA AAAAATATAT AAAGCGCTCG 850 

GCTGTAGAGG TCTAGCCCGT GTGGATATGT TTTTACAAGA TAACGGCCGC 900 

ATTGTACTGA ACGAAGTCAA TACTCTGCCC GGTTTCACGT CATACAGTCG 950 

TTATCCCCGT ATGATGGCCG CTGCAGGTAT TGCACTTCCC GAACTGATTG 1000 

ACCGCTTGAT CGTATTAGCG TTAAAGGGGT GA 1032 



2) INFORMATION FOR SEQ ID NO: 2294 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2294 
AAACGAGGAT GATTTGATTG 



2 ) INFORMATION FOR SEQ ID NO: 2295 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2295 
TTGAGCAAGC GATTTCGG 



2) INFORMATION FOR SEQ ID NO: 2296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 bases ' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2296 

ATGAATAAAA TAAAAGTCGC AATTATCTTC GGCGGTTGCT CGGAGGAACA 50 

TGATGTGTCG GTAAAATCCG CAATAGAAAT TGCTGCGAAC ATTAATACTG 100 

AAAAATTCGA TCCGCACTAC ATCGGAATTA CAAAAAACGG CGTATGGAAG 150 

CTATGCAAGA AGCCATGTAC GGAATGGGAA GCCGATAGTC TCCCCGCCAT 200 

ATTCTCCCCG GATAGGAAAA CGCATGGTCT GCTTGTCATG AAAGAAAGAG 250 

AATACGAAAC TCGGCGTATT GACGTGGCTT TCCCGGTTTT GCATGGCAAA 300 

TGCGGGGAGG ATGGTGCGAT ACAGGGTCTG TTTGAATTGT CTGGTATCCC 350 

CTATGTAGGC TGCGATATTC AAAGCTCCGC AGCTTGCATG GACAAATCAC 400 

TGGCCTACAT TCTTACAAAA AATGCGGGCA TCGCCGTCCC CGAATTTCAA 450 

ATGATTGAAA AAGGTGACAA ACCGGAGGCG AGGACGCTTA CCTACCCTGT 500 

CTTTGTGAAG CCGGCACGGT CAGGTTCGTC CTTTGGCGTA ACCAAAGTAA 550 

ACAGTACGGA AGAACTAAAC GCTGCGATAG AAGCAGCAGG ACAATATGAT 600 

GGAAAAATCT TAATTGAGCA AGCGATTTCG GGCTGTGAGG TCGGCTGCGC 650 

GGTCATGGGA AACGAGGATG ATTTGATTGT CGGCGAAGTG GATCAAATCC 700 

GGTTGAGCCA CGGTATCTTC CGCATCCATC AGGAAAACGA GCCGGAAAAA 750 

GGCTCAGAGA ATGCGATGAT TATCGTTCCA GCAGACATTC CGGTCGAGGA 800 

ACGAAATCGG GTGCAAGAAA CGGCAAAGAA AGTATATCGG GTGCTTGGAT 850 

GCAGAGGGCT TGCTCGTGTT GATCTTTTTT TGCAGGAGGA TGGCGGCATC 900 

GTTCTAAACG AGGTCAATAC CCTGCCCGGT TTTACATCGT ACAGCCGCTA 950 

TCCACGCATG GCGGCTGCCG CAGGAATCAC GCTTCCCGCA CTAATTGACA 1000 

GCCTGATTAC ATTGGCGATA GAGAGGTGA 1029 



2) INFORMATION FOR SEQ ID NO: 2297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2297 

TTCAGGAGGG GGATCGC 
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